Skip to content

Commit

Permalink
Update word-embedding-dataset.md
Browse files Browse the repository at this point in the history
  • Loading branch information
astonzhang committed Nov 10, 2022
1 parent 42e6412 commit 62fe9c5
Showing 1 changed file with 1 addition and 1 deletion.
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,7 @@ import os
import random
```

## 正在读取数据集
## 读取数据集

我们在这里使用的数据集是[Penn Tree Bank(PTB)](https://catalog.ldc.upenn.edu/LDC99T42)。该语料库取自“华尔街日报”的文章,分为训练集、验证集和测试集。在原始格式中,文本文件的每一行表示由空格分隔的一句话。在这里,我们将每个单词视为一个词元。

Expand Down

0 comments on commit 62fe9c5

Please sign in to comment.