1
0
Эх сурвалжийг харах

Add links to the evaluation datasets

Summary: Add links to the evaluation datasets

Differential Revision: D7642373

fbshipit-source-id: 9ff6794703d48cdb95c9b48e85244da8055a0077
Edouard Grave 8 жил өмнө
parent
commit
3e64bf0f5b
1 өөрчлөгдсөн 5 нэмэгдсэн , 0 устгасан
  1. 5 0
      docs/crawl-vectors.md

+ 5 - 0
docs/crawl-vectors.md

@@ -5,6 +5,7 @@ title: Word vectors for 157 languages
 
 We distribute pre-trained word vectors for 157 languages, trained on [*Common Crawl*](http://commoncrawl.org/) and [*Wikipedia*](https://www.wikipedia.org) using fastText.
 These models were trained using CBOW with position-weights, in dimension 300, with character n-grams of length 5, a window of size 5 and 10 negatives.
+We also distribute three new word analogy datasets, for French, Hindi and Polish.
 
 ### Format
 
@@ -59,6 +60,10 @@ E. Grave\*, P. Bojanowski\*, P. Gupta, A. Joulin, T. Mikolov, [*Learning Word Ve
 }
 ```
 
+### Evaluation datasets
+
+The analogy evaluation datasets described in the paper are available here: [French](https://s3-us-west-1.amazonaws.com/fasttext-vectors/word-analogies/questions-words-fr.txt), [Hindi](https://s3-us-west-1.amazonaws.com/fasttext-vectors/word-analogies/questions-words-hi.txt), [Polish](https://s3-us-west-1.amazonaws.com/fasttext-vectors/word-analogies/questions-words-pl.txt).
+
 ### Models
 
 The models can be downloaded from: