Text-corpora have been gathered from:
- Site: http://wortschatz.uni-leipzig.de/en/
- Paper:
D. Goldhahn, T. Eckart & U. Quasthoff: Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages.
In: Proceedings of the 8th International Language Ressources and Evaluation (LREC'12), 2012 - © 2018 Abteilung Automatische Sprachverarbeitung, Universität Leipzig.
- License: CC-BY 4.0
The Wikipedia texts are from Wikipedia, and are therefore licensed by various Creative Commons licenses.