Repository Details
This is a German text corpus from Wikipedia. It has been cleaned, preprocessed, and split into sentences. Its purpose is to train NLP embeddings such as fastText or ELMo (deep contextualized word representations).
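As a rough illustration of how a sentence-split corpus like this might be consumed before embedding training, here is a minimal sketch. It assumes the corpus is a UTF-8 plain-text file with one sentence per line and whitespace-separated tokens; the description above does not confirm that layout. The function names `iter_sentences` and `build_vocab` are hypothetical helpers, not part of this repository.

```python
from collections import Counter
from typing import Iterator, List


def iter_sentences(corpus_path: str) -> Iterator[List[str]]:
    """Yield each non-empty line of the corpus as a list of tokens.

    Assumption: one sentence per line, tokens separated by whitespace.
    """
    with open(corpus_path, encoding="utf-8") as handle:
        for line in handle:
            tokens = line.split()
            if tokens:  # skip blank lines
                yield tokens


def build_vocab(corpus_path: str, min_count: int = 1) -> Counter:
    """Count token frequencies and keep tokens seen at least min_count times."""
    counts: Counter = Counter()
    for tokens in iter_sentences(corpus_path):
        counts.update(tokens)
    return Counter({word: n for word, n in counts.items() if n >= min_count})
```

Streaming the file line by line keeps memory use low, which matters for a Wikipedia-sized corpus. A frequency cutoff like `min_count` is a common way to drop rare tokens before training, though the right threshold depends on the embedding method.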