tdd TDD

TS Corpus Word List

The TS Corpus Word List is a large-scale lexical dataset containing 3,386,314 Turkish word forms, including roots, derived forms, and inflected variants. It has been systematically compiled from multiple corpora within the TS Corpus project, ensuring representation of actively used vocabulary in contemporary Turkish. This dataset provides a valuable resource for linguistic research, natural language processing, and lexicographic studies, offering comprehensive coverage of word formation processes and usage patterns in the Turkish language.

MIT
Task
lexicon
Language
tr
Size
7.40 MB