List of Resources

Sort by

International Corpus of Arabic

"The International Corpus of Arabic (ICA) is an ambitious attempt to build a representative corpus of the Arabic language as it is used all over the Arab world, with the aim of supporting research on such language. The ICA is planned to contain 100 million words. Once finished, the analyzed version will be the first analyzed Arabic corpus available as a linguistic resource for researchers. It is also the first systematic investigation of national varieties within the Arabic speaking community, this should prove very useful for linguists who believe that their theories and descriptions of language should be based on real, rather than contrived, data." ... read more

Russian National Corpus

"This website contains a corpus of the modern Russian language incorporating over 300 million words.  The corpus of Russian is a reference system based on a collection of Russian texts in electronic form. The Corpus is intended for all who are interested in the Russian language and various associated fields: professional linguists, language teachers, school, university students, and foreigners learning the language." ... read more