5 corpora found

The Tunisian Lecture Corpus (TLC) is a non-native, specialized corpus of academic lectures collected at two institutions of higher education in Tunisia in the academic year 2014-2015. The corpus comprises around 106 thousand words and is made up of 12 video recordings and 1 audio recording. Thirteen lecturers of undergraduate courses in three disciplines (cultural studies, linguistics, and literature) participated in this research. Course descriptions for 9 of the 10 courses recorded are included in the corpus, along with metadata about the participants such as gender, age, language background, and teaching experience.
 English   speech    106,200 words, 20 hours and 50 minutes    2015  

The Louvain International Database of Spoken English Interlanguage (LINDSEI) is a corpus of informal interviews with higher intermediate to advanced learners of English.
 English   speech    1 million words   cd-rom    2010  

The International Corpus of Learner English (Version 2) is a corpus of writing by higher intermediate to advanced learners of English. It contains 3.7 million words of EFL writing from learners representing 16 different mother tongue backgrounds (Bulgarian, Chinese, Czech, Dutch, Finnish, French, German, Italian, Japanese, Norwegian, Polish, Russian, Spanish, Swedish, Turkish and Tswana).
 English   written    3.7 million words   Book + cd-rom    2009  

Corpus of 30,000 SMS messages and consultation software
In just a few years, SMS communication has gone from a simple technical process to a genuine social phenomenon. Intrigued, sociologists, psychologists, linguists, and other specialists in language or communication have recently begun to study these new communication practices, which are often placed within the broader framework of computer-mediated communication.
 French   written    30,000 SMS   Book + CD-ROM    2006  

Corpus Madrileño Oral de la Sexualidad (Madrilenian Spoken Corpus of Sexuality)
Sociolinguistic corpus of ca. 1 million words that focuses on the topic of sexuality. It is composed of 54 sociolinguistic interviews collected in Madrid in 2010-2012 from a sample pre-stratified by district, gender, age, and level of education. The interviews were conducted face to face, recorded, and based on a questionnaire (indirect elicitation). Three interviewers participated.
 Spanish   speech    1 million words   Microsoft Word (.doc)    19/12/2013