Wang, Fang Wu, Fuyun
Published in
Linguistics
In contrast to well-studied prenominal relative clauses (RCs) in Chinese, little has been known about postnominal RCs that are non-canonical but existent in spoken Chinese. Focusing on Standard Mandarin, this paper examines in a large-scale spoken corpus the distributional patterns of postnominal RCs. Using distribution patterns of prenominal RCs i...
(Chin-Chin Tseng), (Dayoung Jeong),
Published in
Chinese as a Second Language Research
The purpose of this study is to explore the use of Mandarin utterance-final particles in online teacher-student interactions. The interactions were carried out by 15 preservice teachers who were graduate students at the Department of Chinese as a Second Language and 6 learners of Chinese as a second language in Singapore. After four online-interact...
Enghels, Renata De Latte, Fien Roels, Linde
CORMA is a corpus of peninsular Spanish including spontaneous conversations recorded in Madrid between 2016 and 2019. The corpus was compiled in order to remedy the scarce documentation of 21st century colloquial Spanish. Indeed, a short overview of the corpora of conversational Spanish shows that there is a sharp contrast between the increasing in...
Bérard, Lolita
Le projet ANR-12-CORP-0005 orféo a abouti à la réalisation d’un Corpus d’Étude pour le Français Contemporain (céfc) qui est diffusé librement. Nous présenterons dans cet article les données orales intégrées au céfc et la répartition chiffrée de ce pluri-corpus, homogénéisé pour donner aux utilisateurs un accès simplifié à un grand nombre de données...
Komrsková, Zuzana Poukarová, Petra
Published in
Journal of Linguistics/Jazykovedný casopis
This paper deals with the position of three Czech subordinating conjunctions že ’that’, když ‘when’, and až ‘when’ within the prosodic word, using the phonetic annotation in the ORTOFON corpus. The position of subordinating conjunctions is traditionally described as initial within the subordinate clause, but the situation in spontaneous speech is n...
Goláňová, Hana Waclawičová, Martina
Published in
Journal of Linguistics/Jazykovedný casopis
DIALEKT, a corpus of Czech dialects, has been continuously curated and expanded by the Spoken Corpora section of the Institute of the Czech National Corpus. The following paper aims first to give a concise characteristic of the corpus, addressing its sociolinguistic parameters and possible subcorpora derivable thereof, its two-layer approach to the...
Rougé, Jean-Louis Schang, Emmanuel Luis, Ana Badin, Flora Tavares, Eugène
Nous présentons la constitution d'une ressource sur le kriol, langue créole à base lexicale portugaise, parlée en Guinée-Bissau. Il s'agit dans un premier temps de la mise à disposition de 25 heures d'enregistrements transcrits, glosés et traduits. Cet article propose une présentation des enjeux (§2) ainsi qu'une description de la méthode choisie p...
Desagulier, Guillaume Lacheret-Dujour, Anne Isel, Frédéric Mun, Seongmin
Rhapsodie is a 33000-word treebank of spoken French that is annotated for syntax and prosody. It breaks down into 57 five-minute long samples produced by 89 male and female speakers. The discourse profile of each sample is captured by six variables: event structure (dialogue vs. monologue), social context (public vs. private), genre (argumentation,...
Komrsková, Zuzana Kopřivová, Marie Lukeš, David Poukarová, Petra Goláňová, Hana
Published in
Journal of Linguistics/Jazykovedný casopis
The paper introduces the ORTOFON corpus of spontaneous spoken Czech and the DIALEKT corpus of Czech dialects, their design principles and practical solutions adopted during data collection.
Cordereix, Pascal
Published in
Histoire Epistémologie Langage
La patrimonialisation des corpus oraux fait désormais partie de leur cycle de vie. Le geste de « mettre à part » (Michel de Certeau) qui caractérise toute entrée en archives amène notamment à un ensemble d’actions descriptives (inventaire, catalogage...) normées, qui vont permettre la consultation, la diffusion, l’exploitation et la conservation pé...