This volume deals with different aspects of the creation and use of multilingual corpora. The term 'multilingual corpus' is understood in a comprehensive sense, meaning any systematic collection of empirical language data enabling linguists to carry out analyses of multilingual individuals, multilingual societies or multilingual communication. The individual contributions are thus concerned with a variety of spoken and written corpora ranging from learner and attrition corpora, language contact corpora and interpreting corpora to comparable and parallel corpora. The overarching aim of the volume is first to take stock of the variety of existing multilingual corpora, documenting possible corpus designs and uses, second to discuss methodological and technological challenges in the creation and analysis of multilingual corpora, and third to provide examples of linguistic analyses that were carried out on the basis of multilingual corpora.
Inhaltsverzeichnis
Schmidt, Thomas / Wörner, Kai: | |||
Introduction | S. XI | ||
Section 1. Learner and attrition corpora | |||
Gut, Ulrike: | |||
The LeaP corpus: A multilingual corpus of spoken learner German and learner English | S. 3 | ||
Hedeland, Hanna / Schmidt, Thomas: | |||
Technological and methodological challenges in creating, annotating and sharing a learner corpus of spoken German | S. 25 | ||
Ott, Niels / Ziai, Ramon / Meurers, Detmar: | |||
Creation and analysis of a reading comprehension exercise corpus: Towards evaluating meaning in context | S. 47 | ||
Zinsmeister, Heike / Breckle, Margit: | |||
The ALeSKo learner corpus: Design – annotation – quantitative analyses | S. 71 | ||
Saceda Ulloa, Marta / Lleó, Conxita / Garcia Sanchez, Izarbe: | |||
Corpora of spoken Spanish by simultaneous and successive German-Spanish bilingual and Spanish monolingual children | S. 97 | ||
Lleó, Conxita: | |||
Monolingual and bilingual phonoprosodic corpora of child German and child Spanish | S. 107 | ||
Herkenrath, Annette / Rehbein, Jochen: | |||
Pragmatic corpus analysis, exemplified by Turkish-German bilingual and monolingual data | S. 123 | ||
Czachór, Agnieszka: | |||
Corpus of Polish spoken in Germany: Collecting and analysing written & spoken data for investigating contact-induced change | S. 153 | ||
Kupisch, Tanja / Barton, Dagmar / Bianchi, Giulia / Stangen, Ilse: | |||
The HABLA-corpus (German-French and German-Italian) | S. 163 | ||
Section 2. Language contact corpora | |||
Gabriel, Christoph: | |||
The Hamburg Corpus of Argentinean Spanish (HaCASpa) | S. 183 | ||
Kühl, Karoline H.: | |||
Ad hoc contact phenomena or established features of a contact variety? Evidence from corpus analysis |
S. 199 | ||
Benet, Ariadna / Cortés, Susana / Lleó, Conxita: | |||
Phonoprosodic corpus of spoken Catalan (PhonCAT) | S. 215 | ||
Putz, Magdalena: | |||
Researching the intelligibility of a (German) dialect | S. 231 | ||
Höder, Steffen: | |||
Annotating ambiguity: Insights from a corpus-based study on syntactic change in Old Swedish | S. 245 | ||
Section 3. Interpreting corpora | |||
Angermeyer, Philipp Sebastian / Meyer, Bernd / Schmidt, Thomas: | |||
Sharing community interpreting corpora: A pilot study | S. 275 | ||
House, Juliane / Meyer, Bernd / Schmidt, Thomas: | |||
CoSi – A Corpus of Consecutive and Simultaneous Interpreting | S. 295 | ||
Bührig, Kristin / Kliche, Ortrun / Meyer, Bernd / Pawlack, Birte: | |||
The corpus “Interpreting in Hospitals”: Possible applications for research and communication training | S. 305 | ||
Section 4. Comparable and parallel corpora | |||
Fandrych, Christian / Meißner, Cordula / Slavcheva, Adriana: | |||
The GeWiss corpus: Comparing spoken academic German, English and Polish | S. 319 | ||
Dittmann, Henrik / Ďurčo, Matej / Geyken, Alexander / Roth, Tobias / Zimmer, Kai: | |||
Korpus C4: A distributed corpus of German varieties | S. 339 | ||
Čulo, Oliver / Hansen-Schirra, Silvia: | |||
Treebanks in translation studies: The CroCo Dependency Treebank | S. 347 | ||
Section 5. Corpus tools | |||
Rose, Yvan: | |||
Multilingual phonological corpus analysis: The tools behind the PhonBank Project | S. 365 | ||
Wörner, Kai: | |||
Finding the balance between strict defaults and total openness: Collecting and managing metadata for spoken language corpora with the EXMARaLDA Corpus Manager | S. 383 | ||
General index | S. 401 | ||
Corpora index | S. 405 | ||
Language index | S. 407 |