Diverse Veröffentlichungen
-
Bański, Piotr / Barbaresi, Adrien / Clematide, Simon / Kupietz, Marc / Lüngen, Harald (Hrsg.): Proceedings of the LREC 2022. Workshop on Challenges in the Management of Large Corpora (CMLC-10 2022)
36 S. - Marseille: European Language Resources Association (ELRA), 2022.
ISBN: 979-10-95546-83-2
Inhaltsverzeichnis
| Pais, Vasile / Mitrofan, Maria / Barbu Mititelu, Verginica / Irimia, Elena / Micu, Roxana / Gasan, Carol Luca: | |||
| Challenges in Creating a Representative Corpus of Romanian Micro-Blogging Text | S. 1 | ||
| von Korff, Modest: | |||
| Exhaustive Indexing of PubMed Records with Medical Subject Headings | S. 8 | ||
| Brigada Villa, Luca: | |||
| UDeasy: a Tool for Querying Treebanks in CoNLL-U Format | S. 16 | ||
| Diewald, Nils: | |||
| Matrix and Double-Array Representations for Efficient Finite State Tokenization |
→IDS-Publikationsserver →Text |
S. 20 | |
| Fankhauser, Peter / Kupietz, Marc: | |||
| Count-Based and Predictive Language Models for Exploring DeReKo |
→IDS-Publikationsserver →Text |
S. 27 | |
| Biber, Hanno: | |||
| “The word expired when that world awoke.” New Challenges for Research with Large Text Corpora and Corpus-Based Discourse Studies in Totalitarian Times |
S. 32 | ||