Miscellaneous Publications
Bański, Piotr / Barbaresi, Adrien / Clematide, Simon / Kupietz, Marc / Lüngen, Harald (eds.): Proceedings of the LREC 2022. Workshop on Challenges in the Management of Large Corpora (CMLC-10 2022)
36 pp. - Marseille: European Language Resources Association (ELRA), 2022.
ISBN: 979-10-95546-83-2
Table of Contents
Pais, Vasile / Mitrofan, Maria / Barbu Mititelu, Verginica / Irimia, Elena / Micu, Roxana / Gasan, Carol Luca:
Challenges in Creating a Representative Corpus of Romanian Micro-Blogging Text, p. 1
von Korff, Modest:
Exhaustive Indexing of PubMed Records with Medical Subject Headings, p. 8
Brigada Villa, Luca:
UDeasy: a Tool for Querying Treebanks in CoNLL-U Format, p. 16
Diewald, Nils:
Matrix and Double-Array Representations for Efficient Finite State Tokenization, p. 20
Fankhauser, Peter / Kupietz, Marc:
Count-Based and Predictive Language Models for Exploring DeReKo, p. 27
Biber, Hanno:
“The word expired when that world awoke.” New Challenges for Research with Large Text Corpora and Corpus-Based Discourse Studies in Totalitarian Times, p. 32