Diverse Veröffentlichungen

Bański, Piotr / Barbaresi, Adrien / Clematide, Simon / Kupietz, Marc / Lüngen, Harald (Hrsg.): Proceedings of the LREC 2022. Workshop on Challenges in the Management of Large Corpora (CMLC-10 2022) 36 S. - Marseille: European Language Resources Association (ELRA), 2022.
ISBN: 979-10-95546-83-2

Inhaltsverzeichnis

Pais, Vasile / Mitrofan, Maria / Barbu Mititelu, Verginica / Irimia, Elena / Micu, Roxana / Gasan, Carol Luca:
  Challenges in Creating a Representative Corpus of Romanian Micro-Blogging Text S. 1
von Korff, Modest:
  Exhaustive Indexing of PubMed Records with Medical Subject Headings S. 8
Brigada Villa, Luca:
  UDeasy: a Tool for Querying Treebanks in CoNLL-U Format S. 16
Diewald, Nils:
  Matrix and Double-Array Representations for Efficient Finite State Tokenization IDS-Publikationsserver
Text
S. 20
Fankhauser, Peter / Kupietz, Marc:
  Count-Based and Predictive Language Models for Exploring DeReKo IDS-Publikationsserver
Text
S. 27
Biber, Hanno:
  “The word expired when that world awoke.”
New Challenges for Research with Large Text Corpora and Corpus-Based Discourse Studies in Totalitarian Times
S. 32