Das CLARIN-Servicezentrum des Zentrums Sprache an der BBAW

Das CLARIN-Servicezentrum des Zentrums Sprache an der BBAW



Publikationen

Aufsätze

  • Jurish, Bryan (2015): DiaCollo: On the trail of diachronic collocations. In Koenraad De Smedt (Hrsg.), Proceedings of the CLARIN Annual Conference 2015, Wroclaw, Polen, 14.-17. Oktober, S. 28-31, 2015. [PDF]
  • Jurish, Bryan and Henriette Ast (2015): Using an alignment-based lexicon for canonicalization of historical text. In: Jost Gippert and Ralf Gehrke (Eds.): Historical Corpora. Challenges and Perspectives. Tübingen 2015, S. 197–208.
  • Thomas, Christian and Frank Wiegand (2015): Making great work even better. Appraisal and digital curation of widely dispersed electronic textual resources (c. 15th–19th centuries) in CLARIN-D. In: Jost Gippert and Ralf Gehrke (Eds.): Historical Corpora. Challenges and Perspectives. Tübingen 2015, S. 181–196.
  • Geyken, Alexander and Thomas Gloning (2015): A living text archive of 15th-19th-century German. Corpus strategies, technology, organization. In: Jost Gippert and Ralf Gehrke (Eds.): Historical Corpora. Challenges and Perspectives. Tübingen 2015, S. 165–180.
  • Haaf, Susanne, Alexander Geyken and Frank Wiegand (2014–2015): The DTA “Base Format”: A TEI Subset for the Compilation of a Large Reference Corpus of Printed Text from Multiple Sources, In: Journal of the Text Encoding Initiative [Online], Issue 8 – PREVIEW | 2014–2015. [DOI: 10.4000/jtei.1114]
  • Goosen, Twan Menzo Windhouwer, Oddrun Ohren, Axel Herold, Thomas Eckart, Matej Durco and Oliver Schonefeld: CMDI 1.2: Improvements in the CLARIN Component Metadata Infrastructure. Poster and Short Paper at the CLARIN Annual Conference 2014, 23–25 Oct 2014 in Soesterberg (NL). [Paper].
  • Haaf, Susanne, Peter Fankhauser, Thorsten Trippel et al.: CLARIN’s Virtual Language Observatory (VLO) under scrutiny. The VLO taskforce of the CLARIN-D centres. Talk and Short Paper at the CLARIN Annual Conference 2014, 23–25 Oct 2014 in Soesterberg (NL). [Paper].
  • Haaf, Susanne and Matthias Schulz (2014): Historical Newspapers & Journals for the DTA. In: Language Resources and Technologies for Processing and Linking Historical Documents and Archives – Deploying Linked Open Data in Cultural Heritage – LRT4HDA. Proceedings of the workshop, held at the Ninth International Conference on Language Resources and Evaluation (LREC'14), May 26–31, 2014, Reykjavik (Iceland), pp. 50–54. [Online-Version]
  • Jurish, Bryan, Christian Thomas and Frank Wiegand (2014): Querying the Deutsches Textarchiv. In: Udo Kruschwitz, Frank Hopfgartner and Cathal Gurrin (Hg.): Proceedings of the Workshop MindTheGap 2014: Beyond Single-Shot Text Queries: Bridging the Gap(s) between Research Communities (co-located with iConference 2014, Berlin, 4. März, 2014), S. 25–30, 2014. [online-Versionen vom 5. März 2014]
  • Neuber, Frederike and Christian Thomas (2013): Kurz vor dem Ziel: Das Kurationsprojekt 1 der F-AG 1 »Deutsche Philologie«. In: Clarin-D-Newsletter, Nummer 5, 2013, November, S. 29–32. [online-Version vom 25. November 2013]
  • Jurish, Bryan (2013): Canonicalizing the Deutsches Textarchiv. In: Perspektiven einer corpusbasierten historischen Linguistik und Philologie. Internationale Tagung des Akademienvorhabens „Altägyptisches Wörterbuch“ an der Berlin-Brandenburgischen Akademie der Wissenschaften, 12.–13. Dezember 2011, herausgegeben von Ingelore Hafemann, Berlin 2013, S. 235–244. [online-Version vom 8. August 2013]
  • Geyken, Alexander (2013): Wege zu einem historischen Referenzkorpus des Deutschen: das Projekt Deutsches Textarchiv. In: Perspektiven einer corpusbasierten historischen Linguistik und Philologie. Internationale Tagung des Akademienvorhabens „Altägyptisches Wörterbuch“ an der Berlin-Brandenburgischen Akademie der Wissenschaften, 12.–13. Dezember 2011, herausgegeben von Ingelore Hafemann, Berlin 2013, S. 221–234. [online-Version vom 8. August 2013]
  • Haaf, Susanne, Frank Wiegand and Alexander Geyken (2013): Measuring the Correctness of Double-Keying: Error Classification and Quality Control in a Large Corpus of TEI-Annotated Historical Text. In: Journal of the Text Encoding Initiative (jTEI) 4, 2013. [online-Version vom 26. März 2013]
  • Gloning, Thomas, Frederike Neuber, Thomas Stäcker, Stefanie Seim and Christian Thomas: Zwischenbericht des CLARIN–D Kurationsprojekts 1 der F-AG 1; Berichtszeitraum 01.09.2012–28.02.2013. (Work Report of the CLARIN-D Curation Project led by WG-1 for the Project Period of 1 Sept 2012 to 28 Feb 2013.) March 2013. [Report].
  • Jurish, Bryan, Marko Drotschmann and Henriette Ast (2013): Constructing a canonicalized corpus of historical German by text alignment. In Paul Bennett, Martin Durrell, Silke Scheible and Richard J. Whitt (Eds.), New Methods in Historical Corpora, volume 3 of Corpus Linguistics and Interdisciplinary Perspectives on Language (CLIP), Tübingen 2013, pp. 221-234.
  • Herold, Axel, Lothar Lemnitzer and Alexander Geyken: Integrating lexical resources through an aligned lemma list. In Christian Chiarcos, Sebastian Nordhoff and Sebastian Hellmann (Eds.), Linked data in linguistics, Berlin/Heidelberg 2012, S. 35-44.
  • Beißwenger, Michael, Maria Ermakova and Alexander Geyken, Lothar Lemnitzer, Angelika Storrer: DeRiK: A German Reference Corpus of Computer-Mediated Communication. In Proceedings of Digital Humanities Conference, Hamburg 2012.
  • Geyken, Alexander, Susanne Haaf and Frank Wiegand: The DTA ‘base format’: A TEI-Subset for the Compilation of Interoperable Corpora. In 11th Conference on Natural Language Processing (KONVENS) – Empirical Methods in Natural Language Processing, Proceedings of the Conference (LThist 2012 Workshop). Edited by Jeremy Jancsary. Wien 2012, pp. 383-391 (= Schriftenreihe der Österreichischen Gesellschaft für Artificial Intelligence 5). [Paper].
  • Didakowski, Jörg, Lothar Lemnitzer and Alexander Geyken: Automatic example sentence extraction for a contemporary German dictionary. In Proceedings of the EURALEX, Oslo 2012, pp. 343-349.
  • Beißwenger, Michael, Maria Ermakova, Alexander Geyken, Lothar Lemnitzer and Angelika Storrer: A TEI schema for the Representation of the Computer-mediated Communication. In Journal of the Text Encoding Initiative (jtei) 3 (2012). [Paper].

Vorträge

  • Susanne Haaf: Text Type Classification for the Historical DTA Corpus. Presentation at the CLARIN-NeDiMAH Workshop Exploring Historical Sources with Language Technology: Results and Perspectives, 8-9 Dec 2014, The Hague (NL).
  • Bryan Jurish, Kai Zimmer: Webservices des Zentrums Sprache an der BBAW. (The Webservices at the Center of Language of the BBAW) Presentation at the 1st ifDHb-Circuit at the BBAW Berlin, 1 Sept 2014, Berlin (DE).
  • Kathrin Pindl, Christopher Kolbeck, Susanne Haaf: Vom Archiv ins WWW – Digitale Korpora und (sprach-)historische Synergien. (Digital Corpora and Synergies in (language-)historic Disciplines.) Presentation at the Conference Offene Archive 2.1 – Social media im deutschen Sprachraum und im internationalen Kontext, 3–4 April 2014, Stuttgart (DE).
  • Christian Thomas: The CLARIN-D Service Centre at the Berlin-Brandenburg Academy of Sciences and Humanities (BBAW): Corpora, Tools, Methods, and Best Practices for text-based interdisciplinary research. Presentation at the NeDiMAH-Workshop Using Large Scale Text Collections for Research, 1-2 April 2014, Wuerzburg (DE).
  • Alexander Geyken: Einführung in CLARIN. (Introduction to CLARIN.) Presentation at the constituent Meetings of CLARIN-D's Working Groups 9 (Modern History), 27 Aug 2014, Bielefeld (DE), and 10 (Contemporary History), 26 Sept 2014, Goettingen (DE).
  • Alexander Geyken: Wissenschaftliche Sammlungen aus Sicht des Zentrum Sprache der BBAW. (Scholarly Collections seen from the perspective of the Language Center at the BBAW.) Presentation at the DARIAH-Workshop Stakeholder Gremium wissenschaftliche Sammlungen, 23 July 2014, Goettingen (DE).
  • Alexander Geyken, Thomas Gloning, Thomas Stäcker, et al.: Compiling large historical reference corpora of German: Quality Assurance, Interoperability and Collaboration in the Process of Publication of Digitized Historical Prints. Panel at the Digital Humanities Conference, Hamburg (DE) 2012.
  • Michael Beißwenger, Lothar Lemnitzer: Processing and Representing Computer-Mediated Discourse: An Open Issue in Corpus Linguistics. Presentation at the Workshop on Automatic Processing of Non-Standard Data Sources in Corpus-Based Research, Köln, 31 Aug 2012
  • Axel Herold, Norbert Schrader, Alexander Geyken, Lothar Lemnitzer: Integrating ¹DWB into a digital lexical information system. Presentation at the 6th International Conference on Historical Lexicography and Lexicology, July 2012, Jena (DE).
  • Kay-Michael Würzner, Bryan Jurish, Alexander Geyken, Lothar Lemnitzer: Kollaborative Erstellung eines annotierten Korpus als Grundlage für die Anwendung statistischer Ansätze der automatischen Sprachverarbeitung auf internetbasierte Kommunikation. (Collaborative Preparation of an Annotated Corpus as a Basis for the Application of Statistical Language Processing Methods for Internet-based Communication.) Presentation at the Workshop Webkorpora in Linguistik und Sprachforschung, 27-28 Sept 2012, Mannheim (DE).
  • Alexander Geyken: TEI for written historical corpora: why and how? CLARIN, Standards and the Text Encoding Initiative. Preconference Workshop of the TEI Conference and Members' Meeting, 20 Oct 2013, Rome (IT).
  • Axel Herold: Metadata for CMC documents. Talk at the panel Computer-mediated communication in TEI. What lies ahead? at the TEI Conference and Members' Meeting, Oct 2013, Rome (IT).
  • Frank Wiegand: Integration of Wikisource data sets into the CLARIN-D Infrastructure. Showcase at the M24-workshop of CLARIN-D, 27-28 June 2013, Nijmegen (NL)
  • Frederike Neuber: Curation and integration of historical text resources of the 15th-19th century into the CLARIN-D Infrastructure. Poster and Demo at the M24-workshop of CLARIN-D, 27-28 June 2013, Nijmegen (NL)
  • Frank Wiegand: Integration von Wikisourcedatensätzen in die CLARIN-D Infrastruktur. (Integration of Wikisource data sets into the CLARIN-D infrastructure.) Poster and Demo at the M24-workshop of CLARIN-D, 27-28 June 2013, Nijmegen (NL).
  • Axel Herold: Anforderungen an eine inhaltliche Bewertung von Sprachressourcen auf der Basis nicht-technischer Metadaten. (Requirements for the evaluation of language resources with regard to their contents on the basis of non-technical metadata.) Presentation at the CLARIN-Dissemination-Workshop, 30 Sept 2013, Leipzig (DE).
  • Christian Thomas: Integration von Wikisource-Texten in die verteilte Korpusinfrastruktur von CLARIN-D. (Integration of Wikisource texts into the shared corpus infrastructure of CLARIN-D.) Presentation at the 1st Wikisource Workshop, 18-20 Oct 2013, Leipzig (DE).

Schulungen und Workshops

  • Susanne Haaf, Christian Thomas: Historical Text Corpora for the Humanities and Social Sciences. Digitization, Annotation, Quality Assurance and Analysis. Workshop at the Joint ESU DG K & T and CLARIN-D summer university, 22-26 July 2014, Leipzig (DE).
  • Matthias Boenig, Susanne Haaf, Christian Thomas: Aufbau von Sprachressourcen: Nutzung von Angeboten des Deutschen Textarchivs. Workshop im DHnet Jena in Kooperation mit dem Laboratorium Aufklärung, 14. bis 16. Januar 2016, Jena (DE).