16 | | |||| '''WAC-X morning session''' || |
17 | | || 9:30–9:40 ||'''Welcome and Introduction''' || |
18 | | || 9:40–10:00 ||''Automatic Classification by Topic Domain for Meta Data Generation, Web Corpus Evaluation, and Corpus Comparison'' || |
19 | | || ||Roland Schäfer and Felix Bildhauer || |
20 | | || 10:00–10:30 ||''Efficient construction of metadata-enhanced web corpora'' || |
21 | | || ||Adrien Barbaresi || |
22 | | |||| '''WAC-X noon session''' || |
23 | | || 11:00–11:30 ||''Topically-focused Blog Corpora for Multiple Languages'' || |
24 | | || ||Andrew Salway, Dag Elgesem, Knut Hofland, Øystein Reigem and Lubos Steskal || |
25 | | || 11:30–12:00 ||''The Challenges and Joys of Analysing Ongoing Language Change in Web-based Corpora: a Case Study'' || |
26 | | || ||Anne Krause || |
27 | | || 12:00–12:30 ||''Using the Web and Social Media as Corpora for Monitoring the Spread of Neologisms. The case of ’rapefugee’, ’rapeugee’, and ’rapugee’.'' || |
28 | | || ||Quirin Würschinger, Mohammad Fazleh Elahi, Desislava Zhekova and Hans-Jörg Schmid || |
29 | | |||| '''EmpiriST session''' || |
30 | | || 13:30–13:50 ||''EmpiriST 2015: A Shared Task on the Automatic Linguistic Annotation of Computer-Mediated Communication and Web Corpora'' || |
31 | | || ||Michael Beißwenger, Sabine Bartsch, Stefan Evert and Kay-Michael Würzner || |
32 | | || 13:50–14:10 ||''!SoMaJo: State-of-the-art tokenization for German web and social media texts'' || |
33 | | || ||Thomas Proisl and Peter Uhrig || |
34 | | || 14:10–14:30 ||''UdS-(retrain|distributional|surface): Improving POS Tagging for OOV Words in German CMC and Web Data'' || |
35 | | || ||Jakob Prange, Andrea Horbach and Stefan Thater || |
36 | | |||| '''WAC-X and EmpiriST teaser talks''' || |
37 | | || 14:30–14:35 ||''Babler - Data Collection from the Web to Support Speech Recognition and Keyword Search'' || |
38 | | || ||Gideon Mendels, Erica Cooper and Julia Hirschberg || |
39 | | || 14:35–14:40 ||''A Global Analysis of Emoji Usage'' || |
40 | | || ||Nikola Ljubešić and Darja Fišer || |
41 | | || 14:40–14:45 ||''Genre classification for a corpus of academic webpages'' || |
42 | | || ||Erika Dalan and Serge Sharoff || |
43 | | || 14:45–14:50 ||''On Bias-free Crawling and Representative Web Corpora'' || |
44 | | || ||Roland Schäfer || |
45 | | || 14:55–15:00 ||''EmpiriST: AIPHES - Robust Tokenization and POS-Tagging for Different Genres'' || |
46 | | || ||Steffen Remus, Gerold Hintz, Chris Biemann, Christian M. Meyer, Darina Benikova, Judith Eckle-Kohler, Margot Mieskes and Thomas Arnold || |
47 | | || 15:00–15:05 ||''bot.zen @ EmpiriST 2015 - A minimally-deep learning PoS-tagger (trained for German CMC and Web data)'' || |
48 | | || ||Egon Stemle || |
49 | | || 15:05–15:10 ||''LTL-UDE @ EmpiriST 2015: Tokenization and PoS Tagging of Social Media Text'' || |
50 | | || ||Tobias Horsmann and Torsten Zesch || |
51 | | |||| '''Posters and discussions''' || |
52 | | || 15:10–16:30 ||='''WAC-X and EmpiriST poster session''' =|| |
53 | | || 16:30–17:30 ||='''WAC-X and EmpiriST closing discussion''' =|| |
54 | | || 17:30–18:30 ||='''Panel discussion ''Corpora, open science, and copyright reforms''''' =|| |
| 16 | |||| '''WAC-X morning session'''|| |
| 17 | || 9:30–9:40 ||Welcome and Introduction || |
| 18 | || 9:40–10:00 ||'''Roland Schäfer and Felix Bildhauer'''[[BR]]''Automatic Classification by Topic Domain for Meta Data Generation, Web Corpus Evaluation, and Corpus Comparison'' || |
| 19 | || 10:00–10:30 ||'''Adrien Barbaresi'''[[BR]]''Efficient construction of metadata-enhanced web corpora'' || |
| 20 | |||| '''WAC-X noon session'''|| |
| 21 | || 11:00–11:30 ||'''Andrew Salway, Dag Elgesem, Knut Hofland, Øystein Reigem and Lubos Steskal'''[[BR]]''Topically-focused Blog Corpora for Multiple Languages'' || |
| 22 | || 11:30–12:00 ||'''Anne Krause'''[[BR]]''The Challenges and Joys of Analysing Ongoing Language Change in Web-based Corpora: a Case Study'' || |
| 23 | || 12:00–12:30 ||'''Quirin Würschinger, Mohammad Fazleh Elahi, Desislava Zhekova and Hans-Jörg Schmid'''[[BR]]''Using the Web and Social Media as Corpora for Monitoring the Spread of Neologisms. The case of ’rapefugee’, ’rapeugee’, and ’rapugee’.'' || |
| 24 | |||| '''EmpiriST session'''|| |
| 25 | || 13:30–13:50 ||'''Michael Beißwenger, Sabine Bartsch, Stefan Evert and Kay-Michael Würzner'''[[BR]]''EmpiriST 2015: A Shared Task on the Automatic Linguistic Annotation of Computer-Mediated Communication and Web Corpora'' || |
| 26 | || 13:50–14:10 ||'''Thomas Proisl and Peter Uhrig'''[[BR]]''!SoMaJo: State-of-the-art tokenization for German web and social media texts'' || |
| 27 | || 14:10–14:30 ||'''Jakob Prange, Andrea Horbach and Stefan Thater'''[[BR]]''UdS-(retrain|distributional|surface): Improving POS Tagging for OOV Words in German CMC and Web Data'' || |
| 28 | |||| '''WAC-X and EmpiriST teaser talks'''|| |
| 29 | || 14:30–14:35 ||'''Gideon Mendels, Erica Cooper and Julia Hirschberg'''[[BR]]''Babler - Data Collection from the Web to Support Speech Recognition and Keyword Search'' || |
| 30 | || 14:35–14:40 ||'''Nikola Ljubešić and Darja Fišer'''[[BR]]''A Global Analysis of Emoji Usage'' || |
| 31 | || 14:40–14:45 ||'''Erika Dalan and Serge Sharoff'''[[BR]]''Genre classification for a corpus of academic webpages'' || |
| 32 | || 14:45–14:50 ||'''Roland Schäfer'''[[BR]]''On Bias-free Crawling and Representative Web Corpora'' || |
| 33 | || 14:55–15:00 ||'''Steffen Remus, Gerold Hintz, Chris Biemann, Christian M. Meyer, Darina Benikova, Judith Eckle-Kohler, Margot Mieskes and Thomas Arnold'''[[BR]]''EmpiriST: AIPHES - Robust Tokenization and POS-Tagging for Different Genres'' || |
| 34 | || 15:00–15:05 ||'''Egon Stemle'''[[BR]]''bot.zen @ EmpiriST 2015 - A minimally-deep learning PoS-tagger (trained for German CMC and Web data)'' || |
| 35 | || 15:05–15:10 ||'''Tobias Horsmann and Torsten Zesch'''[[BR]]''LTL-UDE @ EmpiriST 2015: Tokenization and PoS Tagging of Social Media Text'' || |
| 36 | |||| '''Posters and discussions'''|| |
| 37 | || 15:10–16:30 ||=WAC-X and EmpiriST poster session =|| |
| 38 | || 16:30–17:30 ||=WAC-X and EmpiriST closing discussion =|| |
| 39 | || 17:30–18:30 ||=Panel discussion ''Corpora, open science, and copyright reforms'' =|| |