6 | | |
7 | | == Accepted Papers (alphabetically by first author's first name) == |
8 | | |
9 | | * '''Adrien Barbaresi''': ''Finding viable seed URLs for web corpora: a scouting approach and comparative study of available sources'' |
10 | | * '''Magali Sanches Duran, Lucas Avanço, Sandra Aluísio, Thiago Pardo and Maria da Graça Volpe Nunes''': ''Some issues on the normalization of a corpus of product reviews in Portuguese'' |
11 | | * '''Maik Stührenberg''': ''Less destructive cleaning of web documents by using standoff annotation'' |
12 | | * '''Nikola Ljubešić''': ''{bs,hr,sr}WaC - Web corpora of Bosnian, Croatian and Serbian'' |
13 | | * '''Roland Schäfer, Adrien Barbaresi and Felix Bildhauer''': ''Focused Web Corpus Crawling'' |
14 | | * '''Varvara Magomedova, Natalia Slioussar and Maria Kholodilova''': ''Internet data in a study of language change and a program helping to work with them'' |
15 | | * '''Verena Lyding, Egon Stemle, Andrea Abel, Claudia Borghetti, Marco Brunello, Sara Castagnoli, Felice Dell'Orletta, Henrik Dittmann, Alessandro Lenci and Vito Pirrelli''': ''The PAISÀ Corpus of Italian Web Texts'' |
16 | | |
17 | | == Information for authors == |
18 | | |
19 | | * Please submit your camera-ready full paper formatted according to the EACL stylesheet by March 03, 2014. There will be no extension of this deadline. Failure to submit the manuscript in time means that your paper will no bei included in the proceedings. |
20 | | * Papers can have a maximum length of 8 pages including everything. |
21 | | * LaTeX and MS Word templates are available [http://www.eacl2014.org/files/eacl-2014-styles.zip here]. |
22 | | |
23 | | |
24 | | == Online Survey == |
25 | | |
26 | | Please fill out this [https://www.surveymonkey.com/s/D8RFRCR online survey] regarding a panel discussion about a potential shared task following up CLEANEVAL until Sunday, March 02, 2014. |
| 19 | |
| 20 | |
| 21 | |
| 22 | |
| 23 | |
| 24 | |
| 25 | == Workshop program == |
| 26 | |
| 27 | |
| 28 | '''11:15–11:30''' ''Welcome'' (Felix Bildhauer & Roland Schäfer) |
| 29 | |
| 30 | '''11:30–12:00''' ''Finding Viable Seed URLs for Web Corpora: A Scouting Approach and Comparative Study of Available Sources'' (Adrien Barbaresi) |
| 31 | |
| 32 | '''12:00–12:30''' ''Focused Web Corpus Crawling'' (Roland Schäfer, Adrien Barbaresi & Felix Bildhauer) |
| 33 | ---- |
| 34 | LUNCH BREAK |
| 35 | ---- |
| 36 | '''14:00–14:30''' ''Less Destructive Cleaning of Web Documents by Using Standoff Annotation'' (Maik Stührenberg) |
| 37 | |
| 38 | '''14:30–15:00''' ''Some Issues on the Normalization of a Corpus of Products Reviews in Portuguese'' (Magali Sanches Duran, Lucas Avanço, Sandra Aluísio, Thiago Pardo & Maria da Graça Volpe Nunes) |
| 39 | |
| 40 | '''15:00–15:30''' ''{bs,hr,sr}WaC - Web Corpora of Bosnian, Croatian and Serbian'' (Nikola Ljubešić & Filip Klubička) |
| 41 | ---- |
| 42 | COFFEE BREAK |
| 43 | ---- |
| 44 | '''16:00–16:30''' ''The PAISÀ Corpus of Italian Web Texts'' (Verena Lyding, Egon Stemle, Claudia Borghetti, Marco Brunello, Sara Castagnoli, Felice Dell’Orletta, Henrik Dittmann, Alessandro Lenci & Vito Pirrelli) |
| 45 | |
| 46 | '''16:30–17:00''' ''Internet Data in a Study of Language Change and a Program Helping to Work with Them'' (Varvara Magomedova, Natalia Slioussar & Maria Kholodilova) |
| 47 | |
| 48 | '''17:00–18:00''' ''Discussion'' |
| 49 | |
| 50 | ---- |
| 51 | |
| 52 | == Venue == |
| 53 | |
| 54 | Campus Johanneberg of Chalmers University of Technology[[BR]] |
| 55 | Chalmersplatsen 1[[BR]] |
| 56 | 412 58 Gothenburg, Sweden |
| 57 | |
| 58 | Please see the EACL-2014 website for [http://eacl2014.org/venue details on how to get there]. |
| 59 | |
| 60 | |
| 61 | |
| 62 | |
| 63 | |
| 64 | |
| 65 | |
| 66 | {{{#!comment |
| 67 | == Online Survey == |
| 68 | |
| 69 | Please fill out this [https://www.surveymonkey.com/s/D8RFRCR online survey] regarding a panel discussion about a potential shared task following up CLEANEVAL until Sunday, March 02, 2014.}}} |
| 70 | }}} |