Context Navigation

Changes between Version 34 and Version 35 of WAC8

Timestamp:: 06/18/13 16:19:43 (13 years ago)
Author:: egon w. stemle
Comment:: add CfP e-mail

Legend:

: Unmodified
: Added
: Removed
: Modified

WAC8

-              v34
+              v35
 {{{#!comment
 Author notification e-mail:
+Author notification e-mail, reject:
 """
 …
 ###
+Author notification e-mail, accept:
 """
 Dear [*FIRST-NAME*],
 …
 """
 }}}
+{{{#!comment
+SUBJECT: Call for Participation: 8th Web as Corpus Workshop (22 July 2013, Lancaster, UK)
+"""
+CALL FOR PARTICIPATION
+th Web as Corpus Workshop (WAC-8)
+    Endorsed by ACL SIGWAC
+    Hosted by the Corpus Linguistics 2013 Conference
+    Monday, 22 July 2013 (Lancaster, UK)
+** Note that registration for the workshop and the main conference closes on SUNDAY, JUNE 30. **
+Registration URL: http://ucrel.lancs.ac.uk/cl2013/register.php
+Further details can be found on the workshop homepage at
+    http://sigwac.org.uk/wiki/WAC8
+______________________________________________________________________
+Web corpora and other Web-derived data have become a gold mine for corpus linguistics and natural language processing. The Web is an easy source of unprecedented amounts of linguistic data from a broad range of registers and text types. However, a collection of Web pages is not immediately suitable for exploration in the same way a traditional corpus is.
+Since the first Web as Corpus Workshop organised at the Corpus Linguistics 2005 Conference, a highly successful series of yearly Web as Corpus workshops provides a venue for interested researchers to meet, share ideas and discuss the problems and possibilities of compiling and using Web corpora. After a stronger focus on application-oriented natural language processing and Web technology in recent years – with workshops taking place at NAACL-HLT 2010, 2011 and WWW 2012 – the 8th Web as Corpus Workshop returns to its roots in the corpus linguistics community.
+Accordingly, the leading theme of this workshop is the application of Web data in language research, including linguistic evaluation of Web-derived corpora as well as strategies and tools for high-quality automatic annotation of Web text. The workshop brings together presentations on all aspects of building, using and evaluating Web corpora, with a particular focus on the following topics:
+* applications of Web corpora and other Web-derived data sets for language research
+* automatic linguistic annotation of Web data such as tokenisation, part-of-speech tagging, lemmatisation and semantic tagging (the accuracy of currently available off-the-shelf tools is still unsatisfactory for many types of Web data)
+* critical exploration of the characteristics of Web data from a linguistic perspective and its applicability to language research
+* presentation of Web corpus collection projects or software tools required for some part of this process (crawling, filtering, de-duplication, language identification, indexing, ...)
+______________________________________________________________________
+PROGRAMME
+:00 Akshay Minocha, Siva Reddy and Adam Kilgarriff -- Feed Corpus: An Ever Growing Up-to-date Corpus
+:30 Stephen Wattam, Paul Rayson and Damon Berridge -- LWAC: Longitudinal Web-as-Corpus Sampling
+:00 Roland Schäfer, Adrien Barbaresi and Felix Bildhauer -- The Good, the Bad, and the Hazy: Design Decisions in Web Corpus Construction
+:30 Jesse Egbert and Douglas Biber -- Developing a User-based Method of Web Register Classification
+:00 - 11:30   Tea Break
+:30 Adam Kilgarriff and Vít Suchomel -- Web Spam
+:00 David Lutz, Parry Cadwallader and Mats Rooth -- A web application for filtering and annotating web speech data
+:30 Sarah Schulz, Verena Lyding and Lionel Nicolas -- STirWaC - Compiling a diverse corpus based on texts from the web for South Tyrolean German
+:00 - 14:00   Lunch
+:00 Alexander Piperski, Vladimir Belikov, Nikolay Kopylov, Vladimir Selegey and Serge Sharoff -- Big and diverse is beautiful: A large corpus of Russian to study linguistic variation
+:30 Adriano Ferraresi and Silvia Bernardini -- The academic Web-as-Corpus
+:00 Silke Scheible and Sabine Schulte Im Walde -- A Compact but Linguistically Detailed Database for German Verb Subcategorisation relying on Dependency Parses from a Web Corpus
+:30 - 16:00   Tea Break
+:00 Andrew Brindle -- Thug breaks man's jaw: A Corpus Analysis of Responses to Interpersonal Street Violence
+:30 Colleen Crangle -- A web-based model of semantic relatedness and the analysis of electroencephalographic (EEG) data
+:00 Discussion and wrap-up
+:00 Pub
+______________________________________________________________________
+Looking forward to seeing you at the workshop,
+The organising committee.
+Stefan Evert, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU)
+Egon Stemle, European Academy of Bozen/Bolzano (EURAC)
+Paul Rayson, Lancaster University
+"""
+}}}