Changes between Version 34 and Version 35 of WAC8


Timestamp:
06/18/13 16:19:43 (11 years ago)
Author:
egon w. stemle
Comment:

add CfP e-mail

  • WAC8

    v34 v35  
    150150
    151151{{{#!comment
    152 Author notification e-mail:
     152Author notification e-mail, reject:
    153153"""
    154154
     
    172172
    173173###
    174 
     174Author notification e-mail, accept:
    175175"""
    176176Dear [*FIRST-NAME*],
     
    222222"""
    223223}}}
     224
     225{{{#!comment
     226SUBJECT: Call for Participation: 8th Web as Corpus Workshop (22 July 2013, Lancaster, UK)
     227"""
     228CALL FOR PARTICIPATION
     229
     230    8th Web as Corpus Workshop (WAC-8)
     231    Endorsed by ACL SIGWAC
     232    Hosted by the Corpus Linguistics 2013 Conference
     233 
     234    Monday, 22 July 2013 (Lancaster, UK)
     235
     236** Note that registration for the workshop and the main conference closes on SUNDAY, JUNE 30. **
     237Registration URL: http://ucrel.lancs.ac.uk/cl2013/register.php
     238
     239Further details can be found on the workshop homepage at
     240
     241    http://sigwac.org.uk/wiki/WAC8
     242
     243______________________________________________________________________
     244 
     245Web corpora and other Web-derived data have become a gold mine for corpus linguistics and natural language processing. The Web is an easy source of unprecedented amounts of linguistic data from a broad range of registers and text types. However, a collection of Web pages is not immediately suitable for exploration in the same way a traditional corpus is.
     246 
     247Since the first Web as Corpus Workshop, organised at the Corpus Linguistics 2005 Conference, a highly successful series of yearly Web as Corpus workshops has provided a venue for interested researchers to meet, share ideas and discuss the problems and possibilities of compiling and using Web corpora. After a stronger focus on application-oriented natural language processing and Web technology in recent years – with workshops taking place at NAACL-HLT 2010, 2011 and WWW 2012 – the 8th Web as Corpus Workshop returns to its roots in the corpus linguistics community.
     248 
     249Accordingly, the leading theme of this workshop is the application of Web data in language research, including linguistic evaluation of Web-derived corpora as well as strategies and tools for high-quality automatic annotation of Web text. The workshop brings together presentations on all aspects of building, using and evaluating Web corpora, with a particular focus on the following topics:
     250 
     251* applications of Web corpora and other Web-derived data sets for language research
     252* automatic linguistic annotation of Web data such as tokenisation, part-of-speech tagging, lemmatisation and semantic tagging (the accuracy of currently available off-the-shelf tools is still unsatisfactory for many types of Web data)
     253* critical exploration of the characteristics of Web data from a linguistic perspective and its applicability to language research
     254* presentation of Web corpus collection projects or software tools required for some part of this process (crawling, filtering, de-duplication, language identification, indexing, ...)
     255
     256______________________________________________________________________
     257
     258PROGRAMME
     259
     26009:00 Akshay Minocha, Siva Reddy and Adam Kilgarriff -- Feed Corpus: An Ever Growing Up-to-date Corpus
     26109:30 Stephen Wattam, Paul Rayson and Damon Berridge -- LWAC: Longitudinal Web-as-Corpus Sampling
     26210:00 Roland Schäfer, Adrien Barbaresi and Felix Bildhauer -- The Good, the Bad, and the Hazy: Design Decisions in Web Corpus Construction
     26310:30 Jesse Egbert and Douglas Biber -- Developing a User-based Method of Web Register Classification
     264
     26511:00 - 11:30   Tea Break       
     266
     26711:30 Adam Kilgarriff and Vít Suchomel -- Web Spam
     26812:00 David Lutz, Parry Cadwallader and Mats Rooth -- A web application for filtering and annotating web speech data
     26912:30 Sarah Schulz, Verena Lyding and Lionel Nicolas -- STirWaC - Compiling a diverse corpus based on texts from the web for South Tyrolean German
     270
     27113:00 - 14:00   Lunch   
     272
     27314:00 Alexander Piperski, Vladimir Belikov, Nikolay Kopylov, Vladimir Selegey and Serge Sharoff -- Big and diverse is beautiful: A large corpus of Russian to study linguistic variation
     27414:30 Adriano Ferraresi and Silvia Bernardini -- The academic Web-as-Corpus
     27515:00 Silke Scheible and Sabine Schulte Im Walde -- A Compact but Linguistically Detailed Database for German Verb Subcategorisation relying on Dependency Parses from a Web Corpus
     276
     27715:30 - 16:00   Tea Break       
     278
     27916:00 Andrew Brindle -- Thug breaks man's jaw: A Corpus Analysis of Responses to Interpersonal Street Violence
     28016:30 Colleen Crangle -- A web-based model of semantic relatedness and the analysis of electroencephalographic (EEG) data
     28117:00 Discussion and wrap-up
     282
     28318:00 Pub
     284
     285______________________________________________________________________
     286
     287Looking forward to seeing you at the workshop,
     288The organising committee.
     289 
     290Stefan Evert, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU)
     291Egon Stemle, European Academy of Bozen/Bolzano (EURAC)
     292Paul Rayson, Lancaster University
     293"""
     294}}}