Changes between Initial Version and Version 1 of WAC8


Ignore:
Timestamp:
11/29/12 02:39:23 (12 years ago)
Author:
egon w. stemle
Comment:

initial version

Legend:

Unmodified
Added
Removed
Modified
  • WAC8

    v1 v1  
     1= 8th Web as Corpus Workshop (WAC-8) @ [http://ucrel.lancs.ac.uk/cl2013/ Corpus Linguistics 2013]=
     2== Lancaster, UK; Monday 22nd July 2013 ==
     3
     4{{{#!comment
     5Sponsored by [http://www.sigwac.org.uk ACL SIGWAC].
     6}}}
     7
     8Web corpora and other Web-derived data have become a gold mine for corpus linguistics and natural language processing.  The Web is an easy source of unprecedented amounts of linguistic data from a broad range of registers and text types.  However, a collection of Web pages is not immediately suitable for exploration in the same way a traditional corpus is.
     9
     10Since the first Web as Corpus Workshop organised at the Corpus Linguistics 2005 Conference, a highly succesful series of yearly Web as Corpus workshops provides a venue for interested researchers to meet, share ideas and discuss the problems and possibilities of compiling and using Web corpora.  After a stronger focus on application-oriented natural language processing and Web technology in recent years -- with workshops taking place at the NAACL-HLT 2010, 2011 and WWW 2012 -- the 8th Web as Corpus Workshop returns to its roots in the corpus linguistics community.
     11
     12Accordingly, the leading theme of this workshop is the application of Web data in language research, including linguistic evaluation of Web-derived corpora as well as strategies and tools for high-quality automatic annotation of Web text. We invite papers on all aspects of building and using Web corpora, with a particular focus on (but not limited to) the following:
     13
     14 * applications of Web corpora and other Web-derived data sets for language research
     15 * automatic linguistic annotation of Web data such as tokenisation, part-of-speech tagging, lemmatisation and semantic tagging (the accuracy of established software tools is still unsatisfactory for many types of Web data)
     16 * critical exploration of characteristics of Web data from a linguistic perspective and its applicability to language research
     17 * presentation of Web corpus collection projects or software tools required for some part of the process (crawling, filtering, de-duplication, language identification, indexing, ...)
     18
     19{{{#!comment
     20This workshop is endorsed by the Special Interest Group on the Web as Corpus (SIGWAC) of the Association for Computational Linguistics (ACL).
     21}}}
     22
     23{{{#!comment
     24== Programme ==
     25
     26The proceedings are available [https://sigwac.org.uk/raw-attachment/wiki/WAC8/wac8-proc.pdf here]
     27
     28||9.00|| '''Welcome''' ||
     29||9.10|| '''Invited Talk''': ''tba'' ||
     30|| ||tba||
     31||10.00||''foo bar''||
     32|| || something very interesting ||
     33||10.40 || ''' Coffee ''' ||
     34}}}
     35
     36== Submission Information ==
     37
     38Authors are invited to submit extended abstracts on original, unpublished work in the topic area of this workshop.  Contributions must be submitted in PDF format and should not exceed two (2) pages, including references.  We strongly encourage use of the LaTeX or Microsoft Word style files provided on the workshop Web page.
     39{{{#!comment
     40Submissions should be formatted using the [http://www.acm.org/sigs/publications/proceedings-templates ACM SIG stylefiles].
     41}}}
     42
     43Authors of those papers that are accepted will be invited to submit full papers (up to eight pages) before the workshop itself and these will appear in an online proceedings.
     44
     45
     46== Important dates ==
     47 tba
     48{{{#!comment
     49 * Submission of extended abstract by '''February 3, 2013''' to be made through [https://www.easychair.org/conferences/?conf=wac8 EasyChair]
     50 * Notification of acceptance by February 17
     51 * Submission of full paper by June 23
     52}}}
     53
     54
     55== Organising committee ==
     56 * Stefan Evert, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU)
     57 * Egon Stemle, European Academy of !Bozen/Bolzano (EURAC)
     58 * Paul Rayson, Lancaster University
     59       
     60
     61== Programme committee ==
     62Organising committee plus:
     63 tba
     64{{{#!comment
     65 * ?Adam
     66 * ?Serge
     67 * ?Paul Cook
     68 * ?Silvia Bernardini, U of Bologna, Italy
     69 * ?Cédrick Fairon, UCLouvain, Belgium
     70 * ?William H. Fletcher, U.S. Naval Academy, USA
     71 * ?Gregory Grefenstette, Exalead, France
     72 * ?Igor Leturia, Elhuyar Fundazioa, Basque Country, Spain
     73 * ?Preslav Nakov, National U of Singapore
     74 * ?Jan Pomikalek (Masaryk University)
     75 * ?Reinhard Rapp, U Mainz, Germany
     76 * ?Kevin Scannell, Saint Louis U, USA
     77 * ?Gilles-Maurice de Schryver, U Gent, Belgium
     78 * ?Pierre Zweigenbaum, LIMSI, France
     79}}}