Changes between Initial Version and Version 1 of WAC5


Ignore:
Timestamp:
12/23/08 11:11:33 (16 years ago)
Author:
Serge Sharoff
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • WAC5

    v1 v1  
     1= Call for Papers =
     2
     3We invite papers on various topics concerning the use of Web resources for corpus research and NLP applications, including (but not limited to) the following:
     4
     5    * linguistic Web crawler technology and Web corpus collection projects
     6    * applications of Web-derived corpora and other kinds of Web data
     7    * how far does the “easy way” get you? (using search engines, or Google's n-gram lists; we are particularly interested in a critical discussion of the usefulness and limitations of such approaches)
     8    * methods and tools for “cleaning” Web pages to turn them into a corpus (contributors to this topic will be encouraged to participate in the second CLEANEVAL competition to be held in 2009)
     9    * automatic linguistic annotation of Web data: tokenisation, POS tagging, lemmatisation, semantic tagging, etc. (established tools often perform very poorly on Web data)
     10    * search engine architectures for linguists: bringing linguistics to commercial search engines, or high-performance search technology to linguistics?
     11    * search engine-related topics such as result ranking (e.g. how to identify “typical” uses rather than returning 50 very similar matches on the first page)
     12    * duplicate detection, interactive query refinement, etc.
     13    * reviews and clever uses of search engine APIs (Google, Yahoo, Altavista, and in particular Microsoft's current generous LiveSearch API)
     14
     15== Submission information ==
     16
     17Authors are invited to submit full papers on original, unpublished work in the topic area of this workshop. Submissions should follow the format of LREC proceedings and should not exceed eight (8) pages, including references. We strongly recommend the use of LREC LaTeX or Microsoft Word style files tailored for this year's conference. Details on the submission procedure will be posted on this Web site shortly.
     18
     19== Programme committee ==
     20
     21    * Silvia Bernardini, U of Bologna, Italy
     22    * Massimiliano Ciaramita, Yahoo! Research Barcelona, Spain
     23    * Jesse de Does, INL, Netherlands
     24    * Katrien Depuydt, INL, Netherlands
     25    * Stefan Evert, U of Osnabrück, Germany
     26    * Cédrick Fairon, UCLouvain, Belgium
     27    * William Fletcher, U.S. Naval Academy, USA
     28    * Gregory Grefenstette, Commissariat à l'Énergie Atomique, France
     29    * Péter Halácsy, Budapest U of Technology and Economics, Hungary
     30    * Katja Hofmann, U of Amsterdam, Netherlands
     31    * Adam Kilgarriff, Lexical Computing Ltd, UK
     32    * Igor Leturia, Elhuyar Fundazioa, Basque Country, Spain
     33    * Phil Resnik, U of Maryland, College Park, USA
     34    * Kevin Scannell, Saint Louis U, USA
     35    * Gilles-Maurice de Schryver, U Gent, Belgium
     36    * Klaus Schulz, LMU München, Germany
     37    * Serge Sharoff, U of Leeds, UK
     38    * Eros Zanchetta, U of Bologna, Italy
     39
     40== Organising committee ==
     41
     42    * Stefan Evert, University of Osnabrück
     43    * Igor Leturia, Elhuyar Fundazioa
     44    * Serge Sharoff, University of Leeds