= 6th Web as Corpus Workshop (WAC-6) =
To be held in association with [http://www2012.org/ WWW12] in Lyon,
17th April 2012

Sponsored by [http://www.sigwac.org.uk ACL SIGWAC]

More and more people are using Web data for linguistic and NLP research. The workshop, the sixth in an annual series, provides a venue for exploring how we can use it effectively and what we will find if we do.
          
          
We invite submissions which:
 * describe Web corpus collection projects, or modules for one part of the process (crawling, filtering, de-duplication, language-id, tokenising, indexing, ...)
 * explore characteristics of Web data from a linguistics/NLP perspective including registers, domains, frequency distributions, comparisons between datasets
 * use crawled Web data for NLP purposes (with emphasis on the data rather than the use)

Previous WAC workshops have been in Europe and Africa. The west coast of the US is the global centre for web development, hosting Google, Microsoft, Yahoo and a thousand others, so we are looking forward to visiting!
          
          
== Call for Papers ==
 * Submission by '''January 15 2012''', to be made through the EasyChair system at https://www.easychair.org/conferences/?conf=www2012
 * Notification of acceptance by January 30
 * Camera-ready copy due February 15

Submissions will be blind-reviewed. They should be formatted using the WWW 2012 style files and must not exceed 8 pages, plus one extra page for references. Each submission will be reviewed by at least two members of the programme committee. Accepted papers will be published in the workshop proceedings.
          
          
== Organising committee ==
 * Adam Kilgarriff (Lexical Computing Ltd.)
 * Jan Pomikalek (Masaryk University)
 * Serge Sharoff (University of Leeds, Workshop Chair)
          
          
== Programme committee ==
Organising committee plus:
 * Silvia Bernardini, U of Bologna, Italy
 * Stefan Evert, U of Osnabrück, Germany
 * Cédrick Fairon, UCLouvain, Belgium
 * William H. Fletcher, U.S. Naval Academy, USA
 * Gregory Grefenstette, Exalead, France
 * Igor Leturia, Elhuyar Fundazioa, Basque Country, Spain
 * Preslav Nakov, National U of Singapore
 * Kevin Scannell, Saint Louis U, USA
 * Gilles-Maurice de Schryver, U Gent, Belgium