Changes between Version 3 and Version 4 of WAC8
- Timestamp:
- 01/15/13 19:55:21 (12 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
WAC8
v3 v4 6 6 Web corpora and other Web-derived data have become a gold mine for corpus linguistics and natural language processing. The Web is an easy source of unprecedented amounts of linguistic data from a broad range of registers and text types. However, a collection of Web pages is not immediately suitable for exploration in the same way a traditional corpus is. 7 7 8 Since the first Web as Corpus Workshop organised at the Corpus Linguistics 2005 Conference, a highly succes ful series of yearly Web as Corpus workshops provides a venue for interested researchers to meet, share ideas and discuss the problems and possibilities of compiling and using Web corpora. After a stronger focus on application-oriented natural language processing and Web technology in recent years – with workshops taking place at NAACL-HLT 2010, 2011 and WWW 2012 – the 8th Web as Corpus Workshop returns to its roots in the corpus linguistics community.8 Since the first Web as Corpus Workshop organised at the Corpus Linguistics 2005 Conference, a highly successful series of yearly Web as Corpus workshops provides a venue for interested researchers to meet, share ideas and discuss the problems and possibilities of compiling and using Web corpora. After a stronger focus on application-oriented natural language processing and Web technology in recent years – with workshops taking place at NAACL-HLT 2010, 2011 and WWW 2012 – the 8th Web as Corpus Workshop returns to its roots in the corpus linguistics community. 9 9 10 10 Accordingly, the leading theme of this workshop is the application of Web data in language research, including linguistic evaluation of Web-derived corpora as well as strategies and tools for high-quality automatic annotation of Web text. We invite papers on all aspects of building and using Web corpora, with a particular focus on (but not limited to) the following: … … 36 36 Authors are invited to submit extended abstracts on original, unpublished work in the topic area of this workshop. Contributions must be submitted in PDF format and should not exceed two (2) pages, including references. We strongly encourage use of the LaTeX or Microsoft Word style files provided on the workshop Web page. 37 37 {{{#!comment 38 Submissions should be formatted using the [http://www.acm.org/sigs/publications/proceedings-templates ACM SIG style files].38 Submissions should be formatted using the [http://www.acm.org/sigs/publications/proceedings-templates ACM SIG style files]. 39 39 }}} 40 40