Changes between Version 13 and Version 14 of WikiStart

Timestamp: 12/23/08 11:07:18 (18 years ago)
Author: Serge Sharoff
Comment: --

WikiStart

-              v13
+              v14
   * [http://cental.fltr.ucl.ac.be/wac3 WAC3, Louvain-la-Neuve, Belgium, 15-16 September 2007 ]
   * [http://webascorpus.sourceforge.net/PHITE.php?sitesig=CONF&page=CONF_40_WAC-4___lb__2008__rb__ WAC4 at LREC, Marrakech, Morocco, 1 June 2008]
+  * WAC5 is scheduled for 8 September 2009, San Sebastian, Spain
+We invite papers on various topics concerning the use of Web resources for corpus research and NLP applications, including (but not limited to) the following:
+    * linguistic Web crawler technology and Web corpus collection projects
+    * applications of Web-derived corpora and other kinds of Web data
+    * how far does the “easy way” get you? (using search engines, or Google's n-gram lists; we are particularly interested in a critical discussion of the usefulness and limitations of such approaches)
+    * methods and tools for “cleaning” Web pages to turn them into a corpus (contributors to this topic will be encouraged to participate in the second CLEANEVAL competition to be held in 2009)
+    * automatic linguistic annotation of Web data: tokenisation, POS tagging, lemmatisation, semantic tagging, etc. (established tools often perform very poorly on Web data)
+    * search engine architectures for linguists: bringing linguistics to commercial search engines, or high-performance search technology to linguistics?
+    * search engine-related topics such as result ranking (e.g. how to identify “typical” uses rather than returning 50 very similar matches on the first page)
+    * duplicate detection, interactive query refinement, etc.
+    * reviews and clever uses of search engine APIs (Google, Yahoo, Altavista, and in particular Microsoft's current generous LiveSearch API)
+  * [wiki:WAC5] is scheduled for 8 September 2009, San Sebastian, Spain
 == Activities ==