= ACL SIGWAC home page =

The Special Interest Group of the [http://www.aclweb.org/ Association for Computational Linguistics (ACL)] on '''Web as Corpus'''.

== Objectives ==

 * to promote interest in the use of the web as a source of linguistic data, and as an object of study in its own right;
 * to provide members of the ACL with a special interest in the web-as-corpus with a means of exchanging news of recent research developments and other matters of interest;
 * to sponsor meetings and workshops on the web as corpus that appear to be timely and worthwhile.

== Meetings ==

 * [http://sslmit.unibo.it/~baroni/web_as_corpus_cl05.html WAC1, at Corpus Linguistics conference, Birmingham, UK, July 2005]
 * [http://sslmit.unibo.it/~baroni/web_as_corpus_eacl06.html WAC2, at EACL, Trento, Italy, April 2006]
 * [http://cental.fltr.ucl.ac.be/wac3 WAC3, Louvain-la-Neuve, Belgium, 15-16 September 2007]
 * [http://webascorpus.sourceforge.net/PHITE.php?sitesig=CONF&page=CONF_40_WAC-4___lb__2008__rb__ WAC4 at LREC, Marrakech, Morocco, 1 June 2008]
 * [wiki:WAC5], at SPLN, San Sebastian, Basque Country, Spain, 7 September 2009
 * [wiki:WAC6], at NAACL-HLT, Los Angeles, USA, 5 June 2010: programme [wiki:WAC6Programme here]
 * [http://www.limsi.fr/~pz/bucc2011-comparable-corpora/ BUCC, Building and Using Comparable Corpora, Portland, Oregon, 24 June 2011]. In 2011 we will meet at the BUCC workshop at [http://www.acl2011.org/ ACL2011]
 * [wiki:WAC7], at [http://www2012.wwwconference.org/ WWW12], Lyon, France, 17 April 2012

== Activities ==

 * [http://cleaneval.sigwac.org.uk/ CLEANEVAL], a competition for cleaning webpages
 * Mailing list:
   * sign up [http://devel.sslmit.unibo.it/mailman/listinfo/sigwac here]
   * address to send mail to: sigwac at sslmit.unibo.it

== Officers ==

 * Chair: [http://www.linguistik.uni-erlangen.de/wir-ueber-uns/personal.shtml/stefan-evert.shtml Stefan Evert] (email: stefan.evert(at)linguistik.uni-erlangen.de)
 * Secretary: [http://www.eurac.edu/staff/estemle/ Egon W. Stemle] (email: egon.stemle(at)eurac.edu)

Constitution [attachment:wiki:WikiStart:constitution.txt?format=raw here].

== Useful resources ==

 * [http://webascorpus.sf.net/ Stefan Evert's WAC website]
 * [http://webascorpus.org/ Bill Fletcher's WAC website]
 * [http://www.sketchengine.co.uk/ Web corpora on Sketchengine]
 * [http://corpus.leeds.ac.uk/internet.html Web corpora on CTS website]
 * [http://wacky.sslmit.unibo.it/ WACKY in Forli]
 * [http://purl.org/net/webgenres A wiki on webgenres]