[Corpora-List] More text added to my USENET corpus. Also: new info on its availability.

Cyrus Shaoul cyrus.shaoul at ualberta.ca
Fri Mar 16 23:19:00 CET 2007

Fellow list members,

I have just uploaded the USENET corpus data from the first three months
of 2007 to our server. This will add approximately 1 billion more
words of text to the archive.

As always, please go this the following URL to download all or part of
the corpus:


Also, after many people requested it, I have been able to set up an
secondary server for those who are connected to non-academic networks.
The only limitation is that this new server can only serve up 1Gb of
data per day (aggregated across all users).
Please report any problems with this new server to me.
(For those one non-academic networks, the new server is automatically
chosen for you when you use the URL above.)



Cyrus Shaoul
University of Alberta

-------------- next part --------------
A non-text attachment was scrubbed...
Name: cyrus.shaoul.vcf
Type: text/x-vcard
Size: 293 bytes
Desc: not available
Url : https://mailman.uib.no/public/corpora-archive/attachments/20070316/517cf340/attachment.vcf

More information about the Corpora-archive mailing list