[Corpora-List] Portuguese thesaurus
adam at lexmasterclass.com
Wed May 4 21:46:00 CEST 2005
Or, you could load a large Portuguese corpus into the Word Sketch Engine,
which then automatically produces a distributional thesaurus: see
<http://sketchengine.co.uk/>. We have processed
corpora, and thesauruses, of this kind for English, Chinese, Czech and Irish.
But, you may say, are distributional thesauruses as good as traditional ones?
Needless to say, it all depends on what you want to do with them. We have some
evidence that, for PP-attachment, for Spanish, a distributional thesaurus
outperforms Spanish WordNet.
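
For anyone unfamiliar with the term, here is a rough sketch of what a distributional thesaurus does: words are treated as similar when they occur in similar contexts. This is an illustration only, not how the Word Sketch Engine computes its thesaurus. The function names, the toy corpus, the two-token context window and the use of cosine similarity are all assumptions made for the example.

```python
from collections import Counter, defaultdict
from math import sqrt


def build_vectors(sentences, window=2):
    """Map each word to counts of the words seen within `window` tokens of it."""
    vectors = defaultdict(Counter)
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def thesaurus_entry(word, vectors, top_n=3):
    """Return the `top_n` words whose context vectors are closest to `word`."""
    target = vectors.get(word)
    if target is None:
        return []  # word never seen in the corpus
    scores = [
        (other, cosine(target, vec))
        for other, vec in vectors.items()
        if other != word
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)[:top_n]


# Toy corpus (hypothetical); a real thesaurus needs a large corpus.
sentences = [
    "o gato bebe leite".split(),
    "o cão bebe água".split(),
    "o gato come peixe".split(),
    "o cão come carne".split(),
]
vectors = build_vectors(sentences)
neighbours = thesaurus_entry("gato", vectors)
print(neighbours)
```

On this toy corpus "cão" comes out as the nearest neighbour of "gato", because the two words share most of their contexts. With a real corpus one would normally lemmatise, filter function words and weight the counts rather than use raw frequencies.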
From: owner-corpora at lists.uib.no [mailto:owner-corpora at lists.uib.no] On
Behalf Of Andrew Harley
Sent: 03 May 2005 09:21
To: Mark Davies
Cc: corpora at uib.no; owner-corpora at lists.uib.no
Subject: Re: [Corpora-List] Portuguese thesaurus
We have a Portuguese version of our Word Selector title available for
licensing in XML form. See
<http://dictionary.cambridge.org/researchers.htm>. There would be an annual
licence fee, the exact amount depending on whether the licensee is one researcher or an
institution, and in this case on some third parties also. The title is only
a "mini-thesaurus" with about 10,000 words grouped into semantic categories,
so may well not have comprehensive enough coverage for your needs. Contact
me directly if interested.
Business Systems & Electronic Product Development Manager
English Language Teaching
Cambridge University Press
- the web's favourite learner dictionaries
owner-corpora at lists.uib.no wrote on 02/05/2005 23:36:01:
> I'm looking for a machine-readable thesaurus of Portuguese. I've
> already tried two links to Portuguese WordNet
> http://www.instituto-camoes.pt/WordNet/index.jsp) but neither is
> operational. I've also tried the links at
> http://www.linguateca.pt/enciclopedias.html, but no luck.
> Thanks in advance.
> Mark Davies
> Assoc. Prof., Linguistics
> Brigham Young University
> (phone) 801-422-9168 / (fax) 801-422-0906
> ** Corpus design and use // Linguistic databases **
> ** Historical linguistics // Language variation **
> ** English, Spanish, and Portuguese **