[Corpora-List] Corpora Digest, Vol 24, Issue 1 (fwd)

Hercules Dalianis hercules at dsv.su.se
Tue Jun 2 21:42:41 CEST 2009


Dear Vasiliki!

Have you checked these Movie reviews pos/neg http://www.cs.cornell.edu/people/pabo/movie-review-data/

Xairetismous

Hrakhlhs

On Mon, 1 Jun 2009, corpora-request at uib.no wrote:


> Today's Topics:
>
> 1. Re: Tagging with synsets? (Ciarán Ó Duibhín)
> 2. Call for papers : JADT 2010 (Anne Dister)
> 3. Re: Tagging with synsets? (Tony Berber Sardinha)
> 4. Corpus in English tagged with opinion or sentiment
> (vrentoumi at iit.demokritos.gr)
> 5. Re: Corpus in English tagged with opinion or sentiment
> (CRuehlemann at aol.com)
> 6. Third CFPs: Workshop Adaptation of LanguageResources and
> Technology to New Domains (Kiril Simov)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Mon, 1 Jun 2009 01:44:57 +0100
> From: Ciarán Ó Duibhín <ciaran at oduibhin.freeserve.co.uk>
> Subject: Re: [Corpora-List] Tagging with synsets?
> To: <corpora at uib.no>
>
> Thank you for all replies on this, which I summarize:
> . FreeLing http://garraf.epsevg.upc.es/freeling/ (Francis Tyers) (compiled
> application)
> . SenseRelate http://senserelate.sourceforge.net; web interface
> http://marimba.d.umn.edu/allwords/allwords.html (Ted Pedersen) (Perl)
> . UKB http://ixa2.si.ehu.es/ukb (Eneko Agirre)
> . SenseLearner http://lit.csci.unt.edu/~senselearner/ incl web interface
> (Rada Mihalcea) (Perl)
> . Graph-based WSD
> http://lit.csci.unt.edu/index.php/Downloads#GWSD:_Unsupervised_Graph-based_Word_Sense_Disambiguation
> (Rada Mihalcea) (Perl)
> . NLTK http://nltk.googlecode.com/svn/trunk/doc/howto/wordnet.html ;
> http://nltk.googlecode.com/svn/trunk/doc/book/ch02.html (Claire Brierley)
> (Python)
>
> I'm pleasantly surprised to find that some of these include implementations
> of algorithms to tag running English text with preferred synsets.
>
> Any of them could involve a struggle to get it working on Windows, but I have
> a few possibilities there.
>
> Thanks again,
> Ciarán Ó Duibhín.
>
>
>
>
>
> -------------- next part --------------
> A non-text attachment was scrubbed...
> Name: not available
> Type: text/html
> Size: 2798 bytes
> Desc: not available
> Url :
> https://mailman.uib.no/public/corpora/attachments/20090601/28a35cba/attachment.txt
>
> ------------------------------
>
> Message: 2
> Date: Mon, 01 Jun 2009 10:13:35 +0200
> From: Anne Dister <anne.dister at uclouvain.be>
> Subject: [Corpora-List] Call for papers : JADT 2010
> To: <corpora at uib.no>
>
> A non-text attachment was scrubbed...
> Name: not available
> Type: text/html
> Size: 4492 bytes
> Desc: not available
> Url :
> https://mailman.uib.no/public/corpora/attachments/20090601/7a8f999c/attachment.txt
>
> ------------------------------
>
> Message: 3
> Date: Mon, 1 Jun 2009 00:42:31 -0300
> From: Tony Berber Sardinha <tony at corpuslg.org>
> Subject: Re: [Corpora-List] Tagging with synsets?
> To: Ciarán Ó Duibhín <ciaran at oduibhin.freeserve.co.uk>, CORPORA
> <corpora at uib.no>
>
> Dear all
>
> Thanks for the thread and for this summary.
>
> I installed freeling, which looks great, but its tags are numerical
> sense codes, for which I can't find an explanation. For example, in
> the sentence 'General Motors is beginning its reinvention', the verb
> 'beginning' is tagged as 00239960. I'd like to know what this tag
> means. Thanks ahead for any pointers.
>
> bye
>
> tony
>
>
>
> On May 31, 2009, at 9:44 PM, Ciarán Ó Duibhín wrote:
>
>> Thank you for all replies on this, which I summarize:
>> ? FreeLing http://garraf.epsevg.upc.es/freeling/ (Francis Tyers)
>> (compiled application)
>> ? SenseRelate http://senserelate.sourceforge.net; web interface
>> http://marimba.d.umn.edu/allwords/allwords.html
>> (Ted Pedersen) (Perl)
>> ? UKB http://ixa2.si.ehu.es/ukb (Eneko Agirre)
>> ? SenseLearner http://lit.csci.unt.edu/~senselearner/ incl web
>> interface (Rada Mihalcea) (Perl)
>> ? Graph-based WSD http://lit.csci.unt.edu/index.php/
>> Downloads#GWSD:_Unsupervised_Graph-based_Word_Sense_Disambiguation
>> (Rada Mihalcea) (Perl)
>> ? NLTK http://nltk.googlecode.com/svn/trunk/doc/howto/wordnet.html ;
>> http://nltk.googlecode.com/svn/trunk/doc/book/ch02.html
>> (Claire Brierley) (Python)
>>
>> I'm pleasantly surprised to find that some of these include
>> implementations of algorithms to tag running English text with
>> preferred synsets.
>>
>> Any of them could involve a struggle to get it working on Windows,
>> but I have a few possibilities there.
>>
>> Thanks again,
>> Ciarán Ó Duibhín.
>>
>>
>>
>>
>>
>> _______________________________________________
>> Corpora mailing list
>> Corpora at uib.no
>> http://mailman.uib.no/listinfo/corpora
>
>
>
>
> ------------------------------
>
> Message: 4
> Date: Mon, 1 Jun 2009 12:36:41 +0300 (EEST)
> From: vrentoumi at iit.demokritos.gr
> Subject: [Corpora-List] Corpus in English tagged with opinion or
> sentiment
> To: CORPORA at UIB.NO
>
> -------------------------------------------------------------------
>
>
> Hello, the lab I am working for is looking to buy a corpus in English that
> has been annotated with opinion or sentiment. Can someone please provide
> relevant information?
>
> Thank you in advance,
>
> Vassiliki Rentoumi
> Phd Student
> NCSR Demokritos,
> IIT (Institute of Informatics and Telecommunications)
> SKEL (Software and Knowledge Laboratory)
> Athens
> Greece
>
>
>
>
>
> ------------------------------
>
> Message: 5
> Date: Mon, 1 Jun 2009 07:31:44 EDT
> From: CRuehlemann at aol.com
> Subject: Re: [Corpora-List] Corpus in English tagged with opinion or
> sentiment
> To: CORPORA at UIB.NO
>
>
> Hi Vassiliki,
>
> you might check out the MPQA Corpus freely available at:
>
> _http://nrrc.mitre.org/NRRC/publications.htm_
> (http://nrrc.mitre.org/NRRC/publications.htm)
>
> described in:
>
>
> Wiebe, Janyce, Theresa Wilson, and Claire Cardie. ?_Annotating
> expressions of opinions and emotions in language?._
> (http://www.cs.pitt.edu/~wiebe/pubs/papers/lre05.pdf) Language Resources and
> Evaluation 39(2-3): 165-210
> (available at:http://nrrc.mitre.org/NRRC/publications.htm)
> Hope this helps
> Chris
> ------------------------------------------------------------------
> Dr. Christoph Rühlemann, Munich
>
> -------------- next part --------------
> A non-text attachment was scrubbed...
> Name: not available
> Type: text/html
> Size: 2607 bytes
> Desc: not available
> Url :
> https://mailman.uib.no/public/corpora/attachments/20090601/80178f8a/attachment.txt
>
> ------------------------------
>
> Message: 6
> Date: Mon, 1 Jun 2009 15:04:17 +0300
> From: "Kiril Simov" <kivs at bultreebank.org>
> Subject: [Corpora-List] Third CFPs: Workshop Adaptation of
> LanguageResources and Technology to New Domains
> To: <corpora at uib.no>
>
> Adaptation of Language Resources and Technology to New Domains
> (AdaptLRTtoND)
> http://www.bultreebank.org/AdaptLRTtoND/
>
> RANLP 2009 Workshop
> http://www.lml.bas.bg/ranlp2009/
>
> Motivation
>
> It is widely acknowledged that despite the great advances in
> Computational Linguistics nowadays, the creation of new
> Language Resources (LR) and Language Technology (LT) for a
> new domain or task is still quite expensive and
> time-consuming. At the same time there are already a lot of
> varieties of LR and LT, developed for various languages and
> purposes. What happens when new tasks come? Do we have to
> develop new resources and technology from the beginning, or
> can we re-use or adapt the existent ones? Last, but not
> least alternative is to combine both strategies depending on
> the task. The first option seems reasonable when richer and
> larger data is needed for the new applications. The second
> option is justified only if such a resource or technology
> does not exist at all, or some new approach is applied. The
> third one is the ever ?compromising?, but also very
> realistic option.
> As the machine learning techniques have matured enough to
> successfully support real applications within various
> domains, a new bottleneck became the requirement for large
> and adequate training data for input. Thus, the NLP
> community faced the question of the relevant LR and LT
> adaptation. It concerns the operability between general
> domain NLP toolkits and specific domain tasks with respect
> to terminology, language, structure, steps of preprocessing
> etc.
> Thus, the Workshop is devoted to various methods for
> transferring the linguistic knowledge and supportive
> technology from the existing language resources in one
> domain into a different one.
>
> Topics
>
> - parameters of adaptivity and re-usability of LR and LT
> - methods for adaptation of existing NLP resources to specific tasks
> - domain specific requirements to the LR and LT
> - general domain vs. specific domain processing
> - profiling LR
> - extrapolation of richer annotations to large data
> - evaluation of adapted LR and LT
>
> Organizers
>
> Núria Bel, Pompeu Fabra University
> Erhard Hinrichs, Tuebingen University (co-chair)
> Petya Osenova, Bulgarian Academy of Sciences and Sofia University
> Kiril Simov, Bulgarian Academy of Sciences (co-chair)
>
> Invited speaker
>
> Jun'ichi Tsujii, University of Tokyo and University of Manchester - NacTeM
>
> Submission details
>
> Authors are invited to submit an extended abstract up to 800
> words. Abstracts should describe existing research connected
> to the topics of the workshop. The following formats are
> accepted: PDF, PS, MS Word, ASCII text. Each submission
> should provide the following information: title; author(s);
> affiliation(s); and contact author's e-mail address, postal
> address.
>
> The abstracts should be sent electronically to:
> Petya Osenova
> Email: petya at bultreebank.org
> by the deadline listed below. The submissions will be
> reviewed by the workshop's programme committee.
>
> The accepted papers will appear in the workshop proceedings.
> The final paper should not exceed 15 A4 pages formatted
> according RANLP09 guidelines
> (http://www.lml.bas.bg/ranlp2009/).
>
>
> Important Dates
>
> Deadline for abstract submission: 7th June 2009
> Notification of acceptance 7th July 2009
> Final version of the papers 23rd August 2004
>
> Program Committee
>
> Núria Bel, Pompeu Fabra University
> Gosse Bouma, Groningen University
> António Branco, Lisbon University
> Walter Daelemans, Antwerp University
> Markus Dickinson, Indiana University
> Erhard Hinrichs, Tuebingen University
> Josef van Genabith, Dublin City University
> Iryna Gurevych, Technische Universität Darmstadt - UKP Lab
> Atanas Kiryakov, Ontotext OOD
> Vladislav Kubon, Charles University
> Sandra Kuebler, Indiana University
> Lothar Lemnitzer, DWDS, Berlin-Brandenburgische Akademie der Wissenschaften
> Bernardo Magnini, FBK
> Detmar Meurers, Tuebingen University
> Paola Monachesi, Utrecht University
> Preslav Nakov, National University of Singapore
> John Nerbonne, Groningen University
> Petya Osenova, Bulgarian Academy of Sciences and Sofia University
> Gabor Proszeky, MophoLogic
> Adam Przepiorkowski, Polish Academy of Sciences
> Marta Sabou, Open University - UK
> Kiril Simov, Bulgarian Academy of Sciences
> Cristina Vertan, Hamburg University
>
>
>
>
> ----------------------------------------------------------------------
> Send Corpora mailing list submissions to
> corpora at uib.no
>
> To subscribe or unsubscribe via the World Wide Web, visit
> http://mailman.uib.no/listinfo/corpora
> or, via email, send a message with subject or body 'help' to
> corpora-request at uib.no
>
> You can reach the person managing the list at
> corpora-owner at uib.no
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Corpora digest..."
>
>
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>
>
> End of Corpora Digest, Vol 24, Issue 1
> **************************************
>

___________________________________________________________________________ Dr. Hercules Dalianis Associate Professor (Docent) Department of Computer and Systems Sciences ph: +46 8 674 75 47 DSV-KTH-Stockholm University mobile ph: +46 70 568 13 59 Forum 100 fax: +46 8 703 90 25 164 40 Kista email: hercules at dsv.su.se Stockholm, Sweden www: http://www.dsv.su.se/~hercules/ ___________________________________________________________________________



More information about the Corpora mailing list