[Corpora-List] New COCA-based resource: www.wordandphrase.info

Mark Davies Mark_Davies at byu.edu
Tue Jan 3 00:16:02 CET 2012


It's weirdnesses with Javascript and CSS in Opera. But IE, Firefox, Chrome, and Safari (both Mac and iPad) work fine. Sorry for the exclusion of Opera...

Mark D.

============================================ Mark Davies Professor of Linguistics / Brigham Young University http://davies-linguistics.byu.edu/

** Corpus design and use // Linguistic databases ** ** Historical linguistics // Language variation ** ** English, Spanish, and Portuguese ** ============================================

________________________________________ From: corpora-bounces at uib.no [corpora-bounces at uib.no] on behalf of Amac Herdagdelen [amac at herdagdelen.com] Sent: Monday, January 02, 2012 2:15 PM To: corpora at uib.no Subject: Re: [Corpora-List] New COCA-based resource: www.wordandphrase.info

Thanks for the new resource! Is there any particular reason why the interface doesn't support the Opera Web Browser? If I choose to mask my browser (Opera 11.60) as Firefox the site seems to be working perfectly. Probably, there is no reason to single out Opera.

Amaç


> Message: 3
> Date: Mon, 2 Jan 2012 14:53:56 +0000
> From: Mark Davies <Mark_Davies at byu.edu>
> Subject: [Corpora-List] New COCA-based resource:
> www.wordandphrase.info
> To: "corpora at uib.no" <corpora at uib.no>
>
> The following might be of interest to those who use corpora for language teaching and learning, and perhaps for those interested in lexicography. For those interested in using corpora to teach English for Academic Purposes (EAP), you might take a look at the note at the end.
>
> ----------------------------------------------------------------------------------------------
>
> We have just released a new interface for the 425 million word Corpus of Contemporary American English (COCA):
>
> http://www.wordandphrase.info
>
> Even more so than the standard COCA interface (which will continue to be available), the new website is designed to provide information on many different aspects of a word and its usage -- all on one screen. Users can browse through the frequency listing (lemmas 1-60,000 in the corpus) or look for specific words, and then for any matching words they can see:
>
> -- the definition(s) of the word (based on WordNet)
> -- the overall frequency in the 425 million word corpus, and its rank (1-60,000)
> -- the frequency in each of the five main genres -- spoken, fiction, magazines, newspapers, and academic
> -- 20-30 collocates, which of course provide useful insight into meaning and usage
> -- 200 concordance lines (re-sortable), which provide insight into the patterns in which the word occurs
> -- synonyms (grouped by meaning and sorted by frequency); can click to see the entries for related words
> -- WordNet entries, showing related words with a more specific or a more general meaning
>
> As noted, all of this information is displayed together on one screen, with extensive links from one word to another. For example, you can click on any of the 20-30 collocates or any word in the concordance lines, to generate a new concordance display for a specific node/collocate pair. Or you can click on any of the synonyms or the WordNet entries to generate a new display, and thus follow a "chain" of related words.
>
> If you are interested in English words and their frequency, genre distribution, meaning, the relationship to related words, and the patterns in which a word occurs, we believe that this new resource will be quite useful for you in your teaching, learning, and research. And as always, it is available for free -- no annual subscription fees for individuals or institutions.
>
> As a final note, we might mention that in the next month or two we'll be releasing a related resource -- a special version of www.wordandphrase.info that is oriented to English for Academic Purposes (EAP). Same functionality as above, but limited to just the 85 million words of academic texts in COCA. Based on words with a much higher frequency in the 85 million words of academic texts in COCA than in other genres, with frequency by academic sub-genre (medical, legal, education, social sciences, humanities, etc), and all collocates and concordance lines limited to just the academic genre.
>
> ============================================
> Mark Davies
> Professor of Linguistics / Brigham Young University
> http://davies-linguistics.byu.edu/
>
> ** Corpus design and use // Linguistic databases **
> ** Historical linguistics // Language variation **
> ** English, Spanish, and Portuguese **
> ============================================
>
>
> ------------------------------
>
> Message: 4
> Date: Mon, 02 Jan 2012 14:05:48 +0100
> From: WILLIAMS Geoffrey <williams at univ-ubs.fr>
> Subject: Re: [Corpora-List] Colour usage by British, American and
> Russian writers
> To: corpora at uib.no
>
> Dear colleague,
>
> I take it you know of "Colouring Meaning: collocation and connotation in
> figurative meaning" (Gill Philip, Benjamins 2011. This gives an
> excellent corpus linguistic perspective.
>
> Best regards
>
> Geoffrey Williams
>
> Le 02/01/2012 11:50, Yuri Tambovtsev a écrit :
>> Dear Corpora colleagues, I study the frequency of the usage of
>> different colours in the texts of various British, American and
>> Russian writers. Do you know any articles or books dealing with this
>> theme? Looking forward to hearing from you soon to yutamb at mail.ru
>> <mailto:yutamb at mail.ru> Be well Yuri Tambovtsev, Novosibirsk, Russia
>>
>>
>> _______________________________________________
>> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
>> Corpora mailing list
>> Corpora at uib.no
>> http://mailman.uib.no/listinfo/corpora
>
> --
> -------------- next part --------------
> A non-text attachment was scrubbed...
> Name: not available
> Type: text/html
> Size: 1997 bytes
> Desc: not available
> URL: <https://mailman.uib.no/public/corpora/attachments/20120102/29a1cc53/attachment.txt>
> -------------- next part --------------
> A non-text attachment was scrubbed...
> Name: signatureG-WILLIAMS.jpg
> Type: image/jpeg
> Size: 54035 bytes
> Desc: not available
> URL: <https://mailman.uib.no/public/corpora/attachments/20120102/29a1cc53/attachment-0001.jpg>
>
> ------------------------------
>
> Message: 5
> Date: Mon, 2 Jan 2012 16:08:44 -0500
> From: Nancy Ide <ide at cs.vassar.edu>
> Subject: [Corpora-List] 2nd CFP: Collaborative Resource Development
> and Delivery
> To: CORPORA <corpora at uib.no>, humanist at princeton.edu, "A list for
> those interested in open data in linguistics."
> <open-linguistics at lists.okfn.org>, FLaReNet Members Mailing List
> <flarenet_members at ilc.cnr.it>, elsnet-list <elsnet-list at elsnet.org>,
> Priscilla Rasmussen <acl at aclweb.org>, ln at cines.fr,
> sigann at cs.vassar.edu
>
>
> SECOND CALL FOR PAPERS
>
> LREC Workshop
> ***********************************************
> Collaborative Resource Development and Delivery
> ***********************************************
>
> Lütfi Kirdar Istanbul Exhibition and Congress Centre
> Istanbul, Turkey
> May 27, 2012
> http://www.anc.org/Collaborative_Resource_Development
>
>
> To answer the growing need and lower the costs of resource creation and enhancement,
> there is a movement within the community toward collaborative resource development,
> including collaborative corpus annotation and collective creation/enhancement of
> lexical resources and knowledge bases. Collaborative development encompasses both
> engaging the community in annotation and development of common resources, as
> well as crowd-sourcing and similar solutions.
>
> This workshop seeks contributions in all dimensions of collaborative resource
> development and delivery, with a specific focus on case studies and lessons
> learned. We invite submissions that address but are not limited to the following
> topics:
> - Web services and platforms for collaborative resource development and
> distribution;
> - Crowd sourcing for resource development, including studies of efficacy;
> - Strategies and issues for open resource distribution;
> - Evaluation of collaboratively developed resources;
> - Position papers outlining issues and proposing solutions for community-based
> collaborative resource development and/or delivery.
>
> ---------------
> Special Session
> ---------------
>
> The workshop will include a special session devoted to means and considerations
> for community-based linguistic annotation, with a special emphasis on the
> Manually Annotated Sub-Corpus (MASC) (http://www.anc.org/MASC). We invite
> submissions to this session on the following topics:
>
> - position papers concerning any aspect of collaborative resource development,
> including means to get the community fully invested in such efforts;
> - case studies describing collaborative development efforts, including assessment
> of what works and what doesn't;
> - results obtained using collaboratively developed resources;
> - the role of standards and best practices in collaboratively developed resources
> and contributed annotations.
>
> Special consideration will be given to contributions that have used MASC data in
> a way that highlights the benefits of community-based annotation.
>
> ----------------------
> Submission Information
> ----------------------
>
> Submissions may be long papers or short papers, following the formatting
> guidelines for submissions to the main conference given at
> http://www.lrec-conf.org/lrec2012/. All submissions should be made using the
> START system at https://www.softconf.com/lrec2012/CollaborativeDev2012/.
>
> Submitters will be asked to provide relevant information about resources used
> for or resulting from the work described in their papers to the LRE Map
> (http://www.lrec-conf.org/lrec2102/?LREMap) and to contribute to the
> Language Library initiative.
>
> ---------------
> Important Dates
> ---------------
>
> Submissions due: February 15, 2012
> Acceptance notification to authors: March 15, 2012
> Camera ready due: April 1, 2012
> Workshop: May 27, 2012
>
> -------------------
> Workshop Organizers
> -------------------
>
> Nancy Ide, Vassar College, USA
> Collin Baker, ICSI/UC Berkeley, USA
> Christiane Fellbaum, Princeton University, USA
> Rebecca Passonneau, Columbia University, USA
>
> CONTACT: collaboration-workshop at anc.org
>
> -----------------
> Program Committee
> -----------------
>
> Nicoletta Calzolari, ILC/CNR, Italy
> Bob Carpenter, Alias I,Inc., USA
> Chris Cieri, LDC, University of Pennsylvania, USA
> Mona Diab, Columbia University, USA
> Bill Dolan, Microsoft Corp., USA
> Christiane Fellbaum, Princeton University, USA
> Dan Flickinger, Stanford University, USA
> Terry Langendoen, University of Arizona, USA
> Rebecca Passonneau, Columbia University, USA
> Massimo Poesio, University of Trento, Italy
> Sameer Pradhan, BBN Technologies, USA
> James Pustejovsky, Brandeis University, USA
> Owen Rambow, Columbia University, USA
> Manfred Stede, Universitat Potsdam, Germany
>
> -------------- next part --------------
> A non-text attachment was scrubbed...
> Name: not available
> Type: text/html
> Size: 12288 bytes
> Desc: not available
> URL: <https://mailman.uib.no/public/corpora/attachments/20120102/c889d8ec/attachment.txt>
>
> ----------------------------------------------------------------------
> Send Corpora mailing list submissions to
> corpora at uib.no
>
> To subscribe or unsubscribe via the World Wide Web, visit
> http://mailman.uib.no/listinfo/corpora
> or, via email, send a message with subject or body 'help' to
> corpora-request at uib.no
>
> You can reach the person managing the list at
> corpora-owner at uib.no
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Corpora digest..."
>
>
> _______________________________________________
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>
>
> End of Corpora Digest, Vol 55, Issue 2
> **************************************

_______________________________________________ UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora Corpora mailing list Corpora at uib.no http://mailman.uib.no/listinfo/corpora



More information about the Corpora mailing list