[Corpora-List] social media corpus collected during 2013-2015

Jordi Carrera Ventura jordi.carrera.ventura at gmail.com
Mon Sep 7 10:55:40 CEST 2015


<div></div><div> <br/> <br/> <br/>Thanks Rob for providing so many useful links.It would be of valuable asset to my lexical study which is interested in the new usages, new spellings and new words in social media. <br/> <br/> <br/>If anyone is doing the similar study as mine, you can go through the links which provided below. <br/> <br/>I am looking for a few more social media corpus If you know any other social media corpus collected during 2013-2015 and are compiled in English, please let me know. <br/> <br/>Thanks. <br/> <br/>Kayee LEE <br/> <br/>________________________________ <br/>&#23492;&#20214;&#32773;: rob van der goot &lt;robvanderg at live.nl&gt; <br/>&#23492;&#20214;&#26085;&#26399;: 2015&#24180;8&#26376;31&#26085; &#19978;&#21320; 12:53 <br/>&#25910;&#20214;&#32773;: LEE, kayee [12118192d] <br/>&#20027;&#26088;: RE: [Corpora-List] Looking for a social media corpora collected in 2013-2015 (Kayee LEE KA LAM) <br/> <br/>Deat Kayee, <br/> <br/>Those files move all the time, I got the updated links here: <br/>Lexnorm, (is old, before 2013 I think) <br/>&lt;http://people.eng.unimelb.edu.au/tbaldwin/etc/lexnorm_v1.2.tgz&gt;http://people.eng.unimelb.edu.au/tbaldwin/etc/lexnorm_v1.2.tgz <br/>Lexnorm 2015, is not in the overview, but is newer. <br/>https://noisy-text.github.io/files/lexnorm2015.tgz <br/>I think the sms messages are also from before 2013, but if you are still interested: <br/>http://www.comp.nus.edu.sg/~nlp/corpora.html <br/>Pos-tagged tweets: <br/>bit.ly/twitter-bootstrap-corpus <br/> <br/>Another interesting corpus might be the encow corpus (from 2014). <br/>https://webcorpora.org/ <br/>Or you can always collect you own tweets, <br/>https://dev.twitter.com/rest/public <br/> <br/>For some of the corpora you do have to contact the creators. <br/> <br/>Good luck with them, <br/>Rob van der Goot <br/> <br/> <br/>[http://mlm.polyu.edu.hk/intimate/templates/images/PolyU/PolyU_Email_Signature.jpg] <br/> <br/>Disclaimer: <br/> <br/>This message (including any attachments) contains confidential information intended for a specific individual and purpose. If you are not the intended recipient, you should delete this message and notify the sender and The Hong Kong Polytechnic University (the University) immediately. Any disclosure, copying, or distribution of this message, or the taking of any action based on it, is strictly prohibited and may be unlawful. <br/> <br/>The University specifically denies any responsibility for the accuracy or quality of information obtained through University E-mail Facilities. Any views and opinions expressed are only those of the author(s) and do not necessarily represent those of the University and the University accepts no liability whatsoever for any losses or damages incurred or caused to any party as a result of the use of such information. <br/> <br/>[http://mlm.polyu.edu.hk/intimate/templates/images/PolyU/PolyU_Email_Signature.jpg] <br/> <br/>Disclaimer: <br/> <br/>This message (including any attachments) contains confidential information intended for a specific individual and purpose. If you are not the intended recipient, you should delete this message and notify the sender and The Hong Kong Polytechnic University (the University) immediately. Any disclosure, copying, or distribution of this message, or the taking of any action based on it, is strictly prohibited and may be unlawful. <br/> <br/>The University specifically denies any responsibility for the accuracy or quality of information obtained through University E-mail Facilities. Any views and opinions expressed are only those of the author(s) and do not necessarily represent those of the University and the University accepts no liability whatsoever for any losses or damages incurred or caused to any party as a result of the use of such information. <br/>_______________________________________________ <br/>UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora <br/>Corpora mailing list <br/>Corpora at uib.no <br/>http://mailman.uib.no/listinfo/corpora <br/></div> -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 8326 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20150907/25f22064/attachment.txt>


More information about the Corpora mailing list