[Corpora-List] New techniques in text processing

Adam Kilgarriff adam at lexmasterclass.com
Tue Feb 5 09:05:53 CET 2013

Dear Amaç,

as well as tools you can trust, you need data you can trust. The techniques I describe in "Getting to know your corpus" <http://trac.sketchengine.co.uk/attachment/wiki/AK/Papers/Kilgarriff_TSD2012.pdf?format=raw> are designed to help researchers find the characteristics, quirks and biases of their dataset.

(video version http://www.youtube.com/watch?v=0XvWh6YqgkU)



On 5 February 2013 03:44, Amac Herdagdelen <amac at herdagdelen.com> wrote:

> Dear Corpora Members,
> A colleague of mine who is currently doing his PhD in political science
> asked me this interesting question:
> "One of the hallmarks of Political Science (for better or for worse) is
> importing techniques from other disciplines and figuring out how to make
> them work in the Political context. As far as text processing goes, the latest
> and greatest that we have in our literature is LDA. Is there anything
> new/fun that jumps to mind that I should read up on? LDA papers have won
> our "Best Methodological Innovation" awards 3 years running."
> What are your thoughts? What new things do we have/know to offer other
> fields?
> Best,
> Amaç

--
========================================
Adam Kilgarriff <http://www.kilgarriff.co.uk/>
adam at lexmasterclass.com
Director, Lexical Computing Ltd <http://www.sketchengine.co.uk/>
Visiting Research Fellow, University of Leeds <http://leeds.ac.uk>
*Corpora for all* with the Sketch Engine <http://www.sketchengine.co.uk>
*DANTE: a lexical database for English* <http://www.webdante.com>
========================================
