[Corpora-List] python's NLTK vs R's TM

Michele Filannino michele.filannino at cs.manchester.ac.uk
Mon Aug 20 10:34:44 CEST 2012


Hi Matías,

I would suggest you NLTK for Python. You can start using the book published by O'Reilly, it's very easy and effective. It fits your needs.

Bye, michele.

On Mon, Aug 20, 2012 at 1:04 AM, Matías Guzmán <mortem.dei at gmail.com> wrote:


> Dear all,
>
> I'm not a very strong programmer but I know a bit of python and a bit of
> R, and I was wandering which is better for corpus work. I'm not interesting
> in creating any fancy language technology thingy, I just need to extract
> raw text from documents off and on-line, analyze them and perform some
> basic statistics on them. Which one would you recommend? should I use both?
>
> Thanks,
>
> Matías Guzmán
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
>
>

-- Michele Filannino

CDT PhD student in Computer Science Room IT301 - IT Building The University of Manchester filannim at cs.manchester.ac.uk -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 1683 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20120820/4f4d5689/attachment.txt>



More information about the Corpora mailing list