[Corpora-List] python's NLTK vs R's TM

Kevin B. Cohen kevin.cohen at gmail.com
Mon Aug 20 18:56:38 CEST 2012

NLTK probably provides all of the functionality that you will need.

Best wishes,


On Sun, Aug 19, 2012 at 5:04 PM, Matías Guzmán <mortem.dei at gmail.com> wrote:
> Dear all,
> I'm not a very strong programmer but I know a bit of python and a bit of R,
> and I was wandering which is better for corpus work. I'm not interesting in
> creating any fancy language technology thingy, I just need to extract raw
> text from documents off and on-line, analyze them and perform some basic
> statistics on them. Which one would you recommend? should I use both?
> Thanks,
> Matías Guzmán
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora

-- Kevin Bretonnel Cohen, PhD Biomedical Text Mining Group Lead, Computational Bioscience Program, U. Colorado School of Medicine 303-916-2417 (cell) 303-377-9194 (home) http://compbio.ucdenver.edu/Hunter_lab/Cohen

More information about the Corpora mailing list