[Corpora-List] A tool for corpus management?

Normand Peladeau peladeau
Wed Aug 25 17:09:39 CEST 2010


If you don't mind paying for commercial software, I suggest you have a look at our text analysis suite, QDA Miner and WordStat.

QDA Miner is a document management and coding tool that lets you tag and annotate documents, run searches, and compute statistics on tags (frequency, co-occurrences, comparisons, sequences). It can import MS Word, WordPerfect, Rich Text, HTML, plain text, and PDF files (most of them), as well as structured data files such as Excel, MS Access, SPSS, and many other database and spreadsheet formats, allowing you to combine text with metadata (numerical, categorical, dates, etc.). QDA Miner offers several text search tools, including Boolean searches, query by example, section retrieval, and keyword searches. Tags may be automatically applied to retrieved text segments.

WordStat is an add-on that performs text analysis on QDA Miner project files. It can compute word frequencies, extract phrases, and apply text mining techniques to identify themes and patterns, and it also supports automatic categorization and classification of words, word patterns, phrases, and proximity rules. In a few days we will release version 6.1 of WordStat and a maintenance update of QDA Miner. Both programs will offer major speed improvements over the currently available versions. For example, WordStat 6 will analyse up to 15 million words per minute (a 50% speed increase). WordStat 6 integrates several language dictionaries, two English thesauri, and WordNet to support the dictionary building process (what others may call "taxonomy building"). Version 6.1 will introduce thesauri for French, Spanish, German, and Portuguese.

You can get more information about the software from our web site at:

http://www.provalisresearch.com

We also have Flash demos of both programs:

http://www.provalisresearch.com/wordstat/WordStatFlashDemo.html

http://www.provalisresearch.com/QDAMiner/Flash/DemoQDA.htm

Normand Peladeau Provalis Research

At 8/25/2010 07:06 AM, Mahdi Mohseni wrote:
>Dear Colleagues,
>
>I need a tool for managing a corpus with the following capabilities:
> * Adding text files to the corpus
> * Editing files
> * Annotating words
> * Searching
> * Reporting statistics of words and tags
>Would you please introduce me a suitable tool?
>
>Best,
>Mahdi Mohseni
>


