[Corpora-List] Corpus for clustering

Bob Parks bobp at clarityconnect.com
Sat Mar 10 14:49:00 CET 2007


I'm looking for references on how to construct corpora that reflect
documents that use particular concepts and topics. I'm assuming its
necessary to first cluster a larger document set. But how does one
conceptualize the problem of creating the larger set to analyze for a
set of concepts/topics, before the analysis?
Thanks,
Bob Parks
--
* The best dictionary and integrated thesaurus on the web:
http://www.wordsmyth.net
* Robert Parks - Wordsmyth - (607) 272-2190
* "To imagine a language is to imagine a form of life." (LW) And to
imagine new forms of life is to create pathways to the language for
living that life.
* "Philosophers have only interpreted the world. The point, however,
is to change it." (KM) And the best way to change the world is to
first imagine a better form of life, and shape and offer your words
as tools for living that world. This is the highest calling of a
wordsmyth: to enrich the deep structure of communication and
community.





More information about the Corpora-archive mailing list