[Corpora-List] Considering Distributions Across Texts

Andrew Caines andrewcaines7 at gmail.com
Mon Mar 3 11:38:08 CET 2014

Hi Brian, Jurafsky & Martin discuss term frequency and document frequency in their textbook; others may know of specific research papers that have investigated this. Andrew

On 28 February 2014 16:16, Brian Schanding <bschanding at gmail.com> wrote:

> Hello,
> I'm working on research with learner corpora. My corpora aren't that big
> (approx. 250,000 wds with about 300-400 text files). I wonder what
> research/textbook sources anyone can point me to that discuss the
> importance of considering how many texts in the corpus a language feature
> occurs in (as opposed to merely considering overall frequency of a language
> feature within a corpus).
> Many Thanks!
> Brian
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> http://mailman.uib.no/listinfo/corpora
-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 1531 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20140303/1e60c813/attachment.txt>

More information about the Corpora mailing list