[Corpora-List] Query about nomenclature
cbrew at acm.org
Sun Mar 6 19:22:01 CET 2005
On Sun, Mar 06, 2005 at 02:56:29PM -0000, John Mckenny wrote:
> Dear CORPORA subscribers
> Should it be N-gram/Ngram/n-gram/ngram? Is there a consensus about
> which of these four to use? Is there a way to measure usage on the Web
> via www.webcorp.org.uk or other meta-engines?
> What comes after bigrams, trigrams? Is it 4grams, 5grams etc. or
> could it be something like quadrigrams, pentagrams, hexagrams? It was
> pointed out to me that pentagram is a Satanist symbol.
The sequence could have been
monogram, digram, trigram, tetragram, pentagram, hexagram, ...
with fairly uniform (Greek) etymology, but someone chose
these look like Latin numerical prefixes, so my guess is that
the intended extrapolation is
which replicates the mixed Latin/Greek etymology of bigram through
the series. Pretty yukky...