[Corpora-List] occurrence of phonemes in texts of world languages

Yuri Tambovtsev yutamb at mail.cis.ru
Sat Jan 14 14:46:00 CET 2006


Dear Corpora colleagues,
I compute the frequency of occurrence of phonemes in world languages. The frequencies are computed from texts and dictionaries: I enter a text into my computer and then count how many times each sound occurs. I have processed some Finno-Ugric, Turkic, Paleo-Asiatic, Australian Aboriginal, Polynesian, etc. languages, and also some American Indian languages: Totonac, Nahuatl, Sayula Popoluca, Pocomchi, Capanahua, and 20 more. Which American Indian or other language do you study? Have the frequencies of its phonemes in texts been computed? Could we process some texts in your language? I can do it if you send me a text on paper or in electronic form, but only as a simple .txt or .doc file.
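A minimal sketch of such a count, under loud assumptions: each letter of the input is treated as one phoneme (real work would need a phonemic transcription), and the labial set below is a hypothetical one chosen only for illustration. The function names are likewise illustrative and not taken from any existing program.

```python
from collections import Counter

# Hypothetical labial inventory; the real set depends on the language
# and on the transcription used.
LABIALS = set("pbmfvw")


def phoneme_frequencies(text):
    """Return the percentage frequency of each alphabetic symbol in text.

    Assumption: one letter stands for one phoneme.
    """
    symbols = [ch for ch in text.lower() if ch.isalpha()]
    total = len(symbols)
    if total == 0:
        return {}
    counts = Counter(symbols)
    return {symbol: 100.0 * n / total for symbol, n in counts.items()}


def class_share(freqs, symbol_class):
    """Sum the percentages of all symbols belonging to symbol_class."""
    return sum(p for symbol, p in freqs.items() if symbol in symbol_class)


sample = "papa bebe mama"  # toy input, not real language data
freqs = phoneme_frequencies(sample)
labial_share = class_share(freqs, LABIALS)
print(f"labial consonants: {labial_share:.2f}%")
```

Percentages obtained this way for two texts can then be set side by side, provided both texts use the same transcription conventions.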
After that it is interesting to compare, for instance, the occurrence of labial consonants in Totonac (7.38%) and Pocomchi (10.83%), or Nahuatl (11.73%) and Sayula Popoluca (12.34%), or Guarani (12.92%) and Sweet Grass Cree (15.15%), and so on. These values can also indicate typological similarity and closeness between languages. If you know a linguist who may be interested in co-operating with me on this problem, please forward my message to that scholar with my new, correct address: yutamb at mail.ru
Please do not send me links to web-sites, since my computer system cannot open them. I cannot open attachments either, only plain messages like this one. I look forward to hearing from you soon at yutamb at mail.ru
Yours sincerely,
Yuri Tambovtsev, Novosibirsk Pedagogical University, Novosibirsk, Russia
