> Thank you, Martin, for your reply. Apologies for not mentioning the
> details. I've a dataset of names of persons from this
> <http://www.cis.upenn.edu/~ccb/publications/transliterating-from-all-languages.pdf>
> paper which were mined from Wikipedia. I'm considering a subset of those
> languages, and focusing more on these particular languages: pap, jbo, mhr,
> fur, ilo, rue, or, bcl and hopefully soon hi.

I have a list of 1000 candidates for each name, and I'm building a model to predict the correct transliteration from that list, which is where the ranker comes into play. So far, I've only used the features that come with the output for each of the 1000 candidates (generated by Joshua). I'm now looking for other features I could use, particularly those from "Named Entity Transliteration and Discovery in Multilingual Corpora" by Klementiev and Roth <http://klementiev.org/publications/learningmt08.pdf>. I hope it's clearer now what I intend to do.
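
To make the setup concrete, here is a minimal sketch of what I mean by reranking, not my actual code. It assumes each candidate arrives as an n-best line of the form `id ||| candidate ||| name=value ... ||| score`; the feature names (`lm_0`, `tm_pt_0`), the weights, and the extra `len_ratio` feature are all placeholders I made up for illustration. In particular, `len_ratio` is not taken from the Klementiev and Roth paper; it only shows where an additional feature would plug in.

```python
from typing import Dict, List, Tuple


def parse_nbest_line(line: str) -> Tuple[str, Dict[str, float]]:
    """Split one n-best line into (candidate, features).

    Assumed layout: "id ||| candidate ||| name=value ... ||| score".
    """
    fields = [field.strip() for field in line.split("|||")]
    candidate = fields[1]
    features: Dict[str, float] = {}
    for pair in fields[2].split():
        name, value = pair.split("=", 1)
        features[name] = float(value)
    return candidate, features


def length_ratio(source: str, candidate: str) -> float:
    """Hypothetical extra feature: shorter length divided by longer length."""
    longer = max(len(source), len(candidate))
    if longer == 0:
        return 1.0
    return min(len(source), len(candidate)) / longer


def rerank(source: str, nbest_lines: List[str],
           weights: Dict[str, float]) -> List[str]:
    """Score each candidate with a linear model and return them best first."""
    scored = []
    for line in nbest_lines:
        candidate, features = parse_nbest_line(line)
        # Add the illustrative feature next to the decoder's own features.
        features["len_ratio"] = length_ratio(source, candidate)
        # Features without a weight contribute nothing to the score.
        score = sum(weights.get(name, 0.0) * value
                    for name, value in features.items())
        scored.append((score, candidate))
    scored.sort(key=lambda item: item[0], reverse=True)
    return [candidate for _, candidate in scored]


if __name__ == "__main__":
    nbest = [
        "0 ||| martinnnnnn ||| lm_0=-1.5 tm_pt_0=-1.0 ||| -2.5",
        "0 ||| martin ||| lm_0=-2.0 tm_pt_0=-1.0 ||| -3.0",
    ]
    weights = {"lm_0": 1.0, "tm_pt_0": 1.0, "len_ratio": 2.0}
    print(rerank("Martin", nbest, weights))
```

In practice the weights would be learned from the names with known transliterations rather than set by hand; the fixed dictionary above is only there to keep the sketch self-contained.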

Thank you!

