Specifically, the below, which presumably will mention ties between ancient languages and modern Arabic. "I" is Regina.
Talk Title: "Embracing Language Diversity: Unsupervised Multilingual Learning?"
Talk Abstract: For centuries, the deep connection between human languages has fascinated scholars, and driven many important discoveries in linguistics and anthropology. In this talk, I will show that this connection can empower unsupervised methods for language analysis. The key insight is that joint learning from several languages reduces uncertainty about the linguistic structure of each individual language.
I will present multilingual generative unsupervised models for morphological segmentation, part-of-speech tagging, and parsing. In all of these instances we model the multilingual data as arising through a combination of language-independent and language-specific probabilistic processes. This feature allows the model to identify and learn from recurring cross-lingual patterns to improve prediction accuracy in each language. I will also discuss ongoing work on unsupervised decoding of ancient Ugaritic tablets using data from related Semitic languages.
This is joint work with Benjamin Snyder, Tahira Naseem and Jacob Eisenstein.