[Corpora-List] GUM Corpus 7 and Beyond

Janet Liu yl879 at georgetown.edu
Thu Apr 16 16:34:40 CEST 2020


(Apologies for cross-postings)

*** The GUM Corpus - Public Survey ***

*** Georgetown University Multilayer Corpus ***

The Corpling Lab at Georgetown University <http://corpling.uis.georgetown.edu/corpling/> invites you to take part in *this survey* <https://forms.gle/QxZa9fMDRuAzyPTh9>. Your responses will help us understand how GUM is used and which old and new genres its users prefer, and will guide our future choices of genres, available formats, and annotation layers.

Survey Link: https://forms.gle/QxZa9fMDRuAzyPTh9 <https://forms.gle/PgVahmXeH5TFeaG87>

GUM is an open-source corpus of richly annotated English texts from multiple genres: academic, bio, fiction, interview, news, travel, how-to, and Reddit forum discussions. The corpus is created by students as part of the Computational Linguistics curriculum at Georgetown University and is available under Creative Commons licenses. To date, six series of the GUM Corpus have been released, containing nearly 130K tokens annotated on multiple layers. For more information, and to search or download the corpus online, see: https://corpling.uis.georgetown.edu/gum/

We value your opinions and appreciate your participation and help! For full consideration, please respond to the survey by the end of May.

Cheers, Janet

--
Yang (Janet) Liu
PhD Student in Computational Linguistics
Department of Linguistics
Graduate School of Arts & Sciences
Georgetown University
http://janetlauyeung.georgetown.domains/