[Corpora-List] (Large) dataset with sentence-level semantic phenomena

lasha abzianidze lasha.abzianidze at gmail.com
Fri Jan 25 12:58:30 CET 2019


Hi Mostafa,

In addition to the already suggested Groningen Meaning Bank (GMB, <http://gmb.let.rug.nl/>), you can check the Parallel Meaning Bank (PMB, <http://pmb.let.rug.nl/>), an active project and the successor of the GMB. In both meaning banks you can find texts annotated with deep formal semantics *a la* Discourse Representation Theory.

Best,
Lasha

On Fri, Jan 25, 2019 at 12:00 PM <corpora-request at uib.no> wrote:


> Send Corpora mailing list submissions to
> corpora at uib.no
>
> To subscribe or unsubscribe via the World Wide Web, visit
> https://mailman.uib.no/listinfo/corpora
> or, via email, send a message with subject or body 'help' to
> corpora-request at uib.no
>
> You can reach the person managing the list at
> corpora-owner at uib.no
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Corpora digest..."
> Today's Topics:
>
> 1. Registration is open: Corpus Linguistics Summer School,
> Birmingham, UK, 24-28 June 2019 (Florent Perek)
> 2. Final CfP: Algorithms and Systems for Digital Humanities at
> Euro-Par 2019 (Tristan Miller)
> 3. Discourse Relation Parsing and Treebanking (DISRPT
> 2019):Second call for papers and shared task (dasdebop)
> 4. Re: (Large) dataset with sentence-level semantic phenomena
> (Karim BOUZOUBAA)
> 5. Re: (Large) dataset with sentence-level semantic phenomena
> (Katrin Erk)
>
>
>
> ---------- Forwarded message ----------
> From: Florent Perek <florent.perek at gmail.com>
> To: corpora at uib.no
> Cc:
> Bcc:
> Date: Thu, 24 Jan 2019 11:37:14 +0000
> Subject: [Corpora-List] Registration is open: Corpus Linguistics Summer
> School, Birmingham, UK, 24-28 June 2019
> Registration to the 4th Corpus Linguistics Summer School is now open. The
> summer school will take place at the University of Birmingham, UK, from 24
> to 28 June 2019.
>
> Attendance is open to all on a first come, first served basis, upon
> payment of the registration fee through the university’s online shop.
> Places are limited and tend to sell out quickly, so potential delegates are
> advised to book early.
>
> Registration fee: £150 student, £190 non-student
>
> To register and for more information, please visit
> https://www.birmingham.ac.uk/research/activity/corpus/events/2019/summer-school-2019.aspx
>
>
>
>
>
>
>
>
> ---------- Forwarded message ----------
> From: Tristan Miller <miller at ukp.informatik.tu-darmstadt.de>
> To: corpora <corpora at uib.no>
> Cc:
> Bcc:
> Date: Thu, 24 Jan 2019 16:07:50 +0100
> Subject: [Corpora-List] Final CfP: Algorithms and Systems for Digital
> Humanities at Euro-Par 2019
> FINAL CALL FOR PAPERS
> =====================
>
> 25th International European Conference on Parallel and Distributed
> Computing (Euro-Par 2019)
> Topic 14: Algorithms and Systems for Digital Humanities
>
> August 26–30, 2019
> Göttingen, Germany
>
> <
> http://2019.euro-par.org/contributors/topics/topic-14-new-algorithms-and-systems-for-digital-humanities/
> >
>
>
> Description
> -----------
>
> The Digital Humanities aims to advance the effective and efficient use
> of computational methods and digital data as tools for original research
> questions in the humanities. In recent years, projects such as DARIAH,
> CLARIN and CESSDA have pushed technologies and services that support the
> FAIR principles of Open Science (data registries, identifier services,
> metadata specifications) as well as tools that support the domain
> scientists. Many of these tools and services are hosted at computing
> centers, aim to address the huge heterogeneity of data and tasks, and
> aim to increase access to and reuse of research data via collaboration
> and crowdsourcing. Based on these experiences, the DH track at
> Euro-Par 2019 casts its sights on the years to come and how the
> relationship between DH and computing centers will evolve in the near
> future. To this end, we solicit contributions on the following topics.
>
>
> Focus
> -----
>
> * Applications of computing center and cloud resources in Digital
> Humanities projects
>
> * Applications of large-scale batch computing for Digital Humanities
> research questions on HPC resources
>
> * Applications of small-scale on-demand computational resources and
> serverless computing for Digital Humanities
>
> * Applications of hosting of interactive services and tools for the
> Digital Humanities in the cloud or at computing centers
>
> * Automatically locating and processing distributed data for Digital
> Humanities use cases
>
> * Digital Humanities applications with special hardware requirements
> that can be provided by a computing center (GPUs, server-side VR
> rendering, etc.)
>
> * Applications of MPI clusters in the Digital Humanities
>
> * Growth of data in the Digital Humanities (text, images, videos, 3D
> models, multi-modal data, etc.)
>
> * Management of heterogeneous discipline- and task-specific metadata
>
> * Sustainability models for research software and online services in
> the Digital Humanities
>
> * Obstacles in the technical infrastructures that affect Digital
> Humanities (may range from disk/network latency issues over memory
> capacities, authentication and authorization, to accounting)
>
>
> Organizers
> ----------
>
> Global Chair: Iryna Gurevych (Technische Universität Darmstadt,
> Ubiquitous Knowledge Processing Lab)
>
> Local Chair: Marco Büchler (Leibniz Institute of European History Mainz,
> Digital Historical Research)
>
> Co-Chair: Sayeed Choudhury (Sheridan Libraries of Johns Hopkins
> University, Baltimore, USA)
>
>
> Submission details
> ------------------
>
> Initial paper submission: February 15, 2019
>
> Author notification: May 3, 2019
>
> Camera-ready submission: June 12, 2019
>
> Conference: August 26–30, 2019
>
> Submissions should be in PDF format, use Springer's LNCS style, and not
> exceed 12 pages (including references). Please refer to the Submission
> Guidelines at
> <http://2019.euro-par.org/contributors/call-for/paper-call/> for further
> instructions and for a link to the online submission system.
>
>
> --
> Tristan Miller, Research Scientist
> Ubiquitous Knowledge Processing Lab (UKP-TUDA)
> Department of Computer Science, Technische Universität Darmstadt
> Tel: +49 6151 162 5296 | Web: https://www.ukp.tu-darmstadt.de/
>
>
>
>
>
>
> ---------- Forwarded message ----------
> From: dasdebop <dasdebop at hu-berlin.de>
> To: corpora at uib.no
> Cc: ddas at sfu.ca, dasdebop at hu-berlin.de
> Bcc:
> Date: Tue, 22 Jan 2019 10:26:44 +0100
> Subject: [Corpora-List] Discourse Relation Parsing and Treebanking (DISRPT
> 2019):Second call for papers and shared task
> (apologies for cross-postings)
>
> Second Call for Papers and Shared Task - Discourse Relation Parsing and
> Treebanking (DISRPT 2019)
>
> 7th Workshop on Rhetorical Structure Theory and Related Formalisms
> In conjunction with: NAACL 2019, June 6
>
> https://sites.google.com/view/disrpt2019
>
> Study of coherence relations in frameworks such as RST, SDRT, and PDTB
> has experienced a revival in the last few years, in English and many
> other languages. Multiple sites are now actively engaged in the
> development of discourse parsers as a goal in itself, but also for
> applications such as sentiment analysis, argumentation mining,
> summarization, question answering, or machine translation evaluation. At
> the same time, evaluation of results in discourse parsing has proven
> complicated, and progress in integrating results across discourse
> treebanking frameworks has been slow.
>
> DISRPT 2019 follows a series of biennial events on discourse relation
> studies, which were initially focused especially on RST. The 2019
> workshop aims to broaden the scope of discussion to include different
> discourse theories (especially, but not limited to, RST, SDRT, and
> PDTB). We are interested in applied papers with a computational
> orientation, resource papers and work on discourse parsing, as well as
> papers that advance the field with novel theoretical contributions and
> promote cross-framework fertilization.
>
> We invite submissions on the following and related topics, handling any
> language(s), and especially under-represented ones:
>
> Discourse relations (segmentation, relation inventory, cognitive
> status of relations)
> Discourse parsing in any formalism, including shallow and deep
> discourse parsing
> Relation signaling (connectives and any other signals) and
> annotation
> Applications of coherence relations in NLP
>
> Invited Speaker
>
> The invited speaker for the workshop will be Bonnie Webber (Institute
> for Language, Cognition, and Computation, University of Edinburgh) -
> title: TBA.
>
> Shared Task - Discourse Unit Segmentation Across Formalisms
>
> This workshop introduces the first iteration of a cross-formalism shared
> task on discourse unit segmentation. Since all major discourse parsing
> frameworks imply a segmentation of texts into segments, learning
> segmentations for and from diverse resources is a promising area for
> converging methods and insights. We will provide training, development
> and test datasets from all available languages in RST, SDRT, and PDTB,
> using a uniform format. Because different corpora, languages, and
> frameworks use different guidelines, the shared task will promote the
> design of flexible methods for dealing with various guidelines, and will
> help to push forward the discussion of converging standards for
> discourse units. For datasets which have treebanks, we will evaluate in
> two different scenarios: with and without gold syntax.
>
> NOTE: The training data for the shared task has been released and is
> available at:
> https://github.com/disrpt/sharedtask2019.
>
> Important Dates
>
> Fri, Dec 28 - shared task sample data release
> Mon, Jan 21 - training data release
> Fri, Feb 15 - test data release
> Thu, Feb 28 - papers due (shared task & regular workshop papers)
> Wed, March 27 - notification of acceptance
> Fri, April 5 - camera-ready papers due
> June 6 - workshop
>
> Organization
>
> Amir Zeldes (Georgetown University, Washington, DC, USA)
> Debopam Das (Humboldt University of Berlin, Germany)
> Erick Galani Maziero (Universidade Federal de Lavras, Brazil)
> Juliano Desiderato Antonio (Universidade Estadual de Maringá, Brazil)
> Mikel Iruskieta (University of the Basque Country, Spain)
>
> Program Committee
>
> Stergos Afantenos, IRIT - Université Paul Sabatier, France
> Farah Benamara, IRIT - Université Paul Sabatier, France
> Irene Castellon, Universitat de Barcelona, Spain
> Johann Christian Chiarcos, Wolfgang Goethe Universität Frankfurt,
> Germany
> Maria Beatriz Nascimento Decat, Universidade Federal de Minas Gerais,
> Brazil
> Iria da Cunha, Universidad Nacional de Educación a Distancia, Spain
> Barbara Di Eugenio, University of Illinois at Chicago, USA
> Arantza Diaz de Ilarraza, University of the Basque Country, Spain
> Flavius Frasincar, Erasmus University Rotterdam, Netherlands
> Maria Eduarda Giering, Universidade do Vale do Rio dos Sinos, Brazil
> Nancy Green, University of North Carolina, USA
> Graeme Hirst, University of Toronto, Canada
> Kerstin Kunz, Universität Heidelberg, Germany
> Ekaterina Lapshinova-Koltunski, Universität des Saarlandes, Germany
> Jiri Mirovsky, Charles University, Czech Republic
> Anna Nedoluzhko, Charles University, Czech Republic
> Thiago Pardo, Universidade de São Paulo, Brazil
> Lucie Polakova, Charles University, Czech Republic
> Gisela Redeker, University of Groningen, Netherlands
> Hannah Rohde, University of Edinburgh, UK
> Gerardo Sierra, Universidad Nacional Autónoma de México, Mexico
> Christian Stab, Technische Universität Darmstadt, Germany
> Manfred Stede, Universität Potsdam, Germany
> Juan-Manuel Torres, Laboratoire Informatique d'Avignon, France
> Nianwen Xue, Brandeis University, USA
>
>
> -------------------------------------------------------------------------------------------------
>
> -------------------------------------------------------------------------------------------------
>
> Debopam Das
> Postdoctoral Researcher
> Department of English and American Studies
> Humboldt-Universität zu Berlin
> dasdebop at hu-berlin.de
> debdas at uni-potsdam.de; ddas at sfu.ca
> http://www.sfu.ca/~ddas
>
>
>
>
>
> ---------- Forwarded message ----------
> From: Karim BOUZOUBAA <bouzoubaa at emi.ac.ma>
> To: Mostafa Hany <mostafa_hany56 at yahoo.com>, "corpora at uib.no" <
> corpora at uib.no>
> Cc:
> Bcc:
> Date: Tue, 22 Jan 2019 18:17:42 +0000
> Subject: Re: [Corpora-List] (Large) dataset with sentence-level semantic
> phenomena
> you can try ontonotes from ldc
>
>
>
> Sent from my Samsung Galaxy smartphone.
>
>
> -------- Original message --------
> From: Mostafa Hany <mostafa_hany56 at yahoo.com>
> Date: 1/22/19 16:56 (GMT+00:00)
> To: corpora at uib.no
> Subject: [Corpora-List] (Large) dataset with sentence-level semantic
> phenomena
>
> Hey,
>
> Does anyone know of any English datasets which are annotated for a range
> of sentence-level semantic phenomena?
>
> Thanks!
> Mostafa
>
>
>
> ---------- Forwarded message ----------
> From: Katrin Erk <katrin.erk at gmail.com>
> To: Mostafa Hany <mostafa_hany56 at yahoo.com>
> Cc: "corpora at uib.no" <corpora at uib.no>
> Bcc:
> Date: Thu, 24 Jan 2019 14:57:09 -0600
> Subject: Re: [Corpora-List] (Large) dataset with sentence-level semantic
> phenomena
> Hi Mostafa,
>
> Have you checked out the Groningen meaning bank? http://gmb.let.rug.nl/
>
> Cheers,
> Katrin
>
> On Tue, Jan 22, 2019 at 10:56 AM Mostafa Hany <mostafa_hany56 at yahoo.com>
> wrote:
>
>> Hey,
>>
>> Does anyone know of any English datasets which are annotated for a range
>> of sentence-level semantic phenomena?
>>
>> Thanks!
>> Mostafa
>> _______________________________________________
>> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
>> Corpora mailing list
>> Corpora at uib.no
>> https://mailman.uib.no/listinfo/corpora
>>
>
>
> --
> Katrin Erk, Linguistics
> The University of Texas at Austin
> http://www.katrinerk.com
>

--
Lasha Abzianidze
CLCG - Center for Language and Cognition
University of Groningen


