[Corpora-List] corpus labelled with segment-wise post-editing time and/or keystrokes

Rohit Gupta enggrohitgupta at gmail.com
Thu Sep 3 19:52:07 CEST 2015


Dear Philipp,

Looks good! Thank you very much for this.

Regards, Rohit

On Thu, Sep 3, 2015 at 3:26 PM, Philipp Koehn <phi at jhu.edu> wrote:


> Hi,
>
> there is a bunch of this kind of data released by the CASMACAT project:
> http://www.casmacat.eu/index.php?n=Main.Downloads
>
> Some of it also includes eye tracking data.
>
> -phi
>
> On Thu, Sep 3, 2015 at 9:13 AM, Rohit Gupta <enggrohitgupta at gmail.com>
> wrote:
>
>> Dear All,
>>
>> Please let me know if there is any corpus (1000+ segments) where
>> segment-wise translation-reference pairs are labelled with post-editing
>> time and/or keystrokes (preferably English but any language).
>>
>> Something like this:
>>
>> <Segment-1> <Translation-1> <Reference-1> <Post-editing
>> time/Keystrokes-1>
>> <Segment-2> <Translation-2> <Reference-2> <Post-editing
>> time/Keystrokes-2
>> >
>> ----
>>
>> <Translation>: Either machine translation or a match retrieved from
>> translation memory
>> < Reference>: correct post-edited version of <Translation>
>>
>> Thank you very much.
>>
>>
>>
>>
>> Regards,
>> Rohit Gupta
>> Early Stage Researcher
>> University of Wolverhampton
>>
>>
>> _______________________________________________
>> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
>> Corpora mailing list
>> Corpora at uib.no
>> http://mailman.uib.no/listinfo/corpora
>>
>>
>
-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 2793 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20150903/abc5a002/attachment.txt>



More information about the Corpora mailing list