[Corpora-List] Corpus of English translations from various Indian languages

Parth Mehta parth.mehta126 at gmail.com
Mon May 23 12:00:22 CEST 2016


Hello Shuly Wintner,

Several parallel corpora have been developed under TDIL, Department of Electronics and Information Technology, Govt. of India.

1) Gyan Nidhi Parallel Text Corpus (11 Indian languages + English)
2) Health related - ILCI (Hindi pivot language: 10 Indian languages + English)
3) Tourism related - EILMT (English pivot language: 6 languages)
4) Tourism related - ILCI (Hindi pivot language: 10 Indian languages + English)
5) Agriculture and entertainment related - ILCI (Hindi pivot language: 14 Indian languages + English)

All of these corpora are available at: http://www.tdil-dc.in/index.php?option=com_catalogue&task=viewTools&id=%20125&lang=en

*Note*: The Gyan Nidhi corpus (which will probably be the most relevant to you) is at the end of the list. The list on the website loads further entries only when you scroll down to the very bottom of the page.

*Alternate link* for the Gyan Nidhi corpus: http://www.tdil-dc.in/index.php?option=com_download&task=showresourceDetails&toolid=281&lang=en

Regards, Parth

On Sun, May 22, 2016 at 3:31 PM, <corpora-request at uib.no> wrote:


> Today's Topics:
>
> 1. Fwd: CFP:: Request (Md. Hasanuzzaman)
> 2. Fwd: CFP:: Request (Md. Hasanuzzaman)
> 3. Job opening: Natural Language Processing opportunity at
> Conversant (Davis, Paul)
> 4. Corpus of English translations from various Indian languages
> (Wintner Shuly)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Sat, 21 May 2016 20:51:51 +0800
> From: "Md. Hasanuzzaman" <hasanuzzaman.im at gmail.com>
> Subject: [Corpora-List] Fwd: CFP:: Request
> To: corpora at lists.uib.no, corpora at uib.no
>
> 2016 Third International Conference on Digital Information Processing, Data
> Mining, and Wireless Communications (DIPDMWC)
>
> National Research Nuclear University MEPhI (Moscow Engineering Physics
> Institute), Moscow, Russia
>
> July 06-08, 2016
>
> http://sdiwc.net/conferences/dipdmwc2016/
>
> ==================================================================================================================
>
> The conference welcomes papers on the following (but not limited to)
> research topics:
>
> - Adaptive Signal Processing
> - Parallel Programming & Processing
> - Artificial Intelligence
> - Expert Systems
> - Image Processing
> - Information Security and Cryptography
> - Modulation, Coding, and Channel Analysis
> - Multimedia Signal Processing
> - Bioinformatics & Biomedical Imaging
> - Biomedical Signal Processing
> - Natural Language Processing
> - Neural Networks and Genetic Algorithms
> - Computer-Aided Surgery
> - Data Compression and Watermarking
> - Data Mining Techniques
> - Ethics of Data Mining
> - Risk Management and Analysis
> - Data Classification and Clustering
> - Abnormally and Outlier Detection
> - Feature Extraction and Data Reduction
> - Multi-Task Learning
> - Optimization Techniques
> - Data Cleaning and Processing
> - Text and Web Mining
> - Bluetooth and Personal Area Networks
> - Wireless System Architecture
> - Mobile Management in Wireless Networks
> - Mobile Database Access and Design
> - IP Multimedia Sub-Systems
> - Key Management Protocols
> - Mobile/ Wireless Network Modeling and Simulation
> - Mobile / Wireless Network Planning
> - Wireless Network Standard and Protocols
> - Digital Right Management and Multimedia Protection
> and many more...
>
>
> ==================================================================================================================
> Important Dates
>
> Submission Dates: Open from now until June 6, 2016
> Notification of Acceptance: 2-4 weeks from the submission date
> Camera Ready Submission: June 26, 2016
> Registration Deadline : June 26, 2016
> Conference Dates : July 6-8, 2016
>
> ------------------------------
>
> Message: 3
> Date: Sat, 21 May 2016 14:02:45 +0000
> From: "Davis, Paul" <pdavis at conversantmedia.com>
> Subject: [Corpora-List] Job opening: Natural Language Processing
> opportunity at Conversant
> To: "corpora at uib.no" <corpora at uib.no>
>
> Conversant is looking for Natural Language Processing Scientists to join
> our Decision Sciences R&D team in Chicago, Illinois
>
> Job Description: Data Scientist - (0067637) Chicago, IL, United States
>
> http://www.conversantmedia.com/careers, then search for Job ID 0067637
>
> -------------------------------------------------------------------------------------------
>
> Data Scientist
> Chicago, IL, United States
> Full-time
> OVERVIEW
>
> As a Data Scientist in our Decision Sciences R&D organization, you will be
> responsible for researching and building machine learning and natural
> language processing applications to extend Conversant's personalization
> platform. Conversant's business is based on analyzing anonymized data at
> internet scale and evaluating more than 1 trillion advertising
> opportunities per month in real-time. You will work on real-world problems
> as part of our highly collaborative R&D team, and your solutions will
> directly and rapidly impact our business. This includes researching and
> developing models, algorithms, and applications; analyzing raw source data
> and derived data; presenting findings; and building tools and analyses for
> new and existing products.
>
> RESPONSIBILITIES
> * Develop an understanding of Conversant's personalization
> platform and proprietary datasets
> * Use your natural language processing and machine learning
> expertise to research and recommend the best approaches to solving our
> technology and business problems
> * Design, implement, and validate your solutions in Apache Spark,
> Apache Hive, using Scala or Python on a large state-of-the-art cluster
> * Work with our Engineering teams to integrate your solutions into
> Conversant's platform
> * Participate fully in our collaborative approach to research and
> applications projects
> REQUIREMENTS
> * A Ph.D., (or Master's degree plus at least 3 years' relevant
> experience), in Computer Science, Linguistics, Statistics, Electrical
> Engineering, Mathematics, Economics, Physics, or a related scientific
> discipline
> * Research experience and coursework in Machine Learning and
> Natural Language Processing
> * Fluency in programming
> * Experience with large data sets
> * Strong understanding of statistics and modeling techniques
> * Desire to work in a highly collaborative environment
> ADDITIONAL USEFUL BUT NOT REQUIRED SKILLS
>
> * Experience with distributed computing, such as Hadoop, Spark, or
> related technologies
>
> * Experience with Information Retrieval or Recommender Systems
>
> * Familiarity with SQL, Scala, Python, or Java
>
>
>
>
>
> This email and any files included with it may contain privileged,
> proprietary and/or confidential information that is for the sole use
> of the intended recipient(s). Any disclosure, copying, distribution,
> posting, or use of the information contained in or attached to this
> email is prohibited unless permitted by the sender. If you have
> received this email in error, please immediately notify the sender
> via return email, telephone, or fax and destroy this original transmission
> and its included files without reading or saving it in any manner.
> Thank you.
>
> ------------------------------
>
> Message: 4
> Date: Sun, 22 May 2016 12:25:50 +0300
> From: Wintner Shuly <shuly at cs.haifa.ac.il>
> Subject: [Corpora-List] Corpus of English translations from various
> Indian languages
> To: corpora at uib.no
>
> Hi,
>
> I'm looking for English texts translated from any of the languages spoken
> in India. English translated from other Asian (and, more generally,
> non-European) languages is also of interest. Any links would be greatly
> appreciated. Thank you,
>
> Shuly
>
> --
> Shuly Wintner
> Dept. of Computer Science, University of Haifa, 31905 Haifa, Israel
> Phone: +972 (4) 8288180 Fax: +972 (4) 8249331
> shuly at cs.haifa.ac.il http://cs.haifa.ac.il/~shuly
>
>
>
>
>
>
> End of Corpora Digest, Vol 107, Issue 36
> ****************************************
>

--
Regards,
Parth Mehta
DA-IICT


