[Corpora-List] Succeed hackathon at UA

Marco BÜCHLER mbuechler at e-humanities.net
Tue Mar 25 15:28:14 CET 2014


On the 10th and 11th of April 2014, the Succeed project will hold a hackathon at the University of Alicante, whose aim is to look at improving the state-of-the-art open-source tools for the digitisation of textual content such as books and newspapers.

Over the two days, developers will work together in small groups to discuss, roadmap and plan the future development of existing tools. Some of the topics up for discussion are:

* How to train the Tesseract

<http://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3> OCR

engine.

* Creation of XSLT stylesheets for format conversion, e.g. hOCR, PAGE,

FRXML.

* Debian package generation.

The hackathon provides a unique opportunity to meet developers involved in digitisation projects all over Europe. Participation to the event is FREE OF CHARGE, but please make sure you reserve <https://www.eventbrite.com/e/2nd-succeed-dev-workshop-hackathon-tickets-10907317079> your place. Participants are also encouraged to take a look at Last year's hackathon's outcomes <http://www.digitisation.eu/blog/1st-succeed-hackathon-kb/> and background information <http://succeed-project.eu/wiki/index.php/Developers_workshops_%28hackathons%29>.

-- Marco BÜCHLER Georg-August-Universität Göttingen Göttingen Centre for Digital Humanities (GCDH) Papendiek 16 37073 Göttingen (Heynehaus)

eMail : mbuechler at e-humanities.net Web : http://www.gcdh.de/ Profil : http://www.gcdh.de/en/people/team/marco-buechler/ Facebook : http://www.facebook.com/marco.buechler LinkedIn : http://www.linkedin.com/profile/view?id=15098543&trk=tab_pro Twitter : https://twitter.com/mabuechler

l-h

-------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 4036 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20140325/ed677df1/attachment.txt> -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: image/png Size: 12771 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20140325/ed677df1/attachment.png>



More information about the Corpora mailing list