[Corpora-List] Any Urdu sentence tokenizer and word tokenizer (Python based)

Hugo Sanjurjo González hugo.sanjurjo at deusto.es
Wed Dec 9 15:11:01 CET 2020


Hi Fatima,

I found Urduhack [link <https://pypi.org/project/urduhack/>]. It may be useful for your purposes.

Best regards, Hugo

El mié, 9 dic 2020 a las 8:50, Fatima Tuz Zuhra (<fzuhra at cs.qau.edu.pk>) escribió:


> Hi,
>
> I am in search of an Urdu word tokenizer and sentence splitter. Any help
> in this regard would be appreciated.
>
> Regards.
>
> --
> Fatima Tuz Zuhra
> Ph.D. Scholar
> Quaid i Azam University Islamabad, Pakistan.
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list
> Corpora at uib.no
> https://mailman.uib.no/listinfo/corpora
>

-- *Dr. Hugo Sanjurjo González* Departamento de Tecnologías Informáticas, Electrónicas y de Comunicación [image: Deusto] <http://www.deusto.es/> Universidad de Deusto / Deustuko Unibertsitatea Portal de Arriaga 62 Bajo, 01013 Vitoria-Gasteiz Tel. 945 01 01 09 hugo.sanjurjo at deusto.es <hugo.sanjurjo at deusto.es> *www.deusto.es* <http://www.deusto.es/> [image: twitter] <http://twitter.com/deusto> [image: facebook] <https://www.facebook.com/UDeusto> [image: linkedin] <https://www.linkedin.com/edu/school?id=12212&trk=edu-cp-title> [image: Instagram] <https://instagram.com/udeusto/> [image: deusto.eus] <http://deusto.eus/> -------------- next part -------------- A non-text attachment was scrubbed... Name: not available Type: text/html Size: 3911 bytes Desc: not available URL: <https://mailman.uib.no/public/corpora/attachments/20201209/efd6ea1b/attachment.txt>



More information about the Corpora mailing list