[Corpora-List] Encoding of apostrophes and quotes
eric at comp.leeds.ac.uk
Fri Jun 30 09:50:03 CEST 2006
I think it is quite reasonable for UNICODE standards to give apostrophe
and single-quote a single encoding, if in practice many people cant
understand the difference and use either character interchangeably.
I see this as analogous to a word whcih can have more than one function,
e.g. "to" can function as preposition or infinitival marker,
or "one" has four possible tags in the ICE/TOSCA part-of-speech tagset
corresponding to four separable functions.
Senior Lecturer, Language research group, School of Computing,
Faculty of Engineering, University of Leeds, LEEDS LS2 9JT, England
TEL: +44-113-3435430 FAX: +44-113-3435468 http://www.comp.leeds.ac.uk/eric
More information about the Corpora-archive