[Corpora-List] plain text and .caj files (Mike Scott)
Lei Lei
leileileo at gmail.com
Fri Nov 12 01:55:12 CET 2010
Re: plain text and .caj files (Mike Scott)
Does anyone know how to extract the plain Chinese text from .caj text
files? I understand these are similar in conception to .PDFs.
Thanks -- Mike Scott, Aston University
---------------------------
Hi, Mike,
Yes, it is similar to .PDFs.
If you want to extract the Chinese characters (text) from .caj files, one quick but dirty method is to first virtually print the .caj files into .PDFs, and then to extract the Chinese characters from the .PDFs.
You can also directly use the "choose text" button to choose the text you want and then extract the text by copying and pasting within the CAJViewer, but I prefer the aforementioned method.
Good luck!
Lei
2010-11-12
Lei Lei
Associate Professor
School of Foreign Languages
Huazhong University of Science and Technology
Email: leileicn at 126.com
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: text/html
Size: 3836 bytes
Desc: not available
URL: <http://www.uib.no/mailman/public/corpora/attachments/20101112/7fd3d737/attachment.txt>
More information about the Corpora
mailing list