libany2uni 1.0.3
libany2uni is a library to extract raw unicode text from any written documents (office documents). It should be useful to developp
|
|||||||||||||||||||
libany2uni is a library to extract raw unicode text from any written documents (office documents).
It should be useful to developpers of search engine, text processing, corpus analysis, ....
UTF8 tool:
In the 'utils' directory, you can find a tool using libany2uni. It is called 'any2utf8' It reads a document and outputs the text in UTF8, to the standard output.
To compile it, just type make.
Run it with './any2utf8 < path + name of the document >'.
You can also get metadata with the -m option.
tags
Download libany2uni 1.0.3
Authors software
|
|
Similar software
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Other software in this category
|
|
|
|
|
|
|
|
|
|
Featured Software
jEdit 4.3 pre8
jEdit is an Open Source text editor written in Java
Opera 9.02
Surf the Internet in a safer, faster, and easier way with Opera browser
GNU Aspell 0.60.4
GNU Aspell is a Free and Open Source spell checker designed to eventually replace Ispell
- Communications
- Database
- Desktop Environment
- Games
- Internet
- Multimedia
- Office
- Programming
- Science and Engineering
- System
- Text Editing&Processing
