README
Name
chupa-text-decomposer-libreoffice-word
Description
This is a ChupaText decomposer plugin to extract text and meta-data
from Microsoft Word binary file format file (.doc
file). This plugin
uses LibreOffice.
You can use libreoffice-word
decomposer.
It depends on pdf
decomposer. Because it converts a office file to
PDF file and extracts text and meta-data by pdf
decomposer.
Install
Install chupa-text-decomposer-libreoffice-word gem:
% gem install chupa-text-decomposer-libreoffice-word
Install LibreOffice from download page.
Now, you can extract text and meta-data from office files:
% chupa-text document.doc
Author
- Kouhei Sutou
<[email protected]>
License
LGPL 2.1 or later.
(Kouhei Sutou has a right to change the license including contributed patches.)