man djvutxt (Commandes) - Extract the hidden text from DjVu documents.
NAME
djvutxt - Extract the hidden text from DjVu documents.
SYNOPSIS
djvutxt [-page pagenum] inputdjvufile [outputtxtfile]
DESCRIPTION
Program djvutxt decodes the hidden text layer of a DjVu document inputdjvufile and prints the UTF8 encoded text into file outputtxtfile or the standard output. No output is produced if the file contains no hidden text layer. The hidden text layer is usually generated with the help of an optical character recognition software.
OPTIONS
- -page pagenum
- Select a page in a multipage document. Without this option, djvutxt outputs the text corresponding to all pages of the document.
REMARKS
CREDITS
This program was written by Andrei Erofeev <andrew_erofeev@yahoo.com> and was then improved Bill Riemers <docbill@sourceforge.net> and many others.