*APPLICATION NAME
Infty.exe ver.3.2
(C) Copyright 2000-2020:
Masakazu Suzuki (Kyushu University) and
Non Profit Organization Science Accessibility Net.
*DESCRIPTION
Infty.exe is a command-line application option of InftyReader,
OCR (Optical Character Recognition) software for scientific documents
including mathematical formulae.
It takes binary images in TIF/TIFF (CCITT-3/CCITT-4/LZW), GIF or PNG
format at a resolution of 600 dpi or 400 dpi, as well as PDF files, as inputs,
and returns the recognition result in various formats (KML, IML, HTML,
MathML, LaTeX, HRTeX, EPUB3) as outputs.
IML format is an original format describing recognition results.
KML format extends the IML format with character rectangle
coordinates and recognition candidate information.
MathML format is a standard format developed by W3C (http://www.w3.org/Math/).
HRTeX (Human readable TeX) is an output format (*.txt) for visually
impaired individuals.
You can control the LaTeX output style by editing "LaTeXOutputSetting.cfg"
included in the package.
*LICENSE
Infty.exe can be used under the same license agreement as InftyReader.
Please see the file License_E.ext included in the package.
*SYNOPSIS
Infty -help
Infty folderName [fileName1] [fileName2]... [option]...
Infty recognizes images specified by `fileName1', `fileName2',...
in the folder `folderName'.
Please be aware that folderName cannot be omitted, and that it must be
followed by a space (not by the character "\").
When no file names are specified, all files with the extension .tiff or .tif
in the `folderName' are recognized.
The output file is saved in the same folder as the input image files.
The default output format is IML.
When there is exactly one input file, the output file has the same name
as the input file.
When there is more than one input file, the output file has the same name
as the `folderName'.
e.g.) Infty C:\Data\Images
e.g.) Infty "C:\Document and Settings\user\images" image1.tif
e.g.) Infty C:\Data\Images sample1.tif sample2.tif
e.g.) Infty C:\Data\Images -n test -o C:\OutPut -f iml -f tex
e.g.) Infty C:\Data\Articles paper1.pdf -startPage 1 -endPage 10 -ePDF 1
(!) The folder name should not be omitted.
(!) The folder name should begin with the drive name "C:\....", "D:\....".
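The synopsis above can also be driven from a script. The following is a
minimal Python sketch, not part of the Infty package, that assembles the
argument list for one run. The helper name build_infty_command and the
assumption that "Infty" can be found on the PATH are hypothetical; only the
folder rule and the options -n, -o and -f are taken from this document.

```python
import ntpath
from typing import Iterable, List, Optional


def build_infty_command(
    folder: str,
    files: Iterable[str] = (),
    formats: Iterable[str] = (),
    output_name: Optional[str] = None,
    output_folder: Optional[str] = None,
    exe: str = "Infty",  # assumption: Infty.exe is reachable on the PATH
) -> List[str]:
    """Return the argument list for one Infty run (hypothetical helper)."""
    # The folder name is mandatory and must begin with a drive name (C:\...).
    drive, _ = ntpath.splitdrive(folder)
    if len(drive) != 2 or not drive[0].isalpha() or drive[1] != ":":
        raise ValueError("folder must begin with a drive name such as C:\\")

    cmd = [exe, folder]
    cmd.extend(files)  # no file names: Infty picks up the .tif/.tiff files
    if output_name is not None:
        cmd += ["-n", output_name]
    if output_folder is not None:
        cmd += ["-o", output_folder]
    for fmt in formats:  # -f is given once per requested format
        cmd += ["-f", fmt]
    return cmd
```

The returned list can be passed to subprocess.run(cmd, check=True). On
Windows, Python then quotes arguments that contain spaces, such as
"C:\Document and Settings\user\images".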
*OPTIONS
-r (400 | 600)
specifies the resolution of input images (default 600 dpi). Only 600/400
dpi are acceptable. The resolution is set to 600 dpi when any other value
is specified.
e.g.) Infty C:\Data\Images sample1.tif -r 400
-l (Japanese, English)
specifies the language of the texts of input images (default "english").
-f ( iml | kml | tex | mathml | hrtex )
specifies output formats (default iml).
Several result files in different formats can be produced
by specifying the option several times.
e.g.) Infty C:\Data\Images sample1.tif -f iml -f kml
-fo ( iml | kml | tex | mathml | hrtex )
specifies output formats (default iml), and opens the results with the
associated applications.
e.g.) Infty C:\Data\Images sample1.tif -fo iml -fo kml
-o outputFolderName
changes the output folder name where the output file is produced.
e.g.) Infty C:\Data\Images -o C:\Data\OutPut
-n outputFileName
changes the output file name. The file extension is determined by the
output format specified with the option -f.
e.g.) Infty C:\Data\Images -n test
-paragraph
adds a line feed at the end of each paragraph. By default, a line feed
is added at the end of each line of the original image.
-preamble preambleFile
specifies the preamble file used in TeX format output.
When no preamble file is specified, "defaultpreamble.tex" in the same
folder as "Infty.exe" will be used. If the preamble file is not
found, the preamble will be empty in the TeX output.
e.g.) Infty C:\Data\Images -f tex -preamble C:\Data\preamble.tex
-nopreamble
outputs LaTeX source without a preamble.
The commands "\begin{document}" and "\end{document}" will also be omitted.
-utf8
uses UTF-8 in TeX format output.
If you select a language other than Japanese or English, the option
"-utf8" is recommended.
*For PDF recognition:
-imagePDF
Set this option if the input file is a PDF created from scanned images.
-ePDF 1
Use character information embedded in PDF instead of OCRing.
-ePDF 0
Ignore character information embedded in PDF and use OCR.
-startPage a
-endPage b
Recognize the PDF from page a to page b.
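For long PDF files it may be convenient to run the recognition in several
page ranges. The following Python sketch, which is not part of the Infty
package, splits a page range into chunks and builds one argument list per
chunk. The helper names page_chunks and pdf_commands are hypothetical. The
sketch assumes that pages are numbered from 1 and that -endPage is inclusive,
which this document does not state explicitly. Each run gets its own -n
value, on the assumption that runs on the same PDF would otherwise write to
the same output file name.

```python
from typing import Iterator, List, Tuple


def page_chunks(first: int, last: int, size: int) -> Iterator[Tuple[int, int]]:
    """Yield (start, end) page pairs covering first..last in steps of size."""
    if first < 1 or last < first or size < 1:
        raise ValueError("expected 1 <= first <= last and size >= 1")
    start = first
    while start <= last:
        end = min(start + size - 1, last)
        yield start, end
        start = end + 1


def pdf_commands(
    folder: str,
    pdf_name: str,
    first: int,
    last: int,
    size: int,
    use_embedded_text: bool = True,
) -> List[List[str]]:
    """Build one Infty argument list per page chunk (hypothetical helper)."""
    stem = pdf_name.rsplit(".", 1)[0]
    commands = []
    for start, end in page_chunks(first, last, size):
        commands.append([
            "Infty", folder, pdf_name,
            "-startPage", str(start),
            "-endPage", str(end),
            # -ePDF 1 uses embedded character information, -ePDF 0 uses OCR
            "-ePDF", "1" if use_embedded_text else "0",
            # distinct output name per chunk (naming scheme is an assumption)
            "-n", f"{stem}_p{start}-{end}",
        ])
    return commands
```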
-----
Usage samples:
e.g.) Infty C:\Data\Images
e.g.) Infty "C:\Document and Settings\user\images" image1.tif
e.g.) Infty C:\Data\Images sample1.tif sample2.tif
e.g.) Infty C:\Data\Images -n test -o C:\OutPut -f iml -f tex
e.g.) Infty C:\Data\Images FrenchTest.tif -l French -utf8 -f iml -f tex -preamble C:\Data\preamble.tex
e.g.) Infty C:\Data\Books book1.pdf -startPage 10 -endPage 20 -ePDF 1
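
The rules stated in this document for the -r fallback and for the location
and name of the output file can be summarized in a short Python sketch. It is
an illustration only, not code from the Infty package, and the function names
are hypothetical. The file extension is left out on purpose, because this
document does not list the extension for every format. The sketch also
assumes that "files" is the list of input files actually processed.

```python
import ntpath
from typing import Optional, Sequence


def effective_resolution(requested: int) -> int:
    """Only 400 and 600 dpi are accepted; any other value becomes 600."""
    return requested if requested in (400, 600) else 600


def output_stem(
    folder: str,
    files: Sequence[str],
    output_name: Optional[str] = None,
    output_folder: Optional[str] = None,
) -> str:
    """Return the output path without extension (hypothetical helper)."""
    if output_name is not None:
        stem = output_name  # -n overrides the default name
    elif len(files) == 1:
        stem = ntpath.splitext(files[0])[0]  # same name as the input file
    else:
        stem = ntpath.basename(folder.rstrip("\\"))  # same name as the folder
    # -o overrides the default location, which is the input folder
    target = output_folder if output_folder is not None else folder
    return ntpath.join(target, stem)
```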