Business forms classification using earth mover's distance

SS Bukhari, M Ebbecke… - 2014 11th IAPR …, 2014 - ieeexplore.ieee.org
SS Bukhari, M Ebbecke, M Gillmann
2014 11th IAPR International Workshop on Document Analysis Systems, 2014ieeexplore.ieee.org
Form Classification has not been focused on for the last decade. Unfortunately the
algorithms published mainly in the 80s and 90s do not meet the requirements in our present
commercial document analysis projects. There we are confronted with conditions and
requirements unanticipated by that research, such as fax distortions and-even worse-form
variations. In this work we introduce a new color-coded pixel-based form classification
method using Earth Mover's Distance (EMD) that is robust against fax distortions and content …
Form Classification has not been focused on for the last decade. Unfortunately the algorithms published mainly in the 80s and 90s do not meet the requirements in our present commercial document analysis projects. There we are confronted with conditions and requirements unanticipated by that research, such as fax distortions and - even worse - form variations. In this work we introduce a new color-coded pixel-based form classification method using Earth Mover's Distance (EMD) that is robust against fax distortions and content variations. Experimental results prove the effectiveness of the presented method. It achieved more than 90% classification accuracy on a real-world business forms dataset, which is significantly better than the competing state-of-the-art methods.
ieeexplore.ieee.org
Showing the best result for this search. See all results