TY - GEN
T1 - Proper names extraction from fax images combining textual and image features
AU - Likforman-Sulem, Laurence
AU - Vaillant, Pascal
AU - Yvon, François
N1 - Publisher Copyright:
© 2003 IEEE.
PY - 2003/1/1
Y1 - 2003/1/1
N2 - In the frame of a Unified Messaging System, a crucial task of the system is to provide the user with key information on every message received, like keywords reflecting the object of the message, or the name of the sender. However, in the case of facsimiles, this information is not as easy to detect as in the case of emails, since no standard headers are defined. The aim of the present work is to identify and extract a specific information (the name of the sender) from a fax cover page. For this purpose, methods based on image document analysis (OCR recognition, physical blocks selection), and text analysis methods (optimised dictionary lookup, local grammar rules), are implemented to work in parallel. The fusion of their results brings a more accurate guess than any of the methods would achieve separately.
AB - In the frame of a Unified Messaging System, a crucial task of the system is to provide the user with key information on every message received, like keywords reflecting the object of the message, or the name of the sender. However, in the case of facsimiles, this information is not as easy to detect as in the case of emails, since no standard headers are defined. The aim of the present work is to identify and extract a specific information (the name of the sender) from a fax cover page. For this purpose, methods based on image document analysis (OCR recognition, physical blocks selection), and text analysis methods (optimised dictionary lookup, local grammar rules), are implemented to work in parallel. The fusion of their results brings a more accurate guess than any of the methods would achieve separately.
U2 - 10.1109/ICDAR.2003.1227724
DO - 10.1109/ICDAR.2003.1227724
M3 - Conference contribution
AN - SCOPUS:21944446351
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 545
EP - 549
BT - Proceedings - 7th International Conference on Document Analysis and Recognition, ICDAR 2003
PB - IEEE Computer Society
T2 - 7th International Conference on Document Analysis and Recognition, ICDAR 2003
Y2 - 3 August 2003 through 6 August 2003
ER -