Document Analysis Systems VI: 6th International Workshop, DAS 2004, Florence, Italy, September 8-10, 2004, ProceedingsSimone Marinai, Andreas Dengel Thisvolumecontainspapersselectedforpresentationatthe6thIAPRWorkshop on Document Analysis Systems (DAS 2004) held during September 8–10, 2004 at the University of Florence, Italy. Several papers represent the state of the art in a broad range of “traditional” topics such as layout analysis, applications to graphics recognition, and handwritten documents. Other contributions address the description of complete working systems, which is one of the strengths of this workshop. Some papers extend the application domains to other media, like the processing of Internet documents. The peculiarity of this 6th workshop was the large number of papers related to digital libraries and to the processing of historical documents, a taste which frequently requires the analysis of color documents. A total of 17 papers are associated with these topics, whereas two yearsago (in DAS 2002) only a couple of papers dealt with these problems. In our view there are three main reasons for this new wave in the DAS community. From the scienti?c point of view, several research ?elds reached a thorough knowledge of techniques and problems that can be e?ectively solved, and this expertise can now be applied to new domains. Another incentive has been provided by several research projects funded by the EC and the NSF on topics related to digital libraries. |
Contents
Digital Libraries | 1 |
The Trinity College Dublin 1872 Online Catalogue | 17 |
A SemanticBased System for Querying Personal Digital Libraries | 39 |
A SegmentationFree Recognition Technique | 63 |
A Complete Approach to the Conversion | 90 |
Segmentation of Handwritten Characters | 114 |
Selforganizing Maps and Ancient Documents | 125 |
Layout Analysis | 146 |
WordWise Script Identification from Indian Documents | 310 |
Recognizing Freeform Digital Ink Annotations | 322 |
Postprocessing of Handwritten Pitmans Shorthand | 332 |
Graphics Recognition | 342 |
Performance Evaluation of Symbol Recognition | 354 |
Attributed Graph Matching Based Engineering Drawings Retrieval | 378 |
A Platform to Extract Knowledge from Graphic Documents | 389 |
Internet Documents | 401 |
Physical Layout Analysis of Complex Structured Arabic Documents | 170 |
Multiview Fabien Carmagnac HAC for Semisupervised Pierre Héroux Document and Eric Image Classification Trupin | 191 |
Layout and Content Extraction for PDF Documents | 213 |
Color Documents | 229 |
Serialized kMeans for Adaptative Color Image Segmentation | 252 |
Adaptive Region Growing Color Segmentation for Text | 264 |
Preprocessing and Segmentation | 276 |
Handwritten Documents | 286 |
Information Retrieval System for Handwritten Documents | 298 |
RuleBased Structural Analysis of Web Pages | 425 |
Extracting Table Information from the Web | 438 |
Document Analysis Systems | 451 |
A Document Analysis System Builder Eric Trupin | 472 |
Applications | 496 |
Document Image Retrieval in a Question Answering System | 521 |
Document Image Watermarking Based on WeightInvariant Partition | 546 |
Coupling Media for Thematic Segmentation | 559 |
Other editions - View all
Document Analysis Systems VI: 6th International Workshop, DAS 2004, Florence ... Simone Marinai,Andreas Dengel No preview available - 2014 |
Document Analysis Systems VI: 6th International Workshop, DAS 2004, Florence ... Simone Marinai,Andreas Dengel No preview available - 2004 |
Common terms and phrases
algorithm Analysis and Recognition annotations application approach archive automatic background Bangla Berlin Heidelberg 2004 binary bounding box Braille character classifier clusters color Computer Computer Vision Conference on Document connected components corresponding database defined Dengel Eds described detection Devanagari digital library distance DjVu document image Document Image Analysis evaluation example extraction feature extraction Figure function graph graphics recognition handwriting recognition handwritten Hanja HSV color space identified IEEE image processing Indic scripts input interface International Conference invoice label layout logical manuscripts Marinai matching metadata nodes paper parameters Pattern Recognition performance pixels problem Proc proposed method query representation represented retrieval scanned scenario script segmentation spam SPIHT Springer-Verlag Berlin Heidelberg stamp structure support vector machine symbol recognition Table techniques template text line text segment textual threshold tion vector Voronoi tessellation word