Abstract
A method for segmentation of text that may be connected to graphics in engineering drawings is presented. It consists of three steps: growing individual characterbox regions, using a recursive merging scheme by stroke linking; merging the detected characterboxes into a textbox and determining its orientation; and re-segmenting the textbox back into the refined characterbox that can be input to an OCR subsystem. The method can segment dimensioning text as well as other classes of text. It handles both isolated and touching characters, aligned at any slant. The capability of segmenting characters that touch either themselves or graphics, which is an important feature in handling real life drawings, is obtained by focusing on intermediate vector information rather that on the raw pixel data. We present the details of the algorithm and show both successful and unsuccessful examples from an experimental set of 36 dimensioning textboxes, in which 94% segmentation rate was achieved with 3% false alarm rate.
Chapter PDF
Similar content being viewed by others
References
D. Dori and K. Tombre, “From Engineering Drawings to 3D CAD Models: Are We Ready Now?”, Computer Aided Design, 1995, 27(4), 243–254.
L.A. Fletcher and R. Kasturi, “A Robust Algorithm for Textbox String Separation from Mixed Text/Graphics Images”, IEEE PAMI, 1988, 10(6), 900–918.
C.P. Lai and R. Kasturi, “Detection of Dimension Sets in Engineering Drawings”, Proc. of 2nd ICDAR, Tsukuba, Japan, 1993, 606–613.
I. Chai and D. Dori, “Extraction of Text Boxes from Engineering Drawings”, Proc. SPIE/IS&T Symposium on Electronic Imaging Science and Technology, Conference on Character Recognition and Digitizer Technologies, San Jose, 1992, SPIE Vol. 1661, 38–49.
D. Dori, Y. Liang and I. Chai, “Spare Pixel Recognition of Primitives in Engineering Drawings”, Machine Vision and Applications, 6, 1993, 69–82.
D. Dori and Y. Velkovitch, “Segmentation and Recognition of Dimensioning Text from Engineering Drawings”, Pre Proc. GREC'95, The Perm. State U., USA, Aug., 1995, 141–150.
Liu W. and D. Dori, “Sparse Pixel Tracking: A Fast Vectorization Algorithm Applied to Engineering Drawings”, Proc. of the 13th ICPR, Vienna, Austria, Aug., 1996
D. Dori, “Object-process Analysis: Maintaining the Balance between System Structure and Behaviour”, J. Logic Computation, 1995, 5(2), 227–249.
Liu W., D. Dori, Tang L. and Tang Z., “Object Recognition in Engineering Drawings Using Planar Position Indexing”, PreProc. GREC'95, The Perm. State U., Aug., 1995, 53–61.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1996 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dori, D., Wenyin, L. (1996). Vector-based segmentation of text connected to graphics in engineering drawings. In: Perner, P., Wang, P., Rosenfeld, A. (eds) Advances in Structural and Syntactical Pattern Recognition. SSPR 1996. Lecture Notes in Computer Science, vol 1121. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-61577-6_33
Download citation
DOI: https://doi.org/10.1007/3-540-61577-6_33
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-61577-4
Online ISBN: 978-3-540-70631-1
eBook Packages: Springer Book Archive