A new method to recognize Dimension Sets and its application in Architectural Drawings. I. Introduction

Similar documents
Dimension Recognition and Geometry Reconstruction in Vectorization of Engineering Drawings

Method for Real Time Text Extraction of Digital Manga Comic

Number Plate Recognition Using Segmentation

An Improved Bernsen Algorithm Approaches For License Plate Recognition

Extraction and Recognition of Text From Digital English Comic Image Using Median Filter

Libyan Licenses Plate Recognition Using Template Matching Method

UNIT 5a STANDARD ORTHOGRAPHIC VIEW DRAWINGS

tsushi Sasaki Fig. Flow diagram of panel structure recognition by specifying peripheral regions of each component in rectangles, and 3 types of detect

A Fast Segmentation Algorithm for Bi-Level Image Compression using JBIG2

COURSE CONTENTS FOR THE AVTS COURSES

MAV-ID card processing using camera images

Vehicle License Plate Recognition System Using LoG Operator for Edge Detection and Radon Transform for Slant Correction

VEHICLE LICENSE PLATE DETECTION ALGORITHM BASED ON STATISTICAL CHARACTERISTICS IN HSI COLOR MODEL

Computer-Aided Design Data Extraction Approach to Identify Product Information

Chapter 1 Introduction

Real Time ALPR for Vehicle Identification Using Neural Network

Real Time Word to Picture Translation for Chinese Restaurant Menus

Mechanical Drawing. Unit 2 Study Guide for Chapters 6-10

CAD Mechanical Design I

AUTOMATIC NUMBER PLATE DETECTION USING IMAGE PROCESSING AND PAYMENT AT TOLL PLAZA

Geometric elements for tolerance definition in feature-based product models

SCIENCE & TECHNOLOGY

CAD and CAM systems have been widely used in the

License Plate Localisation based on Morphological Operations

Virtual Engineering: Challenges and Solutions for Intuitive Offline Programming for Industrial Robot

A Geometric Correction Method of Plane Image Based on OpenCV

28 ESSCAD: EXPERT SYSTEM INTEGRATING CONSTRUCTION SCHEDULING WITH CAD DRAWING

Chapter 8. Technical Drawings

GstarCAD Mechanical 2015 Help

ORTHOGRAPHIC PROJECTION

Automatic Licenses Plate Recognition System

Volumetric positioning accuracy of a vertical machining center equipped with linear motor drives (evaluated by the laser vector method)

AN EFFICIENT APPROACH FOR VISION INSPECTION OF IC CHIPS LIEW KOK WAH

Enhanced MLP Input-Output Mapping for Degraded Pattern Recognition

R. K. Sharma School of Mathematics and Computer Applications Thapar University Patiala, Punjab, India

Computer-Aided Manufacturing

Panel and speech balloon extraction from comic books

Smart License Plate Recognition Using Optical Character Recognition Based on the Multicopter

8.2 IMAGE PROCESSING VERSUS IMAGE ANALYSIS Image processing: The collection of routines and

Blur Estimation for Barcode Recognition in Out-of-Focus Images

Conceptual Metaphors for Explaining Search Engines

APPLICATION OF COMPUTER VISION FOR DETERMINATION OF SYMMETRICAL OBJECT POSITION IN THREE DIMENSIONAL SPACE

1. is the modification of an existing product or process. A. Invention C. Recreation B. Innovation D. Enhancement

A Feature-based Method for MBD Model Exchange among Heterogeneous CAD Systems

A Scheme for Salt and Pepper oise Reduction and Its Application for OCR Systems

Application of Machine Vision Technology in the Diagnosis of Maize Disease

Locally baseline detection for online Arabic script based languages character recognition

This document is a preview generated by EVS

A New Connected-Component Labeling Algorithm

An Evaluation of Automatic License Plate Recognition Vikas Kotagyale, Prof.S.D.Joshi

A Novel Morphological Method for Detection and Recognition of Vehicle License Plates

Software Development & Education Center NX 8.5 (CAD CAM CAE)

Mechanical Design. CATIA - 3D Functional Tolerancing and Annotations 2 (FTA) CATIA V5R20

Extraction of Newspaper Headlines from Microfilm for Automatic Indexing

Colour Profiling Using Multiple Colour Spaces

Chapter 1 Overview of an Engineering Drawing

Image Recognition for PCB Soldering Platform Controlled by Embedded Microchip Based on Hopfield Neural Network

Implementation of Face Detection System Based on ZYNQ FPGA Jing Feng1, a, Busheng Zheng1, b* and Hao Xiao1, c

Angle Measure and Plane Figures

Activity 5.2 Making Sketches in CAD

Weaving Density Evaluation with the Aid of Image Analysis

CIS581: Computer Vision and Computational Photography Homework: Cameras and Convolution Due: Sept. 14, 2017 at 3:00 pm

REPRESENTATION, RE-REPRESENTATION AND EMERGENCE IN COLLABORATIVE COMPUTER-AIDED DESIGN

Introduction to DSP ECE-S352 Fall Quarter 2000 Matlab Project 1

Preprocessing and Segregating Offline Gujarati Handwritten Datasheet for Character Recognition

DFTG 1305 UNIT 1. Semester: Spring 2016 Class #: Term: SS Instructor: Mays ALSabbagh

Real-Time Face Detection and Tracking for High Resolution Smart Camera System

Image Measurement of Roller Chain Board Based on CCD Qingmin Liu 1,a, Zhikui Liu 1,b, Qionghong Lei 2,c and Kui Zhang 1,d

A comparative study of different feature sets for recognition of handwritten Arabic numerals using a Multi Layer Perceptron

A Level Computer Science H446/02 Algorithms and programming. Practice paper - Set 1. Time allowed: 2 hours 30 minutes

Vehicle Number Plate Recognition Using Hybrid Mathematical Morphological Techniques

Robust Hand Gesture Recognition for Robotic Hand Control

Detection and Verification of Missing Components in SMD using AOI Techniques

Aesthetically Pleasing Azulejo Patterns

A System for Recognizing a Large Class of Engineering Drawings

Specification D data models

Automatic Electricity Meter Reading Based on Image Processing

Keyword: Morphological operation, template matching, license plate localization, character recognition.

Finger print Recognization. By M R Rahul Raj K Muralidhar A Papi Reddy

A Retargetable Framework for Interactive Diagram Recognition

Advanced Maximal Similarity Based Region Merging By User Interactions

ORTHOGRAPHIC PROJECTIONS. Ms. Sicola

Fast identification of individuals based on iris characteristics for biometric systems

COMPARATIVE PERFORMANCE ANALYSIS OF HAND GESTURE RECOGNITION TECHNIQUES

Image Extraction using Image Mining Technique

Traffic Sign Recognition Senior Project Final Report

An Improved Method of Computing Scale-Orientation Signatures

CLASS views from detail on a grid paper. (use appropriate line types to show features) - Optional views. Turn in for grading on class 6 (06/04)

VLSI Implementation of Impulse Noise Suppression in Images

Chapter 2. Drawing Sketches for Solid Models. Learning Objectives

Implementation of License Plate Recognition System in ARM Cortex A8 Board

Demosaicing Algorithm for Color Filter Arrays Based on SVMs

FPGA based Real-time Automatic Number Plate Recognition System for Modern License Plates in Sri Lanka

Relative Coordinates

AutoCAD 2D I. Module 16. Isometric and Dimensioning. IAT Curriculum Unit PREPARED BY. January 2011

CADValidator: A Critical Aid for the Model-Based Enterprise

Real-Time License Plate Localisation on FPGA

CHAPTER 4 LOCATING THE CENTER OF THE OPTIC DISC AND MACULA

Target Recognition and Tracking based on Data Fusion of Radar and Infrared Image Sensors

Laser Printer Source Forensics for Arbitrary Chinese Characters

Transcription:

A new method to recognize Dimension Sets and its application in Architectural Drawings Yalin Wang, Long Tang, Zesheng Tang P O Box 84-187, Tsinghua University Postoffice Beijing 100084, PRChina Email: wyl@sun5thittsinghuaeducn Abstract - Recognition of dimension sets is a prerequisite for engineering drawings interpretation A new heuristic algorithm for dimension sets recognition is presented Following the recognized text strings, the algorithm recognizes the dimension lines and other components of the dimension set according to the dimension set syntaxes The advantages of this algorithm to the architectural drawings are unmatched A data structure of planar position index is used and evaluated Experimental results are also presented Key word: Dimension Set Recognition, Engineering Drawings Interpretation, Text Segmentation, Pattern Recognition, architectural drawings I Introduction Engineering drawings are important vehicles to convey the mechanical and manufacturing information for 3-D solid objects Traditionally, the drawings are manually prepared on the drawing board using paper-and-pencil drafting techniques With the development of computer-aided design (CAD) and computer-aided manufacturing (CAM) techniques, more and more engineering drawings are in CAD formats There are many merits that can demonstrate the conversion[1]: easy in storage, retrieval, and query, convenient in revision, and cheap in filing, copying, accessing, and preparing these drawings Many of the existing paper engineering drawings are still active and operational, and there is a demand for a fast and reliable means to transform them into some standard CAD/CAM format Manually transferring paper drawings into such a format is labor-intensive and time-consuming, so it has become extremely desirable for the computer to process a wide variety of paper drawings and automatically create the CAD format drawings Automatic engineering drawings input and recognition system can serve for this function Such systems should recognize not only the geometry primitives which represent the object contours but also the annotation primitives which give the measurement of the object Dimension sets, which describe the logical relations among geometry primitives and annotations, should also be correctly recognized and interpreted, as it understanding is a prerequisite for the higher level understanding of engineering drawings Some approaches on recognizing dimension sets in the engineering drawings have been achieved[2,3,4,5] Dori[2]presented the fundamental method for recognition of engineering drawings which starts with arrowhead recognition, including the definition of dimension components with a set Ω = {C, W, S, L, A, T}, the concept of symmetry and regularity of dimensions on a functional basis Min et al [3] presented a web grammar based on arrowhead-match in accordance with the People Æs Republic of China national standard GB4458-84 for machine engineering drawings Their method can recognize 27 dimension patterns and 48 subpatterns which are based on morphological and functional classifications Although their method has more practical implementation background, they assume that preprocessing work can supply the method with all the identified arrowheads and texts However, due to the complexity of real-life engineering drawings and the presence of noise, it is still a tough problem for the vectorization module to supply a solid basis for their method Kasturi[4] developed a new rule-based text/graphics separation algorithm and a model-based procedure for detecting arrowheads in any orientation that are drawn to ANSI drafting standards LangranaÆs method [5] recognize dimension sets based on morphological filters With a manual input file of text, this method reconstructed geometry using variational geometry theory Their methods all begin with the scanned image and get the recognized dimension sets finally But the inherent difficulty with the 1

arrowhead recognition makes them strongly dependent on the image quality It also seems timeconsuming to integrate the text blocks with the corresponding leaders and the feature control frames In this paper, we propose a new algorithm to recognize dimension sets in the architecture drawings (an example is shown in Fig 1) based on dimension sets syntaxes, which is discussed in Section II In Section III, a data structure named Planar Position Index(PPI) is briefly described, which is employed to fully realize the new algorithm The advantages of this algorithm in building costs estimation on architecture drawings are unmatched, which is shown in Section IV Finally, a summary and future work are given in Section V II Recognition of Dimension Sets in Architectural Drawings Dimension sets are mainly used to define geometric characteristics such as lengths and relative locations of the various entities in the drawings They consist of entities such as dimension lines, witness lines and dimensioning text The block diagram of our dimension sets recognition procedure is shown in Fig 2 image Text/graphics separation Character candidates Graphics Fig 1 An example of architectural drawing OCR Vectorization Dimensioning texts Vectorized results Recognition of dimension sets Fig 2 Block diagram of dimension sets recognition procedure Recognized dimension sets Understanding drawings on 2D level 2

Our method works bottom-up First, we segment and recognize the characters in the drawings Second, the vectorization module produces vectorized lines We use dimension set syntaxes to recognize all the dimension lines Then we integrate them with dimensioning text and witness lines The dimensioning text recognition, recognition of dimension lines and integration of dimension sets are described in this section 1 Dimensioning Text Recognition Fig 3 gives an example of the dimension set in architectural drawings We can find there is no features of the dimension sets except that there is a text block near the dimension line Our method first segments and recognizes the dimension texts In our engineering drawing interpretation system, there is a specific module to deal with text/graphics separation and recognition[6] The goal of this module is aimed at recognizing all the characters that are in any direction and are either human Æs writing or machine print Reasonable as it is, it is a goal too ambitious to reach Its average correct rate is 90% To raise the accuracy rate, a new algorithm specially for architectural drawings which could get a higher accuracy rate, was developed The revision to the method is described below 3300 Fig 3 Example of dimension set in architecture drawings A Some criteria we use: We can use the three criteria to extract all the character candidates Connected component criteria: All the character candidates are connected components Size criteria: The characters in the engineering drawings have a fixed size range The most frequently appeared connected components should be characters, and the average size of them is charactersæsize Group criteria: The characters whose central points are in one horizontal or vertical line and whose distances to one another almost have the same value(with some tolerance) should be grouped as one string B Restriction to character direction and Special pattern file to the drawings For a machine printing architectural drawing, there are only two character directions: horizontal and vertical Because there is no oblique string in the drawings, we could get a high accuracy rate for the extracted strings The drawings we processed are all drawn by computer driven plotter, so the texts in them have a fixed character type It is possible and convenient for us to create a full pattern file which covers all the character types in the drawings During the recognition stage, a precisely matching template method was employed By doing these, we can get a high recognition accuracy C Simple touching character processing It is difficult to handle the touching characters in the engineering drawings because they have no major difference with the graphics in the drawings We use a projection method[7] to segment the simple touching characters First, we relax the size criteria a little not to discard the connected components whose sizes do not exceed the character size too much Then we define two functions as V(x) and Φ(x) to find the cutting position The position having sharp maximum value of Φ(x) is the cutting position where we can separate the touching characters Then we use recognition engine to verify whether it is really a touching character The verification process is a recursive one and it may test several cutting positions to get the correct character recognition results The two functions of V(x) and Φ(x) are defined as below Fig 4 shows one example of them 3

V(x): The function mapping horizontal position to the number of blob pixels in vertical column at that position V ( x 1) 2 V ( x) + V ( x + 1) 1 < x < Width Φ(x): ( ) 2 Recognition of Dimension Lines In [4], Kasturi et al describe a model-based procedure for detecting arrowheads Das and Langrana[5] developed an image segmentation process based on morphological filters to recognize arrowheads In both of the works, dimension lines are detected based on the recognized arrowheads Because of the special shape of arrowheads in the architectural drawings, their methods are useless here The algorithm we have designed is capable of handling the dimension lines in the architectural drawings First, our system extracts and recognizes the characters and vectorizes the other parts of the drawing Secondly, our algorithm uses the syntaxes the geometric objects yield to in the engineering drawing and a data structure named Planar Position Index to recognize the dimension lines Fig 5 shows recognized characters and vectorized results to the example drawing shown in Fig 1 In our present work, we only recognize the two surrounding layers of dimension sets(both horizontal and vertical ones) They all belong to the type of dimension sets shown in Fig 3 Some shape and location feature of the dimension sets are useful for us to identify the dimension lines We can define a dimension line as a line with a text block as dimensioning text attached at its back and lines as the witness lines perpendicular to it at its ends Based on this definition, we can start from the recognized dimensioning texts to find the dimension lines Fig 5 Recognized characters and vectorized results A Grouping of dimensioning texts The orientation of a dimension line is the same as the orientation of the dimensioning text The orientation information of dimensioning texts can be used to search the dimension lines they attach at We classify the dimensioning texts as horizontal and vertical, according to their orientations Two arrays, namely vert_group and horz_group are then created The dimensioning texts in the horz_group are stored in an ascending order of the y_coordinates of their centroid points The dimensioning texts in the vert_group are stored in an ascending order of the x_coordinates of their centroid points B Recognition of the outmost horizontal dimension lines 4

In Fig 3, the outmost horizontal dimension set is special Its dimensioning text is ô10200ö, but its dimension line is crossed by other three vertical lines The special way to recognize it is designed as below To the last dimension text whose y_coordinate of the centroid point is the biggest, a rectangle area is constructed under it to search the dimension line A horizontal line which meets two vertical lines at its ends will be found Then we search for another horizontal line at its ends to extend the line The work continues at both of its ends until no horizontal line can be found We regard all the horizontal lines as one horizontal line and record the long line Æs starting and ending y_coordinates We recognize the first dimension line which is located at the most outer layer C Recognition of dimension lines at the second outmost horizontal layer After removing the dimensioning text whose y_coordinate is the biggest from the horz_group, we can get several dimensioning texts whose y_coordinates of the centroid points almost equal to one another and bigger than other dimension texts For these dimension texts, rectangle searching areas will be constructed under them and horizontal lines which meet two vertical lines at their ends will be found No extension operation needs to do on this layer We record each lineæs start and end points Æposition and then recognize the dimension lines at the second layer D Error detection After we get the outmost two horizontal layers of dimension sets, we have the opportunity to detect the errors introduced by previous steps We calculate the sum of the dimensioning texts at the second outmost layer to verify whether it equals the outmost layeræs dimensioning texts If they do not equal to each other, an error alarm should be created We can use the similar method to find the vertical dimension lines which are located at the outermost two layers 3 Integration of dimension sets In other works[4,5], based on recognized dimension lines, a search rectangle was constructed to search for the dimension text which are either recognized or manually created Then leaders(the tail combined with the arrowhead is called leaders), witness lines and associated dimension texts are integrated That is to say, dimension sets are recognized Our method works in the reverse direction According to the information of dimension texts Æ locations, we recognize the dimension lines and witness lines then integrate them as dimension sets We can say we integrate the dimension sets at the same time as we recognize the dimension lines So our method is more efficient in the integration period than other methods The property we are most interested at is the locations of the dimension sets in the drawing We can know the real lengths for some lines in the drawings by this information So, we just record the start pointsæand end pointsæcoordinates and the dimension set values as the recognized dimension set results The results of recognized dimension sets in Fig 5 are shown in Table 1 and Table 2 After these works, we have recognized the dimension sets at the surrounding two layers from the scanned drawing image, which serves the basis for our further application in the architectural drawings Dimension set Number Dimension set value Starting x_coordinate Ending x_coordinate 0 10200 136 ± 5 2288 ± 5 1 3300 136 ± 5 512 ± 5 2 3600 512 ± 5 945 ± 5 3 1500 945 ± 5 1122 ± 5 4 1800 1122 ± 5 2288 ± 5 Table 1 Recognized horizontal dimension sets results Dimension set Number Dimension set value Starting y_coordinate Ending y_coordinate 0 12000 203 ± 5 1634 ± 5 1 5100 203 ± 5 820 ± 5 2 3600 820 ± 5 1242 ± 5 5

3 3300 1242 ± 5 1634 ± 5 III Data Structure and its evaluation During the recognition and integration period in our work and in [4,5], a position-to-object search was employed for many times Sequential lists are normally used to record objects With no geometric planar position relations among objects, searching an object in a given area require sequential search of the object lists This is inefficient because to check for the presence of some object we want to avoid searching objects in the whole drawing, when only a much smaller area of the drawing is of interest 1 Planar Position Index To facilitate geometric object search in a given area, a new data structure named Planar Position Index(PPI) [8]was constructed to the vectorized objects The entire drawing area is divided into adjacent horizontal strips of equal width Each strip has a strip number and each strip contains a node list holding zero or more nodes Each index node is a rectangle area in the drawing with a left boundary and right boundary The nodes in each strip are sorted by their boundaries The sorted index node sequence is realized as a balanced tree(b-tree) Fig 6 is an example of the part of the drawing and its data structure 0 A B Y i1 Y i2 Y i3 abc (a) Part of Engineering drawing Table 2 Recognized vertical dimension sets results Y i1 Y i2 Y i3 Instance of line A Instance of line B Instance of line text abc 2 BD tree In a drawing processor named AI-MUDAMS developed by Masao Sakauchi and Yutaka Ohsawa[9], they developed a data structure named BD tree to deal with vector data This data structure is similar with our method in many ways A brief description of BD tree can be found below On the BD tree, each vector data is represented by its centroid point and its circumscribed quadrilateral The plane in which centroid points exist is divided into two equal-size sub-plane successfully The axis of the division is selected alternatively from vertical and horizontal axes This decomposition process continues until just one centroid point data exists in each divided sub-plane Fig 7 shows one example of the BD tree construction 2 2 1 (b) PPI construction for (a) Fig 6 Part of Engineering drawing and its PPI construction 3 Comparison 6

In both of methods, searching an objet at some position is more efficient than normal sequential searching method The time complexity of searching an object in PPI is O( n log n), where n is the average number of nodes in a strip The time complexity of searching an object in BD tree is no less than O( N log N ), where N is the number of all the vector objects in the drawing The time complexity is larger than O( N log N ) when the tree is not balanced, which depends on the contents of the drawings To the space complexity, PPI records one object several times(when it passes more strips than one strip), but BD tree records one object only once So the space complexity of PPI is larger than BD tree As a combination of consideration of time and space complexity, the time complexity seems more important than space complexity for the drawing processor and the larger space occupied by PPI is tolerant So PPI is more reliable and efficient than BD tree IV Building Costs Estimation After we recognize the dimension sets in the architectural drawing, we have the opportunity to do some understanding of the drawings on 2-D level The application mentioned here comes from a real application background Before an architectural engineer begins a new building project, he always wants to estimate the building costs of part or whole of the building The building costs can be gotten by a definite calculation on the area of the building zone So an automatic calculation of zones Æarea from a scanned architectural drawing is desirable By using our new dimension sets recognition methods, we can easily recognize the dimension sets which are useful for the estimation So when one user selects on zone(always in rectangle shape) in the drawings, our program can tell the user the real parameters(eg the length and width for the rectangle zone) and the areas of the zones For example, if the user selects the whole building he can pick up the upper-left point and lower-right point of the zone in the drawing Based on the coordinates of the two points, the software can know the dimension set values on both horizontal and vertical directions are 10, 200 and 12, 000 So the user can be told by the software the zone area he selects is 10, 200 12, 000 = 122,400, 000 In other words, we realize a 2-D understanding to the architectural drawings V Implementation and future work The system for building costs estimation was developed in C++ in MS-Windows environment and runs on PCs Currently our system realizes the surrounding two layers of the dimension sets group, which can serve quite well to the area calculation to the building without consideration of the width of the wall, the existence of windows and some rooms in special shapes Our current method does not pay much attention to the recognition of witness lines, so we have much difficulty in the automatic area calculation in the architectural drawings They are also the future research works for us to do Reference 1 T Kanungo, RM Haralick, and D Dori Understanding engineering drawings: A survey In International Workshop on Graphics Recognition, pages 119-130, 1995 2 D Dori A Syntactic geometric approach to recognition of dimension in engineering machine drawings Computer Vision Graphics Image Process, 47, pages 271-291, 1989 3 W Min, Z Tang, and L Tang Recognition of dimensions in engineering drawing based on arrowhead-match In Proceedings of the International Conference on Document Analysis and Recognition, pages 373-376, 1993 4 CP Lai and R Kasturi Detection of dimensions sets in engineering drawings In Proceedings of the International Conference on Document Analysis and Recognition, pages 606-613, 1993 7

5 AK Das and NA Langrana Recognition of dimension sets and integration with vectorized engineering drawings In Proceedings of the International Conference on Document Analysis and Recognition, pages 347-350, 1995 6 J Gao, L Tang, W Liu and Z Tang Segmentation and Recognition of Dimension Text in Engineering Drawings In Proceedings of the International Conference on Document Analysis and Recognition, pages 528-531, 1995 7 S Kahan, T Pavlidis and HS Baird On the Recognition of Printed Characters of Any Font and Size In IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol PAMI-9, No 2, 1987 8 Liu W, D Dori, T Long, and T Zesheng Object recognition in engineering drawings using planar position indexing In International Workshop on Graphics Recognition, pages 53-61, 1995 9 M Sakauchi and Y Ohsawa The AI-MUDAMS, The Drawing Processor Based on the Multidimensional Pattern Data Structure In IEEE Computer Society Workshop on Computer Architecture for Pattern Analysis and Image Management Database - Capaidm Miami Beach, Florida, Nov 18-20, 1985 8