Article title

Character Extraction in Web Image for Text Recognition






 

Table of contents

Abstract

Introduction

Proposed technique

Experiments and discussion

Conclusion





Excerpt from the article

Image smoothing and binarization

To obtain good recognition results, the text and background of a web image should first be smoothed. In the ideal case, all pixel intensities within one class (text or background) are equal, and intensity differences occur only at the boundary between text and background. The smoothed image should therefore contain as few intensity changes as possible, subject to its difference from the original image.
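The excerpt states this objective only in words. One widely used way to write it down is the L0 gradient minimization form of L0 smoothing, given below as a sketch. The notation is an assumption introduced here and does not come from the excerpt: I is the input image, S the smoothed result, p a pixel index, and λ a weight that trades fidelity to the original against the number of intensity changes.

```latex
\min_{S} \; \sum_{p} \left( S_p - I_p \right)^2 + \lambda \, C(S),
\qquad
C(S) = \#\left\{\, p \;:\; \left|\partial_x S_p\right| + \left|\partial_y S_p\right| \neq 0 \,\right\}
```

Here C(S) counts the pixels at which the smoothed image changes intensity, so a larger λ yields flatter regions with fewer, sharper boundaries.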








 

Keywords:

Character Extraction in Web Image for Text Recognition

Bolan Su (1,2,*), Shijian Lu (2,+), Trung Quy Phan (1) and Chew Lim Tan (1,*)

(1) Department of Computer Science, School of Computing, National University of Singapore, Computing 1, 13 Computing Drive, Singapore 117417
(2) Department of Computer Vision and Image Understanding, Institute for Infocomm Research, 1 Fusionopolis Way, #21-01 Connexis, Singapore 138632
(*) {subolan,phanquyt,tancl}@comp.nus.edu.sg, (+) slu@i2r.a-star.edu.sg

Abstract

Images with text are frequently used on the Internet for different purposes. Automatic recognition of text from web images plays an important role in the extraction and retrieval of web information. However, web images usually have low resolution, with artifacts and special effects, which makes word recognition a challenging task even after the text has been localized. In this paper, we propose a robust text recognition technique to efficiently convert web images into text format. The proposed technique first uses L0 norm smoothing to increase the edge contrast of the input web images. The images are then binarized on each color channel. A connected component analysis follows to identify the possible character components. Finally, the character candidates are recognized by the OCR engine after skew correction. Extensive experiments have been conducted on the latest ICDAR 2011 robust reading competition dataset for born-digital text. The experimental results show the superior performance of our proposed technique.

1. Introduction

The number of images on the Internet has increased tremendously in recent years. Many of these images contain text information that cannot be found elsewhere on the web pages [2]. Recognizing the textual information within web images is very helpful for a better understanding of the contents of web pages.

As these images with embedded text are used on the Internet for different purposes, text recognition in web images can be applied to different kinds of applications, such as web page indexing and retrieval and web page content filtering [3]. It will become even more important as the textual information within web images is contributing more and more due

Figure 1. Some low quality web image examples, panels (a) to (d).
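The middle two steps of the pipeline summarized in the abstract (binarization on each color channel, then connected component analysis) can be illustrated with a short sketch. This is not the authors' implementation. It uses Otsu thresholding as a stand-in, because the text above does not say which binarization method the paper uses. It treats only the brighter side of each channel as foreground, and it omits the L0 smoothing, skew correction, and OCR steps. The function names `otsu_threshold`, `label_components`, and `character_candidates` are hypothetical.

```python
from collections import deque

import numpy as np


def otsu_threshold(channel: np.ndarray) -> int:
    """Otsu threshold for one uint8 channel (assumed binarization method)."""
    hist = np.bincount(channel.ravel(), minlength=256).astype(float)
    levels = np.arange(256)
    w0 = np.cumsum(hist)               # number of pixels with value <= t
    w1 = hist.sum() - w0               # number of pixels with value > t
    m0 = np.cumsum(hist * levels)      # summed intensity of pixels <= t
    valid = (w0 > 0) & (w1 > 0)        # both classes must be non-empty
    between = np.zeros(256)
    mu0 = m0[valid] / w0[valid]
    mu1 = (m0[-1] - m0[valid]) / w1[valid]
    between[valid] = w0[valid] * w1[valid] * (mu0 - mu1) ** 2
    return int(np.argmax(between))     # maximizes between-class variance


def label_components(mask: np.ndarray):
    """4-connected component labeling by breadth-first flood fill."""
    h, w = mask.shape
    labels = np.zeros((h, w), dtype=int)
    count = 0
    for y in range(h):
        for x in range(w):
            if not mask[y, x] or labels[y, x] != 0:
                continue
            count += 1
            labels[y, x] = count
            queue = deque([(y, x)])
            while queue:
                cy, cx = queue.popleft()
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if 0 <= ny < h and 0 <= nx < w and mask[ny, nx] and labels[ny, nx] == 0:
                        labels[ny, nx] = count
                        queue.append((ny, nx))
    return labels, count


def character_candidates(image: np.ndarray):
    """Return (channel, x_min, y_min, x_max, y_max) for each bright component.

    `image` is an H x W x 3 uint8 array. Only pixels above the threshold are
    treated as foreground, which is a simplification for this sketch.
    """
    boxes = []
    for c in range(image.shape[2]):
        channel = image[:, :, c]
        mask = channel > otsu_threshold(channel)
        labels, count = label_components(mask)
        for k in range(1, count + 1):
            ys, xs = np.nonzero(labels == k)
            boxes.append((c, int(xs.min()), int(ys.min()), int(xs.max()), int(ys.max())))
    return boxes
```

A fuller version would also handle dark text on a light background, for example by labeling the inverted mask, and would filter components by size and aspect ratio before passing them to an OCR engine.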