首页> 外文期刊>International journal of synthetic emotions >Feature based Text Extraction System using Connected Component Method
【24h】

Feature based Text Extraction System using Connected Component Method

机译:基于连接组件方法的基于特征的文本提取系统

获取原文
获取原文并翻译 | 示例
           

摘要

Text detection and segmentation system serves as important method for document analysis as it helps in many content based image analysis tasks. This research paper proposes a connected component technique for text extraction and character segmentation using maximally stable extremal regions (MSERs) for text line formation followed by connected components to determined separate characters. The system uses a cluster size of five which is selected by experimental evaluation for identifying characters. Sobel edge detector is used as it reduces the execution time but at the same time maintains quality of the results. The algorithm is tested along a set of JPEG, PNG and BMP images over varying features like font size, style, colour, background colour and text variation. Further the CPU time in execution of the algorithm with three different edge detectors namely prewitt, sobel and canny is observed. Text identification using MSER gave very good results whereas character segmentation gave on average 94.572% accuracy for the various test cases considered for this study.
机译:文本检测和分割系统作为文档分析的重要方法,因为它可以帮助完成许多基于内容的图像分析任务。本研究论文提出了一种用于文本提取和字符分割的连接组件技术,该技术使用最大稳定的极值区域(MSER)进行文本行形成,然后使用连接的组件来确定单独的字符。该系统使用的簇大小为5,该簇大小是通过实验评估选择的,用于识别字符。使用Sobel边缘检测器是因为它减少了执行时间,但同时又保持了结果的质量。沿着一组JPEG,PNG和BMP图像对算法进行了测试,这些图像具有各种特征,例如字体大小,样式,颜色,背景颜色和文本变化。此外,观察到使用三个不同的边缘检测器(即prewitt,sobel和canny)执行算法时的CPU时间。对于本研究考虑的各种测试用例,使用MSER进行的文本识别给出了很好的结果,而字符分割的平均准确性为94.572%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号