[1]张婷婷,马明栋,王得玉.OCR 文字识别技术的研究[J].计算机技术与发展,2020,30(04):85-88.[doi:10. 3969 / j. issn. 1673-629X. 2020. 04. 016]
 ZHANG Ting-ting,MA Ming-dong,WANG De-yu.Research on OCR Technology[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2020,30(04):85-88.[doi:10. 3969 / j. issn. 1673-629X. 2020. 04. 016]
点击复制

OCR 文字识别技术的研究()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
30
期数:
2020年04期
页码:
85-88
栏目:
智能、算法、系统工程
出版日期:
2020-04-10

文章信息/Info

Title:
Research on OCR Technology
文章编号:
1673-629X(2020)04-0085-04
作者:
张婷婷马明栋王得玉
南京邮电大学,江苏 南京 210003
Author(s):
ZHANG Ting-tingMA Ming-dongWANG De-yu
Nanjing University of Posts and Telecommunications,Nanjing 210003,China
关键词:
OCR文字识别post 方法图像处理
Keywords:
OCRcharacter recognitionpost methodimage processing
分类号:
TP31
DOI:
10. 3969 / j. issn. 1673-629X. 2020. 04. 016
摘要:
图像中的文字在当下相机高速发展下显得尤为重要,人们开始通过拍摄照片直接进行图像上文字的识别,最常用的就是寄快递收寄地址的识别。 其中用到的技术是 OCR (optical character recognition)字符识别技术,其中文名字叫做光学字符识别。 它是利用光学技术和计算机技术通过检测字符每个像素的暗、亮模式确定其形状,然后用字符识别方法将形状翻译成计算机文字的过程。 随着日常生活网络化的推进,各种纸质文档的数字化智能化识别进程也在加速。 经过二十世纪九十年代的发展,对字符识别技术的研究已经取得了很大的进展,市场上目前正在使用的各种 OCR 识别软件层出不穷。 但是以往对证件的识别是一个比较大的难题。文中的研究主要是对普通的文字进行识别。 识别系统包括三个模块:图像预处理、图像分割、字符识别。 前两个模块又包含图像的二值化分析、灰度化等,对其进行了描述。
Abstract:
With the rapid development of cameras,the text in images is particularly important. People begin to recognize the text on images directly by taking photos,and the most commonly used method is to recognize the address of receiving and mailing by express delivery. The technology used is OCR, optical character recognition, which is a process of using optical technology and computer technology to determine the shape of each pixel by detec- ting the dark and bright mode of the character,and then translating the shape into computer text by character recognition method. With the developm- ent of network in daily life, the digital and intelligent recognition process of all kinds of paper documents is also accelerating. After the development of character recognition technology in the 1990s,great progress has been made. Various OCR recognition software are being used in the market. But the identification of documents is a big problem in the past. We mainly focus on the recognition of common characters. The recognition system consists of three modules:image preprocessing,image segmentation and character recognition. The first two modules also include image binarization analysis,grayscale and so on,which are described.

相似文献/References:

[1]陈梓洋,王宇飞,钱侃,等. 自然场景下基于区域检测的文字识别算法[J].计算机技术与发展,2015,25(07):230.
 CHEN Zi-yang,WANG Yu-fei,QIAN Kan,et al. Character Recognition Algorithm Based on Region Detection in Natural Scene[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2015,25(04):230.
[2]任荣梓,高航. 基于反馈合并的中英文混排版面OCR技术研究[J].计算机技术与发展,2017,27(03):39.
 REN Rong-zi,GAO Hang. Investigation on Layout Analysis Technology of Chinese and English Mixed OCR Based on Feedback Merging[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2017,27(04):39.
[3]曾 悦,马明栋.基于 Tesseract_OCR 文字识别的研究[J].计算机技术与发展,2021,31(11):76.[doi:10. 3969 / j. issn. 1673-629X. 2021. 11. 013]
 ZENG Yue,MA Ming-dong.Research on Text Recognition Based on Tesseract_OCR[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2021,31(04):76.[doi:10. 3969 / j. issn. 1673-629X. 2021. 11. 013]
[4]蒋子敏,刘宁钟,沈家全.基于轻量级网络的 PCB 芯片文字识别[J].计算机技术与发展,2021,31(12):55.[doi:10. 3969 / j. issn. 1673-629X. 2021. 12. 010]
 JIANG Zi-min,LIU Ning-zhong,SHEN Jia-quan.PCB Chip Text Recognition Based on Lightweight Network[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2021,31(04):55.[doi:10. 3969 / j. issn. 1673-629X. 2021. 12. 010]
[5]童 攀,龙炳鑫,拥 措 *.基于注意力机制藏文乌金体古籍文字识别研究[J].计算机技术与发展,2023,33(10):163.[doi:10. 3969 / j. issn. 1673-629X. 2023. 10. 025]
 TONG Pan,LONG Bing-xin,YONG Cuo*.Research on Tibetan Ujin Ancient Book Character Recognition Based on Attention Mechanism[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2023,33(04):163.[doi:10. 3969 / j. issn. 1673-629X. 2023. 10. 025]
[6]章 安,马明栋.基于 Tesseract 文字识别的预处理研究[J].计算机技术与发展,2021,31(01):73.[doi:10. 3969 / j. issn. 1673-629X. 2021. 01. 013]
 ZHANG An,MA Ming-dong.Research on Preprocessing Based on Tesseract Text Recognition[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2021,31(04):73.[doi:10. 3969 / j. issn. 1673-629X. 2021. 01. 013]

更新日期/Last Update: 2020-04-10