halcon学习笔记-ocr识别

Posted 2021-06-11 殇堼

tags:

篇首语：本文由小常识网(cha138.com)小编为大家整理，主要介绍了halcon学习笔记-ocr识别相关的知识，希望对你有一定的参考价值。

ocr识别

关键：字符分割、字符定位

分类器的种类

1.mlp神经网络 2.支持向量机SVM 3.KNN 4.混合高斯模型，将识别目标的特征进行分类、训练，下一次识别的时候符合哪个特征，从而得出识别结果。
特征种类：颜色、形状、概率特征、描述算子特征等等
mlp神经网络：输入层、中间层、输出层

ocr识别的步骤

1.创建分类器之前要进行训练：其中 .trf格式文件包含着用于训练的样本数据和样本名称，trf格式文件在halcon里面是一个训练文件，将图像（连通域）和字符进行关联。
2.创建分类器
3.训练分类器
4.识别

示例1：使用mlp神经网络做ocr识别。

* This example program shows how to use a simple MLP OCR classifier
* 
dev_update_off ()
read_image (Image, 'letters')
get_image_size (Image, Width, Height)
dev_close_window ()
dev_open_window_fit_image (Image, 0, 0, -1, -1, WindowHandle)
dev_set_colored (12)
set_display_font (WindowHandle, 16, 'mono', 'true', 'false')
get_tmp_dir (TmpDir)
TrainFile := TmpDir + '/letters.trf'
dev_display (Image)
gen_rectangle1 (Rectangle, 0, 0, Height - 1, 400)
reduce_domain (Image, Rectangle, Image)

* Segment the image图像分割
binary_threshold (Image, Region, 'max_separability', 'dark', UsedThreshold)

* Connect the i's and j's with their dots 膨胀扩大区域
dilation_circle (Region, RegionDilation, 3.5)
* Compute the correct connected components 计算出正确的连通域
connection (RegionDilation, ConnectedRegions)
* Reduce each connected component (character) to its original shape
intersection (ConnectedRegions, Region, RegionIntersection)
* Sort the characters line-by-line
sort_region (RegionIntersection, Characters, 'character', 'true', 'row')
dev_display (Characters)
disp_message (WindowHandle, 'Training characters', 'window', 12, 12, 'black', 'true')
disp_continue_message (WindowHandle, 'black', 'true')
stop ()
* Compute the true class of each character
count_obj (Characters, Number)
Length := Number / 27
Classes := []
for J := 0 to 25 by 1
    Classes := [Classes,gen_tuple_const(Length,chr(ord('a') + J))]
endfor
Classes := [Classes,gen_tuple_const(Length,'.')]
* Construct the necessary training file from the segmented characters
write_ocr_trainf (Characters, Image, Classes, TrainFile)
* Create the classifier.  We read out the classes from the train file.
* Therefore, the training part of the program is generic and can be
* used to train any OCR classifier.
read_ocr_trainf_names (TrainFile, CharacterNames, CharacterCount)
create_ocr_class_mlp (8, 10, 'constant', 'default', CharacterNames, 20, 'canonical_variates', 26, 42, OCRHandle)
* Train the classifier
trainf_ocr_class_mlp (OCRHandle, TrainFile, 100, 0.01, 0.01, Error, ErrorLog)
* 
* Now test the classifier on the whole training image
full_domain (Image, Image)
* Segment characters the same way as before
binary_threshold (Image, Region, 'max_separability', 'dark', UsedThreshold1)
dilation_circle (Region, RegionDilation, 3.5)
connection (RegionDilation, ConnectedRegions)
intersection (ConnectedRegions, Region, RegionIntersection)
sort_region (RegionIntersection, Characters, 'character', 'true', 'row')
* Classification分类
do_ocr_multi_class_mlp (Characters, Image, OCRHandle, Class, Confidence)
* 
* Display results显示结果
area_center (Characters, Area, Row, Column)
dev_display (Image)
set_display_font (WindowHandle, 16, 'sans', 'true', 'false')
for J := 0 to |Row| - 1 by 1
    disp_message (WindowHandle, Class[J], 'image', Row[J] - 16, Column[J] + 8, 'blue', 'false')
endfor
set_display_font (WindowHandle, 16, 'mono', 'true', 'false')
disp_message (WindowHandle, 'Classification result', 'window', 12, 12, 'black', 'true')
clear_ocr_class_mlp (OCRHandle)
dev_set_check ('~give_error')
delete_file (TrainFile)
dev_set_check ('give_error')

以上是关于halcon学习笔记-ocr识别的主要内容，如果未能解决你的问题，请参考以下文章

Halcon学习笔记-分类器mlp的解读

halcon学习和实践（svm模型训练实战）

Halcon学习笔记——条形码的定位与识别

python ocr(光学文字识别）学习笔记

python ocr(光学文字识别)学习笔记

Halcon学习笔记-手眼标定-九点法