Tesseract不会识别png文件中的验证码，该文件包含英文字母的数字和字母

Question

我需要从url中提取验证码并使用Tesseract识别它。我的代码是：

#!/usr/bin/perl -X
###
$user = 'user'; #Enter your username here
$pass = 'pass'; #Enter your password here
###
#Server settings
$home = "http://perltest.adavice.com";
$url = "$home/c/test.cgi?u=$user&p=$pass";
#Get html code!
$html = `GET "$url"`
###Add code here!
#Grab img from HTML code
if ($html =~ m%img[^>]*src="(/[^"]*)"%s)
{
    $img = $1;
}
###
die "<img> not found
" if (!$img);
#Download image to server (save as: ocr_me.img)
print "GET '$home$img' > ocr_me.img
";
system "GET '$home$img' > ocr_me.img";
###Add code here!
#Run OCR (using shell command tesseract) on img and save text as ocr_result.txt
system("tesseract ocr_me.img ocr_result");
print "GET '$txt' > ocr_result.txt
";
system "GET '$txt' > ocr_result.txt";
###
die "ocr_result.txt not found
" if (!-e "ocr_result.txt");
# check OCR results:
$txt = 'cat ocr_result.txt';
$txt =~ s/[^A-Za-z0-9-_.]+//sg;
$img =~ s/^.*///;
print `echo -n "file=$img&text=$txt" | POST "$url"`;

图像正确解析。此图片包含captcha，看起来像：

我的输出是：

GET 'http://perltest.adavice.com/captcha/1533110309.png' > ocr_me.img
Tesseract Open Source OCR Engine v3.02.02 with Leptonica
GET '' > ocr_result.txt
Captcha text not specified

如您所见，脚本正确解析图像。但Tesseract在PNG文件中没有看到任何内容。我试图用shell命令tesseract指定其他参数，如-psm和-l，但这也没有给出任何内容

更新：阅读答案@Dave Cross后，我尝试了他的建议。

在输出中我得到：

http://perltest.adavice.com/captcha/1533141024.png
ocr_me.img
Tesseract Open Source OCR Engine v3.02.02 with Leptonica
[]
200Captcha text not specified
Original image file not specified
Captcha text not specified

为什么我需要来自图像.PNG的文字？也许这些额外的信息可以帮助您。看看：

这就是$ url在浏览器中的样子。我的目标是使用perl在wim中为此页面创建查询。为此，我需要填写$ user，$ pass和$ txt之上的表格（来自Tesseract图像的识别）。并使用POST'url'发送它（代码中的最后一个字符串）。