python实现KNN,识别手写数字

Posted

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了python实现KNN,识别手写数字相关的知识,希望对你有一定的参考价值。

写了识别手写数字的KNN算法,如下图所示。参考链接http://blog.csdn.net/april_newnew/article/details/44176059。

技术分享

# -*- coding: utf-8 -*-

import numpy as np
import pandas as pd
import os
def readtxt(filename):
    text=[]
    f = open(filename,r,encoding=utf-8)
    for line in f.readlines():
        text.append(line)
    txt = list(text)
    txt=np.array(txt,dtype=float)
    txt = txt.tolist()
    return txt

def readdata(rootfile):
    data = []
    label = []
    for root,dirs,files in os.walk(rootfile):
        for name in files:
            filename = root +\\\\+name
            txt = readtxt(filename)
            data.append(txt)
            label1 = name.split(_)[0]
            label.append(label1)
    data = pd.DataFrame(data)
    return data,label

def KNN(traindata,trainlabel,testdatai,K):
    length = len(traindata)
    newtest = np.tile(testdatai, (length,1))
    newtest = pd.DataFrame(newtest)
    diff = newtest - traindata
    diff = diff**2
    cha = diff.sum(axis=1)
    cha = cha**0.5
    result = pd.DataFrame({label:trainlabel,
                       cha:cha})
    labels = result.sort_values(by=cha)[:K]
    frequent =labels.groupby(labels[label]).size()
    labely = frequent.argmax()
    return labely
        
def test(trainfile,testfile,K):
    result = []
    traindata, trainlabel= readdata(trainfile)
    testdata, testlabel = readdata(testfile)
    for i in range(len(testdata)):
        labely = KNN(traindata,trainlabel,testdata.loc[i,:],K)
        result.append(labely)
    tongji  = pd.DataFrame({result:result,testlabel:testlabel})
    accuary = len(tongji[tongji[result]==tongji[testlabel]])/len(result)
    return result,accuary
    
trainfile=rE:\\trainingDigits
testfile=rE:\\testDigits
K=3    
result, accuary= test(trainfile,testfile,K)
            

注:训练数据集有2,210条记录,测试数据有670条。准确率并不高,只有0.45。目前不知道为什么,以后多学习,争取优化代码。

以上是关于python实现KNN,识别手写数字的主要内容,如果未能解决你的问题,请参考以下文章

OpenCV-Python实战(番外篇)——利用 KNN 算法识别手写数字

[3] python:使用KNN识别手写数字

Python实验--手写五折交叉验证+调库实现SVM/RFC/KNN手写数字识别

Python,OpenCV使用KNN来构建手写数字及字母识别OCR

基于python Knn 算法识别手写数字,计算准确率 ——第二弹

KNN分类算法实现手写数字识别