Python - PyPDF2错过了大量的文本。 Windows上的任何替代方案？

Question

我试图用PyPDF2解析一个pdf文件，但我只检索了大约10％的文本。对于剩余的90％，pyPDF2仅带回换行...有点令人沮丧。

你知道在Windows上运行Python的任何替代方案吗？我听说过pdftotext，但似乎我无法安装它，因为我的电脑不能在Linux上运行。

任何的想法？

import PyPDF2

filename = 'Doc.pdf'
pdf_file = PyPDF2.PdfFileReader(open(filename, 'rb'))

print(pdf_file.getPage(0).extractText())