python学习之爬虫：BeautifulSoup

Posted 2020-10-17

tags:

篇首语：本文由小常识网(cha138.com)小编为大家整理，主要介绍了python学习之爬虫：BeautifulSoup相关的知识，希望对你有一定的参考价值。

一、功能：

BeautifulSoup是用来从HTML或XML中提取数据的Python库。

from bs4 import BeautifulSoup

import bs4

三、编码格式：

soup使用Unicode编码

有四种类型：Tag，NavigableString，BeautifulSoup，Comment。
BeautifulSoup将文档转化为树形结构，每个节点都是上述四种类型的Python对象。

tag属性：name、attrs

参考网址：

1、http://python.jobbole.com/84774/

2、https://www.crummy.com/software/BeautifulSoup/bs4/doc/#making-the-soup

3、http://wiki.jikexueyuan.com/project/python-crawler-guide/beautiful-soup.html

以上是关于python学习之爬虫：BeautifulSoup的主要内容，如果未能解决你的问题，请参考以下文章