Python 3最重要的新特性大概要算是对文本和二进制数据作了更为清晰的区分。文本总是Unicode,由str类型表示,二进制数据则由bytes类型表示。Python 3不会以任意隐式的方式混用str和bytes,正是这使得两者的区分特别清晰。你不能拼接字符串和字节包,也无法在字节包里搜索字符串(反之亦然),也不能将字符串传入参数为字节包的函数(反之亦然).
python3.0中怎么创建bytes型数据
1
2
|
bytes([ 1 , 2 , 3 , 4 , 5 , 6 , 7 , 8 , 9 ]) bytes( "python" , ‘ascii‘ ) # 字符串,编码 |
设置一个原始的字符串
1
2
3
4
5
6
|
>>> website = ‘http://www.169it.com/os‘ >>> type (website) < class ‘str‘ > >>> website ‘http://www.169it.com/os‘ >>> |
按utf-8的方式编码,转成bytes
1
2
3
4
5
6
|
>>> website_bytes_utf8 = website.encode(encoding = "utf-8" ) >>> type (website_bytes_utf8) < class ‘bytes‘ > >>> website_bytes_utf8 b ‘http://www.169it.com/os‘ >>> |
按gb2312的方式编码,转成bytes
1
2
3
4
5
6
|
>>> website_bytes_gb2312 = website.encode(encoding = "gb2312" ) >>> type (website_bytes_gb2312) < class ‘bytes‘ > >>> website_bytes_gb2312 b ‘http://www.169it.com/os‘ >>> |
解码成string,默认不填
1
2
3
4
5
6
7
|
>>> website_string = website_bytes_utf8.decode() >>> type (website_string) < class ‘str‘ > >>> website_string ‘http://www.169it.com/os‘ >>> >>> |
解码成string,使用gb2312的方式
1
2
3
4
5
6
|
>>> website_string_gb2312 = website_bytes_gb2312.decode( "gb2312" ) >>> type (website_string_gb2312) < class ‘str‘ > >>> website_string_gb2312 ‘http://www.169it.com/os‘ >>> |