Python数据类型：序列(字符串str列表list元组tuple字典dict范围range) 和集合set

Posted 2023-01-09 风流少年

tags:

篇首语：本文由小常识网(cha138.com)小编为大家整理，主要介绍了Python数据类型：序列(字符串str列表list元组tuple字典dict范围range) 和集合set相关的知识，希望对你有一定的参考价值。

一：序列

序列sequence是多个值组成的一个整体，Python中的序列包含列表list、元组tuple、范围range、字符串str，集合set不属于序列。

二：字符串str

2.1原始字符串

r表示原始字符串，表示把特殊字符串也当做普通的字符，常用于定义正则表达式(正则表达式经常使用到很多特殊字符)。

# Hello
#  Python
s = "Hello \\n Python"
# Hello \\n Python
s2 = r"Hello \\n Python"

import re

pattern = r'[0-9a-zA-Z.]+@[0-9a-zA-Z.]+?com'
match = re.match(pattern, '123@163.com')
if match:
    print(True)

repr(s): 使用print() 打印时会将特殊字符作为普通字符打印，并在两端加上单引号。

s = "物品\\t单价\\t数量\\n包子\\t1\\t2"
# 物品	单价	数量
# 包子	1	2
print(s)

# 物品\\t单价\\t数量\\n包子\\t1\\t2
print(r'物品\\t单价\\t数量\\n包子\\t1\\t2')

# '物品\\t单价\\t数量\\n包子\\t1\\t2'
print(repr(s))

2.2 转换大写upper()/转换小写lower()/首字符转大写capitalize()/所有单词首字符大写

line = 'this is line string'
# THIS IS LINE STRING
upper = line.upper()
# this is line string
lower = upper.lower()

a = 'hello world'
# Hello world
print(a.capitalize())
# Hello World
print(a.title())

2.3 左对齐/右对齐/居中对齐

a = 'python'
b = 'java'
# python
# java
print(a.ljust(10))
print(b.ljust(10))
#     python
#       java
print(a.rjust(10))
print(b.rjust(10))

# python____
# ______java
print(a.ljust(10, '_'))
print(b.rjust(10, '_'))


a = 'python'
b = 'java'
# _______python_______
# ________java________
print(a.center(20, '_'))
print(b.center(20, '_'))

2.4 前缀startswith()和后缀endswith()

line = 'this is line string'
# True
yes = line.startswith('this')
# True
result = line.endswith('string')

2.5 替换 replace()

line = 'this is line string'
# this is line str
replace = line.replace('string', 'str')

2.6 分割 split()

line = 'this is line string'
# ['this', 'is', 'line', 'string']
items = line.split(' ')

2.7 索引 index()

查找字符串第一次出现的索引，如果找不到抛出异常ValueError。

line = 'this is line string'
# ValueError: substring not found
pos = line.index('line2')

2.8 查找 find()

与index()不同的是find()如果找不到返回-1而不会抛出异常ValueError。

line = 'this is line string'
# -1
pos = line.find('line2')

2.9 统计字符串出现的次数 count()

字符串对应的count()可以指定区间：开始索引，结束索引。

line = 'this is line string'
# 2
count = line.count('is')

# 1
count = line.count("is", 4, len(line))

3.10去除首尾空格和换行符 strip() | lstrip() | rstrip()

line = ' \\n this is line string\\n '
strip = line.strip()
print(strip)

lstrip = line.lstrip()
rstrip = line.rstrip()

3.11 是否为数字

# True
print("123".isnumeric())
# True
print("66".isdecimal())

3.11 格式化 format()

# 方式一
line = "1  2  3 "
print(line.format('a', 'b', 'c'))

# 方式二
line = "1 %s 2 %d 3 %f"
print(line % ('张三', 18, 66.0))

# 方式三
line = "姓名name 年龄 age".format(name='张无忌', age=28)
print(line)

# 方式四
name = '张无忌'
age = 30
line = f"姓名name 年龄 age"
print(line)

3.12 字符串和二进制转换

如果要在网络上传输，或者保存到磁盘上，就需要把str变为以字节为单位的bytes, 通过encode()方法将字符串转为字节。Python对bytes类型的数据用带b前缀的单引号或双引号表示。x = b"Hello"

反过来，如果我们从网络或磁盘上读取了字节流，那么读到的数据就是bytes。要把bytes变为str，就需要用decode()方法；

在操作字符串时，我们经常遇到str和bytes的互相转换。为了避免乱码问题，应当始终坚持使用UTF-8编码对str和bytes进行转换。

str -> bytes
# b'Hello'
"Hello".encode('utf-8')

bytes -> str
b'Hello'.decode('utf-8')
'Hello'

循环字符串

foorbar = 'foobar'
# f
print(foorbar[0])

# f-o-o-b-a-r-
for i in foorbar:
    print(i, end='-')

三：列表list

字符串转列表list()

['a', 'b', 'c']
print(list('abc'))

正负索引

正索引：从0开始，方向从左到右。
负索引：从-1开始，方向从右到左。

chars = ['a', 'b', 'c', 'd', 'e', 'f']
# nums[0]=a, nums[-1]=f
print(f"chars[0]=nums[0], nums[-1]=chars[-1]")

切片

[开始索引:结束索引:步长]

开始索引: 可以是正索引也可以是负索引，如果省略开始索引默认从头开始。
结束索引：可以是正索引也可以是负索引，如果省略结束索引默认到列表的结尾，不包含结束索引元素。
步长：默认为1，大于1表示：包含第一个元素，然后每隔n -1 获取指定的元素。

切片获取

chars = ['a', 'b', 'c', 'd', 'e', 'f']
# ['c', 'd', 'e', 'f']
print(chars[2:])
# ['a', 'b'] 不包含index=2的元素
print(chars[:2]) 
# ['e', 'f'] 负索引，从右到左
print(chars[-2:])


# ['b', 'c']
print(chars[1:3])
# ['d', 'e']
print(chars[-3:-1])
# ['b', 'c']
print(chars[1:-3])


# ['b', 'c', 'd', 'e']
print(chars[1:5:1])
# ['b', 'd'] 从[1:5]获取的结果开始，每隔步长-1个获取一个元素
print(chars[1:5:2])
# ['a', 'c', 'e']
print(chars[::2])
# ['f', 'd']
print(chars[-1:-5:-2])

切片修改元素

nums = [1, 2, 3, 4, 5]
nums[1:4] = [22, 33, 44]
# [1, 22, 33, 44, 5]
print(nums)

切片添加元素

切片赋值时当开始索引等于结束索引时，此时是在该索引位置添加多个索引元素

nums = [1, 5]
# 5
print(nums[1])
# []：开始包含index=1，结束又不包含index=1，所以为[]
print(nums[1:1])
nums[1:1] = [2, 3, 4]
# [1, 2, 3, 4, 5]
print(nums)

切片删除元素

nums = [1, 2, 3, 4, 5]
nums[1:4] = []
# [1, 5]
print(nums)

# 相当于del[1:4]
del nums[1:4]


nums = [1, 2, 3, 4, 5]
nums[1:4] = ['b', 'c']
# [1, 'b', 'c', 5]
print(nums)

序列相加 +

# [1, 2, 3, 'a', 'b', 'c']
print([1, 2, 3] + ['a', 'b', 'c'])

序列相乘 *

序列相乘表示将列表中的所有元素追加到列表中几次。

# [1, 2, 3, 1, 2, 3, 1, 2, 3]
print([1, 2, 3] * 3)

# 初始化3个空元素
# [None, None, None]
print([None] * 3)

成员资格 in

print(1 in [1, 'hello', 2.0, True])

长度len、最大值max、最小值min

nums = [0, 1, 2, 3, 4, 5]
# 6, 5, 0
print(f'len(nums), max(nums), min(nums)')

删除元素 remove()

删除指定元素的第一个，如果不存在则抛出异常ValueError。

chars = ['a', 'b', 'a', 'd', 'e']
chars.remove('a')
# ['b', 'a', 'd', 'e']
print(chars)

# ValueError: list.remove(x): x not in list
chars.remove('f')

# 根据索引删除
del chars[0]

删除元素 del()

lst = [1, 2, 3]
del(lst[0])
# [2, 3]
print(lst)


d = 'k1': 'v1', 'k2': 'v2'
del(d['k1'])
# 'k2': 'v2'
print(d)

弹出元素 pop()

删除列表中指定索引的元素（如果没有指定则删除最后一个元素），并返回删除的元素。

chars = ['a', 'b', 'c', 'd', 'e']
# 根据索引删除，并返回删除的元素，pop=a
pop = chars.pop(0)
# pop=e
pop = chars.pop()

元素拼接成字符串

print(",".join(['a', 'b', 'c']))

追加 append()

将单个元素添加到列表的末尾。

nums = [1, 2]
nums.append(3)
print(nums)

拓展 extend()

将一个列表中的所有元素添加到另一个列表的末尾，和+的作用一样。

# 方式一：使用extend()函数
nums = [1, 2, 3]
nums.extend([4, 5, 6])
# [1, 2, 3, 4, 5, 6]
print(nums)

# 方式二：使用➕运算符
# [1, 2, 3, 4, 5, 6]
print([1, 2, 3] + [4, 5, 6])

# 方式三：切片
nums = [1, 2, 3]
nums[3:3] = [4, 5, 6]
# [1, 2, 3, 4, 5, 6]
print(nums)

插入 insert()

items = ['a', 'c', 'd']
items.insert(1, 'b')
# ['a', 'b', 'c', 'd']
print(items)

清空 clear()

清空列表中的所有元素。

chars = ['a', 'b', 'c', 'd', 'e']

# 清空列表 chars=[]
chars.clear()

数量 count()

获取元素在列表中出现的次数。

nums = ['a', 'a', 'a', 'b']
# 3
count = nums.count('a')
print(count)

索引 index()

获取元素中第一次出现的索引位置，如果找不到会抛出异常 ValueError。

items = ['a', 'b', 'c', 'a']
index = items.index('a')
# 0
print(index)


items = ['a', 'b', 'c', 'a']
# ValueError: 'd' is not in list
index = items.index('d')

反转 reverse()

chars = ['a', 'b', 'c', 'd', 'e']
chars.reverse()
# ['e', 'd', 'c', 'b', 'a']
print(chars)

赋值 copy()

copy()方法是深拷贝。

chars = ['a', 'b', 'c']
chars_copy = chars.copy()

chars.append('d')
chars_copy.append('e')
# ['a', 'b', 'c', 'd'] ['a', 'b', 'c', 'e']
print(chars, chars_copy)

排序 sort()

排序会修改原来的值，sort()方法没有返回值。

nums = [5, 1, 8, 6, 9]
nums.sort()
print(nums)

排序 sorted()

sorted()函数有返回值，返回排序后的值。

nums = [5, 1, 8, 6, 9]
nums_sort = sorted(nums)
# [5, 1, 8, 6, 9] [1, 5, 6, 8, 9]
print(nums, nums_sort)

map()

lst = [1, 2, 3, 4, 5]

# [2, 4, 6, 8, 10]
print(list(map(lambda x: x * 2, lst)))

filter()

lst = [1, 2, 3, 4, 5]
# [2, 4]
print(list(filter(lambda x: x % 2 == 0, lst)))

reduce()

from functools import reduce

lst = [1, 2, 3, 4, 5]

# 累加示例：15
print(reduce(lambda x, y: x + y, lst, 0))

循环

nums = [5, 1, 8, 6, 9]
for i in nums:
    print(i)

四：元组tuple

元组tuple和列表list最大的不同是元组不可以修改，其它的和列表没什么区别。

# 当元组中只有一个元素，元素后面的逗号是不能省略的，因为如果没有逗号那么(1)和1是完全一样，此时()只是用来表示优先级。
t = (1,)

# 可以使用函数tuple()将list转为tuple
nums = tuple([1, 2, 3])
# 注意：5下标虽然超出实际值，但不报错
print(nums[1:5])

t = (1, 2, 3, 4)

# 拆包的变量个数必须和元组的长度相同
a, b, c, d = t
# _表示占位符
a, b, _, d = t

# 可以将最后的多个元素拆成一个列表，使用*来表示列表
# 2 [2, 3, 4]
a, *lst = t

压缩zip（难女搭配，干活不累）

zip可以将两个列表将相同的位置的元素组合成一个整体。

boys = ['杨过', '虚竹', '张无忌']
girls = ['小龙女', '梦姑', '小昭']

# [('杨过', '小龙女'), ('虚竹', '梦姑'), ('张无忌', '小昭')]
bg = list(zip(boys, girls))

# (('杨过', '小龙女'), ('虚竹', '梦姑'), ('张无忌', '小昭'))
bg = tuple(zip(boys, girls))

# '杨过': '小龙女', '虚竹': '梦姑', '张无忌': '小昭'
bg = dict(zip(boys, girls))

('虚竹', '梦姑')<以上是关于Python数据类型：序列(字符串str列表list元组tuple字典dict范围range) 和集合set的主要内容，如果未能解决你的问题，请参考以下文章