在utf8中迭代两个数据帧的列和str.encode

Question

我目前正在运行Python 2.7并且有两个数据帧x和y。我想使用某种列表推导来迭代两列，并在每列上使用str.encode（'UTF8）来摆脱unicode。

这非常好，并且易于阅读，但希望尝试更快，更高效地使用。

for col in y:
  if y[col].dtype=='O':
    y[col] = y[col].str.encode("utf-8")

for col in x:
  if x[col].dtype=='O':
    x[col] = x[col].str.encode("utf-8")

我试过的其他方法：

1.)[y[col].str.encode("utf-8") for col in y if y[col].dtype=='O' ]

2.)y.columns= [( y[col].str.encode("utf-8") if y[col].dtype=='O' else y[col]) for col in y ]

3.)y.apply(lambda x : (y[col].str.encode("utf-8") for col in y if y[col].dtype=='O'))

我为2.）和3.）获得了价值误差和长度不匹配错误

Answer 1

另一答案