Convert each row of a DataFrame to a string

Question

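I am trying to generate a hash code for a DataFrame in PySpark using hashlib.md5. It only accepts a string as input.

So I need to convert each row of the DataFrame to a string.

I tried using the concat_ws function to concatenate all the columns into a single string, but it did not work.

My DataFrame has the columns Id, name, marks.

Here is what I tried: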

str=df.select(concat_ws("id","name","marks"))

print(hashlib.md5(str.encode(encoding='utf_8', errors='strict')).hexdigest())

I got this error:

AttributeError: 'DataFrame' object has no attribute 'encode'
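
For context, the error arises because df.select(...) returns a new DataFrame rather than a Python string, so it has no encode method. Also, concat_ws takes the separator as its first argument, so in the call above "id" is treated as the separator rather than as a column. Below is a minimal sketch of one possible approach, not a definitive fix. The names row_to_md5, add_row_hash, and row_hash, and the "|" separator, are assumptions made for illustration. The first helper hashes one row's values with hashlib in plain Python; the second uses Spark's built-in md5 and concat_ws column functions so the hash is computed per row inside Spark.

```python
import hashlib


def row_to_md5(values, sep="|"):
    """Join one row's values into a string and return its MD5 hex digest.

    Hypothetical helper: None is rendered as an empty string here,
    whereas Spark's concat_ws skips null values entirely.
    """
    joined = sep.join("" if v is None else str(v) for v in values)
    return hashlib.md5(joined.encode("utf-8")).hexdigest()


def add_row_hash(df, cols, sep="|"):
    """Return df with an extra 'row_hash' column (hypothetical name).

    Uses Spark's own md5 and concat_ws, so no Python-side string is needed.
    """
    # Imported lazily so the pure-Python helper above works without pyspark.
    from pyspark.sql import functions as F

    # concat_ws expects the separator first, followed by the columns.
    joined = F.concat_ws(sep, *[F.col(c).cast("string") for c in cols])
    return df.withColumn("row_hash", F.md5(joined))
```

Assuming the columns are named as in the question, a call might look like add_row_hash(df, ["Id", "name", "marks"]). Choosing a separator that cannot occur in the data helps avoid two different rows producing the same joined string.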