Pandas.get_dummies返回两列（_Y和_N）而不是一列

Question

我正在尝试使用sklearn根据我的数据集训练决策树。

当我试图将数据切分为（结果：Y，并预测变量：X）时，结果（我的标签）在True / False中：

#data slicing 
X = df.values[:,3:27] #X are the sets of predicting variable, dropping unique_id and student name here
Y = df.values[:,'OffTask'] #Y is our predicted value (outcome), it is in the 3rd column

这是我的方式，但我不知道这是否是正确的方法：

#convert the label "OffTask" to dummy 

df1 = pd.get_dummies(df,columns=["OffTask"])
df1

我的麻烦是数据集df1将我的标签Offtask返回到OffTask_N和OffTask_Y

有人知道如何解决它吗？

Answer 1

另一答案