Pythonic找到重叠元素的方法

Question

我有2个正则表达式，它们提取数据并将它们分类为feature-1和feature-2，但在少数情况下，数据被分类为feature-1（f1）和feature-2（f2）。例如：亚马逊已有100年历史。

100岁被归类为f1,100岁被归类为f2。当它显示在UI上时。它显示为100岁100年是错误的，因为只需要显示100岁。为了处理超集和子集。我试图遍历每个功能并比较它们是否是重叠特征f2。此代码正在运行，但如果要处理25000个功能，则会产生巨大的性能影响。

`if df.iloc[i]['section_id'] == section_id and st_pos != -1:
    start_pos.append(st_pos)
    end_pos.append(en_pos)
    list.append(df.iloc[i]['id'])
'''
In the below loop, the positions are stored in  set form, If any subset found, then store it to list_subset
'''
        for i in range(0,len(start_pos)-1):
            for j in range(i+1,len(start_pos)):
                if 
    set(range(start_pos[i],end_pos[i])).issubset(range(start_pos[j],end_pos[j])):
    list_subset.append(list[i])
                elif 
    set(range(start_pos[j],end_pos[j])).issubset(range(start_pos[i],end_pos[i])):
    list_subset.append(list[j])`

他们是一个更好的方式来pythonically做这个代码？