查找字符串中最重复（不是最常见）序列的算法（也称为串联重复）

Question

我正在寻找能够在字符串中找到最重复序列的算法（可能在Python中实现）。对于REPETITIVE，我指的是在不中断的情况下反复重复的任何字符组合（串联重复）。

我正在寻找的算法与“找到最常见的单词”不同。实际上，重复块不需要是字符串中最常见的单词（substring）。

例如：

s = 'asdfewfUBAUBAUBAUBAUBAasdkBAjnfBAenBAcs'
> f(s)
'UBAUBAUBAUBAUBA' #the "most common word" algo would return 'BA'

不幸的是，我不知道如何解决这个问题。非常欢迎任何帮助。

UPDATE

一个额外的例子来澄清我希望返回具有最多重复次数的序列，无论其基本构建块是什么。

g = 'some noisy spacer'
s = g + 'AB'*5 + g + '_ABCDEF'*2 + g + 'AB'*3
> f(s)
'ABABABABAB' #the one with the most repetitions, not the max len

@rici的例子：

s = 'aaabcabc'
> f(s)
'abcabc'

s = 'ababcababc'
> f(s)
'ababcababc' #'abab' would also be a solution here
             # since it is repeated 2 times in a row as 'ababcababc'.
             # The proper algorithm would return both solutions.

Answer 1

另一答案

Answer 2

另一答案

Answer 3

另一答案