VBScript: removing duplicate numbers from a subtitle file by renumbering

I have a subtitle (.srt) file that looks like this:

2
00:04:22,504 --> 00:04:23,520
Hello?

3
00:04:27,860 --> 00:04:29,112
Hey wait!
Hello!

3
00:06:18,860 --> 00:06:21,112
Uhh!

3
00:06:29,860 --> 00:06:32,112
Ah!

4
00:07:19,232 --> 00:07:21,284
What are you doing here?

5
00:07:21,608 --> 00:07:22,708
Tell me!

...

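As you can see, the number 3 appears three times in this file. I would like to fix this by renumbering the entire subtitle file (I assume that is the only option, since this kind of duplication occurs in several places in the file).

I wrote the following script to select the file and tried to replace the duplicated numbers with newly generated ones (the iteration count), but it does not work.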

Dim strFile, objFS

strFile = SelectFile( )
If strFile = "" Then
    WScript.Echo "No file selected."
End If


Function SelectFile( )
    Dim objExec, strMSHTA, wshShell

    SelectFile = ""

    strMSHTA = "mshta.exe ""about:" & "<" & "input type=file id=FILE>" _
             & "<" & "script>FILE.click();new ActiveXObject('Scripting.FileSystemObject')" _
             & ".GetStandardStream(1).WriteLine(FILE.value);close();resizeTo(0,0);" & "<" & "/script>"""

    Set wshShell = CreateObject( "WScript.Shell" )
    Set objExec = wshShell.Exec( strMSHTA )

    SelectFile = objExec.StdOut.ReadLine( )

    Set objExec = Nothing
    Set wshShell = Nothing
End Function

Set objFS = CreateObject("Scripting.FileSystemObject")
Set objFile = objFS.OpenTextFile(strFile)
Set objFile2 = objFS.OpenTextFile(strFile, 8, True)
x = 0
Do Until objFile.AtEndOfStream
    strLine = objFile.ReadLine
    Set objRegEx = CreateObject("VBScript.RegExp")
    objRegEx.Global = True
    objRegEx.Pattern = "^\d+$"
    Set colMatches = objRegEx.Execute(strLine)
    If colMatches.Count > 0 Then
        x = x + 1
        strLine = x
        strNewLine = Replace(strLine,strLine,x)
        objFile2.WriteLine strLine
    End If
Loop

Can anyone help me figure out how to make this work?

Answer

Use a regular expression with a replacement function and a global counter in VBScript:

f = "C:\path\to\your.srt"
n = 1  'global counter

Function Renumber(m, g1, g2, pos, src)
  Renumber = g1 & n & g2
  n = n + 1  'increment global counter after current value was used
End Function

Set re = New RegExp
re.Pattern = "(^|\r\n\r\n)\d+(\r\n)"
re.Global = True

Set fso = CreateObject("Scripting.FileSystemObject")
txt = fso.OpenTextFile(f).ReadAll
txt = re.Replace(txt, GetRef("Renumber"))
fso.OpenTextFile(f, 2).Write txt

Another answer

If you have a Unix box or a Unix VM, or can emulate a Unix environment that provides awk, you can do this in one line:

Command:

awk 'BEGIN{c=1} /^[0-9]+$/ {print c++; next} {print}' input_sub.txt > output_sub.txt

Test input:

2
00:04:22,504 --> 00:04:23,520
Hello?

3
00:04:27,860 --> 00:04:29,112
Hey wait!
Hello!

3
00:06:18,860 --> 00:06:21,112
Uhh!

3
00:06:29,860 --> 00:06:32,112
Ah!

4
00:07:19,232 --> 00:07:21,284
What are you doing here?

5
00:07:21,608 --> 00:07:22,708
Tell me!

Output:

1
00:04:22,504 --> 00:04:23,520
Hello?

2
00:04:27,860 --> 00:04:29,112
Hey wait!
Hello!

3
00:06:18,860 --> 00:06:21,112
Uhh!

4
00:06:29,860 --> 00:06:32,112
Ah!

5
00:07:19,232 --> 00:07:21,284
What are you doing here?

6
00:07:21,608 --> 00:07:22,708
Tell me!
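
The one-liner above renumbers every line that consists only of digits, so a subtitle whose text is just a number (for example "1999") would be renumbered too, and a file saved with Windows line endings would not match because of the trailing carriage return. Below is a minimal sketch of a stricter variant, assuming that a cue index is always the first line of the file or the first line after a blank line (the same assumption the regular expression in the VBScript answer makes). The function name `renumber_srt` is hypothetical and not part of either answer.

```shell
#!/bin/sh
# Hypothetical helper: renumber the cue indices of an .srt file read from stdin.
# Assumption: an index line is a digits-only line at the start of a cue block.
# Usage: renumber_srt < input_sub.txt > output_sub.txt
renumber_srt() {
  awk '
    BEGIN { n = 0; block_start = 1 }     # the first line of the file starts a cue
    {
      line = $0
      sub(/\r$/, "", line)               # compare without a trailing CR (Windows files)
      if (block_start && line ~ /^[0-9]+$/) {
        n++
        eol = ($0 ~ /\r$/) ? "\r" : ""   # keep the original line ending
        print n eol
      } else {
        print $0                         # timestamps, text and blank lines pass through
      }
      block_start = (line == "")         # only a blank line opens the next cue block
    }
  '
}
```

Because the result is written to a separate file, the original subtitle file stays untouched until you have checked the output.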
