Batch script: extract text after/before last/first occurrence of characters and store it in array

Posted 2023-02-16

Asked: 2021-11-07 01:45:57

Question:

Suppose I have the following content in a file at "%userprofile%\~.txt":

Monitor: Generic PnP Monitor
Device: \\.\DISPLAY1
Adapter: Intel(R) UHD Graphics 630
(1920 x 1080 x 32 bpp) 60Hz default up, attached (-1920,0)

Monitor: Generic PnP Monitor
Device: \\.\DISPLAY4
Adapter: NVIDIA Quadro P2000
(1280 x 1024 x 32 bpp) 60Hz default up, attached (1920,0)

Monitor: Generic PnP Monitor
Device: \\.\DISPLAY8
Adapter: DisplayLink USB Device
(1920 x 1080 x 32 bpp) 60Hz default up, attached, primary (0,0)

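The number of text blocks can vary.

What I want is the first of the two coordinates that appear on the last line of each block, so for the example above the result should be: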

-1920
1920
0

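To do this in a batch script, I first parse the file with an initial for loop that retrieves the lines containing the string "default up, attached".

Then, for each retrieved string, I search for the text after the last occurrence of (.

From that result, I search for the text before the first occurrence of ,.

I found a solution that works outside the for loop (see the lines marked ** below), but I want these lines inside the loop. I tried the lines marked * below, but the script exits and I don't know what the error is. Hopefully it is a trivial missing piece. Please bear with my limited batch scripting knowledge.

My script: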

@echo off
setlocal EnableDelayedExpansion
set Cnt=0
FOR /F "tokens=*" %%a IN ('findstr "default up, attached" "%userprofile%\~.txt"') DO (
  set /a Cnt+=1
  set result=%%a
  for %%b in ("%result:(=" "%") do set "result=%%~b"                          <= * THESE LINES DON'T WORK!!!
  for /f "tokens=1 delims=," %%c in ("%result%") do set "result=%%~c"         <= * THESE LINES DON'T WORK!!!
  call Set Monitors[%%Cnt%%]=!result!
)
for %%b in ("%Monitors[1]:(=" "%") do set "Monitors[1]=%%~b"                  <= ** THESE LINES WORK
for %%b in ("%Monitors[2]:(=" "%") do set "Monitors[2]=%%~b"                  <= ** THESE LINES WORK
for %%b in ("%Monitors[3]:(=" "%") do set "Monitors[3]=%%~b"                  <= ** THESE LINES WORK
for /f "tokens=1 delims=," %%c in ("%Monitors[1]%") do set "Monitors[1]=%%~c" <= ** THESE LINES WORK
for /f "tokens=1 delims=," %%c in ("%Monitors[2]%") do set "Monitors[2]=%%~c" <= ** THESE LINES WORK
for /f "tokens=1 delims=," %%c in ("%Monitors[3]%") do set "Monitors[3]=%%~c" <= ** THESE LINES WORK
echo %Monitors[1]%
echo %Monitors[2]%
echo %Monitors[3]%
pause

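Comments on the question:

The first two lines you marked have a variable expansion problem. Any variable created or changed inside a parenthesized code block must be read with delayed expansion. That means using exclamation marks instead of percent signs around the variable name.

@Squashman Can you show me what those lines would look like?

Answer 1: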

Another approach is to do this with a PowerShell regular expression:

@Echo off

:# Replace with actual filename
 Set "File=%~dp0in.txt"
 Set "Content=\(-?\d+,\d+\)$"
 Set "Prefix=^.*?,+ \w+ \("
 Set "Suffix=,\d*\)"

:# https://docs.microsoft.com/en-us/powershell/module/microsoft.powershell.core/about/about_regular_expressions?view=powershell-7.1

Set "[i]=1"
For /f "Delims=" %%G in ('
 powershell -nologo -noprofile -c ^
 "(GC "%File%") -match '%Content%' | ForEach {"$_" -replace '%Prefix%' -replace '%Suffix%'}"
')Do Call Set /A "[i]+=1","Monitor[%%[i]%%]=%%G"

 Set Monitor[

Goto :Eof


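Answer 2:

You can use the parentheses and the comma as delimiters to get the value you need. Because your input lines contain extra commas, this requires two nested FOR commands. The first FOR splits the line on the parentheses and gives you everything inside the second set of parentheses. That output is then fed into a second FOR command, which splits the value on the comma.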

@echo off
REM use the parentheses as a delimiter
FOR /F "tokens=3 delims=()" %%G IN ('findstr "default up, attached" "file.txt"') DO (
    REM USE the comma as a delimiter for the final split up of the variable.
    FOR /F "TOKENS=1 delims=," %%H IN ("%%~G") DO ECHO %%~H
)
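
The script above only echoes the values, while the question asks for them in an array. The following is a minimal sketch of one way to combine the nested FOR approach with delayed expansion, not a definitive implementation. It assumes the file path from the question. The names File, Cnt and Monitors are illustrative, and the /C: switch is an addition so that findstr matches the whole phrase rather than any one of its words.

```batch
@echo off
setlocal EnableDelayedExpansion

rem Assumed input path, taken from the question. Adjust as needed.
set "File=%userprofile%\~.txt"
set "Cnt=0"

rem /C: makes findstr search for the literal phrase, spaces included.
for /F "tokens=3 delims=()" %%G in ('findstr /C:"default up, attached" "%File%"') do (
    rem The third token is the coordinate pair, for example -1920,0
    for /F "tokens=1 delims=," %%H in ("%%G") do (
        rem Cnt changes inside the block, so it is read as !Cnt! and not as a percent variable
        set /a Cnt+=1
        set "Monitors[!Cnt!]=%%H"
    )
)

rem List every stored element.
for /L %%i in (1,1,%Cnt%) do echo Monitors[%%i]=!Monitors[%%i]!
```

With the sample file from the question, this should store -1920, 1920 and 0 in Monitors[1] to Monitors[3]. Reading the counter as !Cnt! is the delayed expansion that the comment on the question refers to: a percent-style %Cnt% inside the loop body would be expanded once, when the whole block is parsed, and would not reflect the increments.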

Comments:

Thanks, very nice. I prefer @T3RR0R's regex solution, because I think the content of these lines may change in the future.
