"The remote name could not be resolved" when running an Azure DevOps self-hosted build agent in Service Fabric

Posted: 2021-04-06 07:45:35

Question:

Following this article, I can successfully start a self-hosted agent on my Windows 10 machine:

https://docs.microsoft.com/en-us/azure/devops/pipelines/agents/docker?view=azure-devops#windows

However, when I try to start the same Docker image in Service Fabric on Windows Server 20H2 nodes, I get the following container log entries:

1. Determining matching Azure Pipelines agent...
Invoke-RestMethod : The remote name could not be resolved: 'dev.azure.com'
At C:\azp\start.ps1:34 char:12
+ $package = Invoke-RestMethod -Headers @{Authorization=("Basic $base64 ...
+            ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    + CategoryInfo          : InvalidOperation: (System.Net.HttpWebRequest:Htt
   pWebRequest) [Invoke-RestMethod], WebException
    + FullyQualifiedErrorId : WebCmdletWebResponseException,Microsoft.PowerShe
   ll.Commands.InvokeRestMethodCommand

Dockerfile

FROM mcr.microsoft.com/windows/servercore:20H2

WORKDIR /azp

COPY start.ps1 .

CMD powershell .\start.ps1

ApplicationManifest.xml

<?xml version="1.0" encoding="utf-8"?>
<ApplicationManifest ApplicationTypeName="build-agent-20H2" ApplicationTypeVersion="20201228.5"
  xmlns="http://schemas.microsoft.com/2011/01/fabric"
  xmlns:xsd="http://www.w3.org/2001/XMLSchema"
  xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">

  <ServiceManifestImport>
    <ServiceManifestRef ServiceManifestName="Package" ServiceManifestVersion="20201228.5" />
    <EnvironmentOverrides CodePackageRef="buildagent.Code">
      <EnvironmentVariable Name="AZP_URL" Value="https://dev.azure.com/myaccount" />
      <EnvironmentVariable Name="AZP_TOKEN" Value="mypat" />
      <EnvironmentVariable Name="AZP_AGENT_NAME" Value="myagentname" />
      <EnvironmentVariable Name="AZP_POOL" Value="mypoolname" />
    </EnvironmentOverrides>
    <Policies>
      <ServicePackageResourceGovernancePolicy CpuCores="1"/>
      <ResourceGovernancePolicy CodePackageRef="buildagent.Code" MemoryInMB="1024" />
      <ContainerHostPolicies CodePackageRef="buildagent.Code" UseTokenAuthenticationCredentials="true">
        <HealthConfig IncludeDockerHealthStatusInSystemHealthReport="true" RestartContainerOnUnhealthyDockerHealthStatus="false" TreatContainerUnhealthyStatusAsError="false" />
      </ContainerHostPolicies>
    </Policies>
  </ServiceManifestImport>
  <DefaultServices>
    <Service Name="buildagent">
      <StatelessService ServiceTypeName="buildagentType" InstanceCount="1">
        <SingletonPartition />
      </StatelessService>
    </Service>
  </DefaultServices>
</ApplicationManifest>

ServiceManifest.xml

<?xml version="1.0" encoding="utf-8"?>
<ServiceManifest xmlns:xsd="http://www.w3.org/2001/XMLSchema"
  xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" Name="Package" Version="20201228.5"
  xmlns="http://schemas.microsoft.com/2011/01/fabric">
  <ServiceTypes>
    <StatelessServiceType ServiceTypeName="buildagentType" UseImplicitHost="true">
    </StatelessServiceType>
  </ServiceTypes>
  <CodePackage Name="buildagent.Code" Version="20201228.5">
    <EntryPoint>
      <ContainerHost>
        <ImageName>myacr.azurecr.io/build-agent-20h2:20201228.5</ImageName>
        <Commands>
        </Commands>
      </ContainerHost>
    </EntryPoint>
    <EnvironmentVariables>
      <EnvironmentVariable Name="AZP_URL" Value="" />
      <EnvironmentVariable Name="AZP_TOKEN" Value="" />
      <EnvironmentVariable Name="AZP_AGENT_NAME" Value="" />
      <EnvironmentVariable Name="AZP_POOL" Value="" />
    </EnvironmentVariables>
  </CodePackage>
</ServiceManifest>

I also tried without these settings:

ServicePackageResourceGovernancePolicy, ResourceGovernancePolicy, HealthConfig

I am wondering whether it is possible to start a container in Service Fabric without an HTTP or TCP endpoint.

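Edit:

Another observation: when I SSH/remote into one of the nodes the image is deployed to and start the image directly with docker run -it, the remote name can be resolved and the installation process works (?). So it seems SF starts the container in a different way.

Edit 2:

I tried a simplified start.ps1: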

Invoke-WebRequest -OutFile dotnet.zip https://dotnetcli.blob.core.windows.net/dotnet/Sdk/2.2.402/dotnet-sdk-2.2.402-win-x64.zip
Read-Host

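This produces the same error. Comparing the docker inspect output of the SF-controlled container with that of the directly started container, I found differences that may be relevant:

SF (not working)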

        "Dns": [],
        "DnsOptions": [],
        "DnsSearch": [],
...
        "Networks": {
            "nat": {
                "IPAMConfig": null,
                "Links": null,
                "Aliases": null,
                "NetworkID": "b52a59a54d97f56f8fc866c0f742588023333ff1cb10b2c536be21d6745d609d",
                "EndpointID": "bc20ebb2124467f5135aefeefd7e3d38c14338479f7a5dbd04fd274dd0fa3ddf",
                "Gateway": "192.168.80.1",
                "IPAddress": "192.168.81.196",
                "IPPrefixLen": 24,

Direct (working)

        "Dns": [
            "10.1.0.211"
        ],
        "DnsOptions": [
            "ndots:1"
        ],
        "DnsSearch": [
            "build-agent-20H2"
        ],
...
        "Networks": {
            "nat": {
                "IPAMConfig": null,
                "Links": null,
                "Aliases": null,
                "NetworkID": "b52a59a54d97f56f8fc866c0f742588023333ff1cb10b2c536be21d6745d609d",
                "EndpointID": "",
                "Gateway": "",
                "IPAddress": "",
                "IPPrefixLen": 0,
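
The comparison of the two docker inspect outputs can also be scripted. The following is a minimal sketch, not part of the original post. It assumes each output has been saved as JSON text and that the DNS fields sit under HostConfig, which is where docker inspect reports them. The function names dns_settings and diff_dns are hypothetical, and the sample values are taken from the excerpts above.

```python
import json

# DNS-related fields that docker inspect reports under "HostConfig".
DNS_KEYS = ("Dns", "DnsOptions", "DnsSearch")


def dns_settings(inspect_json: str) -> dict:
    """Extract the DNS-related HostConfig fields from docker inspect output."""
    data = json.loads(inspect_json)
    # docker inspect prints a JSON array with one object per container;
    # a single bare object is accepted too.
    container = data[0] if isinstance(data, list) else data
    host_config = container.get("HostConfig") or {}
    # Treat a missing or null field the same as an empty list.
    return {key: host_config.get(key) or [] for key in DNS_KEYS}


def diff_dns(left_json: str, right_json: str) -> dict:
    """Return {field: (left_value, right_value)} for fields that differ."""
    left = dns_settings(left_json)
    right = dns_settings(right_json)
    return {k: (left[k], right[k]) for k in DNS_KEYS if left[k] != right[k]}


if __name__ == "__main__":
    # Sample data mirroring the two excerpts in the question.
    sf = json.dumps([{"HostConfig": {"Dns": [], "DnsOptions": [], "DnsSearch": []}}])
    direct = json.dumps([{"HostConfig": {
        "Dns": ["10.1.0.211"],
        "DnsOptions": ["ndots:1"],
        "DnsSearch": ["build-agent-20H2"],
    }}])
    for key, (sf_value, direct_value) in diff_dns(sf, direct).items():
        print(f"{key}: SF={sf_value} direct={direct_value}")
```

With real containers, the JSON text would come from running docker inspect against each container ID on the node. An empty Dns list on the SF-started container matches the name resolution failure described above.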

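Comments:

According to this documentation, docs.microsoft.com/en-us/azure/service-fabric/…, containers are not supported on a local single-node Service Fabric cluster. Is that your situation?

Thanks. No, I am deploying to a regular 3-node TEST cluster.

If you create a simple Docker container, can you reproduce the issue?

@CeceDong-MSFT I added a simple test with the same result. Again, running docker run locally on the same node executes the script successfully.

Does it make a difference whether you use Open or nat? docs.microsoft.com/en-us/azure/service-fabric/…

Does nslookup also give different results?

Answer 1: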

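When I deployed the same image with a compose deployment, name resolution worked, and the build agent installed and started successfully.

Comparing the manifests, I added endpoints to the original ones, and it now works: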

ApplicationManifest.xml

<?xml version="1.0" encoding="utf-8"?>
<ApplicationManifest ApplicationTypeName="build-agent-20H2" ApplicationTypeVersion="20201228.5"
  xmlns="http://schemas.microsoft.com/2011/01/fabric"
  xmlns:xsd="http://www.w3.org/2001/XMLSchema"
  xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">

  <ServiceManifestImport>
    <ServiceManifestRef ServiceManifestName="Package" ServiceManifestVersion="20201228.5" />
    <EnvironmentOverrides CodePackageRef="buildagent.Code">
      <EnvironmentVariable Name="AZP_URL" Value="https://dev.azure.com/myaccount" />
      <EnvironmentVariable Name="AZP_TOKEN" Value="mypat" />
      <EnvironmentVariable Name="AZP_AGENT_NAME" Value="myagentname" />
      <EnvironmentVariable Name="AZP_POOL" Value="mypoolname" />
    </EnvironmentOverrides>
    <Policies>
      <ServicePackageResourceGovernancePolicy CpuCores="1"/>
      <ResourceGovernancePolicy CodePackageRef="buildagent.Code" MemoryInMB="1024" />
      <ContainerHostPolicies CodePackageRef="buildagent.Code" UseTokenAuthenticationCredentials="true">
        <PortBinding ContainerPort="80" EndpointRef="agentEndpoint" />
        <PortBinding ContainerPort="443" EndpointRef="agentEndpoint1" />
      </ContainerHostPolicies>
    </Policies>
  </ServiceManifestImport>
  <DefaultServices>
    <Service Name="buildagent">
      <StatelessService ServiceTypeName="buildagentType" InstanceCount="1">
        <SingletonPartition />
      </StatelessService>
    </Service>
  </DefaultServices>
</ApplicationManifest>

ServiceManifest.xml

<?xml version="1.0" encoding="utf-8"?>
<ServiceManifest xmlns:xsd="http://www.w3.org/2001/XMLSchema"
  xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" Name="Package" Version="20201228.5"
  xmlns="http://schemas.microsoft.com/2011/01/fabric">
  <ServiceTypes>
    <StatelessServiceType ServiceTypeName="buildagentType" UseImplicitHost="true">
    </StatelessServiceType>
  </ServiceTypes>
  <CodePackage Name="buildagent.Code" Version="20201228.5">
    <EntryPoint>
      <ContainerHost>
        <ImageName>myacr.azurecr.io/build-agent-20h2:20201228.5</ImageName>
        <Commands>
        </Commands>
      </ContainerHost>
    </EntryPoint>
    <EnvironmentVariables>
      <EnvironmentVariable Name="AZP_URL" Value="" />
      <EnvironmentVariable Name="AZP_TOKEN" Value="" />
      <EnvironmentVariable Name="AZP_AGENT_NAME" Value="" />
      <EnvironmentVariable Name="AZP_POOL" Value="" />
    </EnvironmentVariables>
  </CodePackage>
  <Resources>
    <Endpoints>
      <Endpoint Name="agentEndpoint" UriScheme="http" />
      <Endpoint Name="agentEndpoint1" UriScheme="https" />
    </Endpoints>
  </Resources>
</ServiceManifest>
