Istio Gateway 流量控制策略实战：灰度发布与泳道隔离

Istio 实战 - 这篇文章属于一个选集。

§ 2: 本文

§ 16: Istio 超时与重试机制详解

§ 17: Istio Wasm 插件开发指南

概述
#

在生产环境中，服务发布不是简单的"一键部署"，而是需要精细化的流量控制来保障稳定性。本文探讨如何利用 Istio Ingress Gateway 实现各种流量控制策略。

核心场景：

灰度发布流程：内灰 → 外灰 → 切流 → 观察 → 下线
用户会话粘性：一旦进入 v2，后续流量都走 v2
按调用方路由：特定服务调用走特定版本
生产泳道：流量隔离与染色

基础配置
#

假设我们有一个服务 myapp，需要从 v1 发布到 v2：

# DestinationRule：定义服务版本（subset）
apiVersion: networking.istio.io/v1beta1
kind: DestinationRule
metadata:
  name: myapp
  namespace: production
spec:
  host: myapp.production.svc.cluster.local
  trafficPolicy:
    connectionPool:
      tcp:
        maxConnections: 100
      http:
        h2UpgradePolicy: UPGRADE
  subsets:
  - name: v1
    labels:
      version: v1
  - name: v2
    labels:
      version: v2

# Deployment v1
apiVersion: apps/v1
kind: Deployment
metadata:
  name: myapp-v1
  namespace: production
spec:
  replicas: 3
  selector:
    matchLabels:
      app: myapp
      version: v1
  template:
    metadata:
      labels:
        app: myapp
        version: v1
    spec:
      containers:
      - name: myapp
        image: myapp:1.0.0
---
# Deployment v2
apiVersion: apps/v1
kind: Deployment
metadata:
  name: myapp-v2
  namespace: production
spec:
  replicas: 3
  selector:
    matchLabels:
      app: myapp
      version: v2
  template:
    metadata:
      labels:
        app: myapp
        version: v2
    spec:
      containers:
      - name: myapp
        image: myapp:2.0.0

场景一：完整灰度发布流程
#

发布阶段流程
#

flowchart LR
    subgraph 阶段1["阶段1: 内灰"]
        A1[v1: 100%] --> A2[v2: 0%
仅内部测试]
    end
    
    subgraph 阶段2["阶段2: 外灰"]
        B1[v1: 100%] --> B2[v2: 特定用户]
    end
    
    subgraph 阶段3["阶段3: 切流"]
        C1[v1: 90%] --> C2[v2: 10%]
    end
    
    subgraph 阶段4["阶段4: 全量"]
        D1[v1: 0%] --> D2[v2: 100%]
    end
    
    阶段1 --> 阶段2 --> 阶段3 --> 阶段4

阶段 1：内灰（0 线上流量）
#

内灰阶段，线上流量全部走 v1，只有携带特定 Header 的请求才能进入 v2。

apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: myapp
  namespace: production
spec:
  hosts:
  - myapp.example.com
  gateways:
  - istio-system/istio-ingressgateway
  http:
  # 规则1：内灰流量 - 携带特定 Header 走 v2
  - match:
    - headers:
        x-gray-tag:
          exact: "internal-test"
        x-gray-token:
          exact: "secret-token-12345"  # 防止外部伪造
    route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v2
    
  # 规则2：默认流量 - 全部走 v1
  - route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v1

测试方式：

# 正常请求 - 走 v1
curl https://myapp.example.com/api/version
# 返回: v1

# 内灰请求 - 走 v2
curl -H "x-gray-tag: internal-test" \
     -H "x-gray-token: secret-token-12345" \
     https://myapp.example.com/api/version
# 返回: v2

阶段 2：外灰（特定用户）
#

验证 v2 基本功能后，开始引入特定特征的真实用户流量。

apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: myapp
  namespace: production
spec:
  hosts:
  - myapp.example.com
  gateways:
  - istio-system/istio-ingressgateway
  http:
  # 规则1：内灰流量（保留）
  - match:
    - headers:
        x-gray-tag:
          exact: "internal-test"
        x-gray-token:
          exact: "secret-token-12345"
    route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v2

  # 规则2：外灰 - 特定用户 ID（尾号为 0 的用户）
  - match:
    - headers:
        x-user-id:
          regex: ".*0$"  # 用户 ID 尾号为 0
    route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v2

  # 规则3：外灰 - 特定地区用户
  - match:
    - headers:
        x-user-region:
          exact: "beijing"
    route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v2

  # 规则4：外灰 - 特定渠道（如 iOS 灰度）
  - match:
    - headers:
        x-app-platform:
          exact: "ios"
        x-app-version:
          regex: "^3\\..*"  # iOS 3.x 版本
    route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v2

  # 默认：走 v1
  - route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v1

阶段 3：按权重切流
#

外灰验证通过后，逐步按权重切换流量。

apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: myapp
  namespace: production
spec:
  hosts:
  - myapp.example.com
  gateways:
  - istio-system/istio-ingressgateway
  http:
  # 保留内灰入口
  - match:
    - headers:
        x-gray-tag:
          exact: "internal-test"
        x-gray-token:
          exact: "secret-token-12345"
    route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v2

  # 按权重分流
  - route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v1
      weight: 90
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v2
      weight: 10

逐步调整权重：

# 阶段 3.1: v1=90%, v2=10%
# 阶段 3.2: v1=70%, v2=30%
# 阶段 3.3: v1=50%, v2=50%
# 阶段 3.4: v1=20%, v2=80%
# 阶段 3.5: v1=0%,  v2=100%

阶段 4：全量发布 + 观察期
#

全量切到 v2，但保留 v1 Deployment 用于回滚（观察期最长 24h）。

apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: myapp
  namespace: production
spec:
  hosts:
  - myapp.example.com
  gateways:
  - istio-system/istio-ingressgateway
  http:
  # 紧急回滚入口（运维使用）
  - match:
    - headers:
        x-rollback:
          exact: "v1"
        x-ops-token:
          exact: "ops-secret-token"
    route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v1

  # 全量走 v2
  - route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v2

紧急回滚
#

如果 v2 出现问题，立即将流量切回 v1：

# 紧急回滚配置
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: myapp
  namespace: production
spec:
  hosts:
  - myapp.example.com
  gateways:
  - istio-system/istio-ingressgateway
  http:
  - route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v1
      weight: 100

进阶方案：EnvoyFilter + Lua 实现灵活灰度控制
#

上面的 VirtualService 方案存在问题：每次修改灰度规则都需要更新 YAML，规则多了难以维护。

使用 EnvoyFilter + Lua 可以实现 配置化、动态更新 的灰度控制。

架构设计
#

flowchart TB
    subgraph Gateway["Ingress Gateway"]
        A[请求到达] --> B[EnvoyFilter Lua]
        B --> C{匹配灰度规则}
        C -->|内灰| D[x-target-version: v2]
        C -->|外灰| D
        C -->|默认| E[x-target-version: v1]
        D --> F[VirtualService 路由]
        E --> F
    end
    
    subgraph Config["配置管理"]
        G[ConfigMap] -.->|加载| B
        H[配置中心] -.->|可选| B
    end

灰度配置（ConfigMap）
#

apiVersion: v1
kind: ConfigMap
metadata:
  name: gray-rules
  namespace: istio-system
data:
  rules.json: |
    {
      "myapp": {
        "default_version": "v1",
        "rules": [
          {
            "name": "internal-test",
            "priority": 100,
            "type": "header",
            "conditions": {"x-gray-tag": "internal-test", "x-gray-token": "secret-12345"},
            "version": "v2"
          },
          {
            "name": "user-tail-012",
            "priority": 90,
            "type": "user_id_tail",
            "tails": ["0", "1", "2"],
            "version": "v2"
          },
          {
            "name": "beijing-region",
            "priority": 80,
            "type": "header",
            "conditions": {"x-user-region": "beijing"},
            "version": "v2"
          },
          {
            "name": "ios-platform",
            "priority": 70,
            "type": "header",
            "conditions": {"x-app-platform": "ios"},
            "version": "v2"
          },
          {
            "name": "percentage-30",
            "priority": 10,
            "type": "percentage",
            "percent": 30,
            "version": "v2"
          }
        ]
      }
    }

EnvoyFilter + Lua 实现
#

apiVersion: networking.istio.io/v1alpha3
kind: EnvoyFilter
metadata:
  name: gray-router
  namespace: istio-system
spec:
  workloadSelector:
    labels:
      istio: ingressgateway
  configPatches:
  - applyTo: HTTP_FILTER
    match:
      context: GATEWAY
      listener:
        filterChain:
          filter:
            name: envoy.filters.network.http_connection_manager
            subFilter:
              name: envoy.filters.http.router
    patch:
      operation: INSERT_BEFORE
      value:
        name: envoy.filters.http.lua
        typed_config:
          "@type": type.googleapis.com/envoy.extensions.filters.http.lua.v3.Lua
          inlineCode: |
            -- ============================================
            -- 灰度路由 Lua 脚本
            -- ============================================
            
            -- 灰度规则配置（生产环境可从共享内存加载）
            local gray_rules = {
              myapp = {
                default_version = "v1",
                rules = {
                  -- 规则1：内灰 - Header 精确匹配（优先级最高）
                  {
                    name = "internal-test",
                    priority = 100,
                    type = "header",
                    conditions = {
                      ["x-gray-tag"] = "internal-test",
                      ["x-gray-token"] = "secret-12345"
                    },
                    version = "v2"
                  },
                  -- 规则2：外灰 - 用户 ID 尾号
                  {
                    name = "user-tail-012",
                    priority = 90,
                    type = "user_id_tail",
                    tails = {"0", "1", "2"},  -- 尾号 0,1,2 走 v2（约 30%）
                    version = "v2"
                  },
                  -- 规则3：外灰 - 北京地区
                  {
                    name = "beijing-region",
                    priority = 80,
                    type = "header",
                    conditions = {["x-user-region"] = "beijing"},
                    version = "v2"
                  },
                  -- 规则4：外灰 - iOS 平台
                  {
                    name = "ios-platform",
                    priority = 70,
                    type = "header",
                    conditions = {["x-app-platform"] = "ios"},
                    version = "v2"
                  },
                  -- 规则5：按比例灰度（兜底）
                  {
                    name = "percentage-30",
                    priority = 10,
                    type = "percentage",
                    percent = 30,
                    version = "v2"
                  }
                }
              }
            }
            
            -- 计算字符串哈希
            local function hash_string(str)
              local hash = 0
              for i = 1, #str do
                hash = (hash * 31 + string.byte(str, i)) % 2147483647
              end
              return hash
            end
            
            -- 从 Host 提取服务名
            local function get_service(headers)
              local host = headers:get(":authority") or ""
              return string.match(host, "^([^%.]+)") or "default"
            end
            
            -- 检查规则是否匹配
            local function match_rule(rule, headers)
              local rule_type = rule.type
              
              if rule_type == "header" then
                -- Header 精确匹配
                for key, value in pairs(rule.conditions) do
                  if headers:get(key) ~= value then
                    return false
                  end
                end
                return true
                
              elseif rule_type == "user_id_tail" then
                -- 用户 ID 尾号匹配
                local user_id = headers:get("x-user-id")
                if not user_id then return false end
                local tail = string.sub(user_id, -1)
                for _, t in ipairs(rule.tails) do
                  if tail == t then return true end
                end
                return false
                
              elseif rule_type == "percentage" then
                -- 按比例匹配
                local user_id = headers:get("x-user-id") 
                              or headers:get("x-request-id") 
                              or tostring(os.time())
                local hash = hash_string(user_id)
                local threshold = (rule.percent / 100) * 2147483647
                return hash < threshold
                
              elseif rule_type == "cookie" then
                -- Cookie 匹配
                local cookie = headers:get("cookie") or ""
                for key, value in pairs(rule.conditions) do
                  if not string.find(cookie, key .. "=" .. value) then
                    return false
                  end
                end
                return true
              end
              
              return false
            end
            
            -- 主入口
            function envoy_on_request(handle)
              local headers = handle:headers()
              
              -- 1. 检查粘性 Cookie
              local cookie = headers:get("cookie") or ""
              local sticky = string.match(cookie, "x%-sticky%-version=([^;]+)")
              if sticky then
                headers:add("x-target-version", sticky)
                return
              end
              
              -- 2. 获取服务配置
              local service = get_service(headers)
              local config = gray_rules[service]
              if not config then return end
              
              -- 3. 按优先级匹配规则
              local target = config.default_version
              local matched = nil
              
              for _, rule in ipairs(config.rules) do
                if match_rule(rule, headers) then
                  target = rule.version
                  matched = rule.name
                  break
                end
              end
              
              -- 4. 设置路由 Header
              headers:add("x-target-version", target)
              if matched then
                headers:add("x-gray-rule", matched)
              end
              
              -- 5. 标记设置粘性 Cookie
              handle:streamInfo():dynamicMetadata():set(
                "envoy.filters.http.lua", "sticky_version", target
              )
            end
            
            function envoy_on_response(handle)
              local meta = handle:streamInfo():dynamicMetadata():get("envoy.filters.http.lua")
              if meta and meta["sticky_version"] then
                local cookie = string.format(
                  "x-sticky-version=%s; Path=/; Max-Age=604800; HttpOnly",
                  meta["sticky_version"]
                )
                handle:headers():add("set-cookie", cookie)
              end
            end

简化的 VirtualService
#

apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: myapp
  namespace: production
spec:
  hosts:
  - myapp.example.com
  gateways:
  - istio-system/istio-ingressgateway
  http:
  # 只需根据 x-target-version 路由，无需复杂 match
  - match:
    - headers:
        x-target-version:
          exact: "v2"
    route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v2
  - route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v1

方案对比
#

特性	VirtualService match	EnvoyFilter + Lua
规则配置	每条规则都要写 YAML	配置化，易维护
动态更新	需要 kubectl apply	更新 ConfigMap 或 Lua 变量
复杂逻辑	受限于 match 语法	任意逻辑（多条件组合等）
可观测性	有限	可添加日志、埋点
用户粘性	需要额外配置	内置 Cookie 粘性
灰度比例	只能用 weight	支持精确哈希控制

运维操作
#

# 调整灰度比例：修改 Lua 配置中的 percent
# 添加新规则：在 rules 数组中添加
# 紧急关闭灰度：设置 default_version = "v1"，清空 rules

# 查看灰度命中情况
kubectl logs -n istio-system -l istio=ingressgateway | grep "x-gray-rule"

# 测试内灰
curl -H "x-gray-tag: internal-test" -H "x-gray-token: secret-12345" https://myapp.example.com

# 测试用户尾号灰度
curl -H "x-user-id: 12340" https://myapp.example.com  # 应该走 v2
curl -H "x-user-id: 12345" https://myapp.example.com  # 应该走 v1

场景二：用户会话粘性
#

需求：一旦用户进入 v2，后续所有请求都必须走 v2，避免因版本切换导致的数据不一致。

方案 1：基于 Cookie 的粘性
#

apiVersion: networking.istio.io/v1beta1
kind: DestinationRule
metadata:
  name: myapp
  namespace: production
spec:
  host: myapp.production.svc.cluster.local
  trafficPolicy:
    loadBalancer:
      consistentHash:
        httpCookie:
          name: x-app-version
          ttl: 86400s  # Cookie 有效期 24 小时
  subsets:
  - name: v1
    labels:
      version: v1
  - name: v2
    labels:
      version: v2

问题：一致性哈希只能保证同一用户去同一 Pod，但不能保证去同一版本。

方案 2：服务端设置版本 Cookie（推荐）
#

更可靠的方案是让应用在响应中设置版本 Cookie，Istio 根据 Cookie 路由。

apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: myapp
  namespace: production
spec:
  hosts:
  - myapp.example.com
  gateways:
  - istio-system/istio-ingressgateway
  http:
  # 规则1：已经标记为 v2 的用户，继续走 v2
  - match:
    - headers:
        cookie:
          regex: ".*app-version=v2.*"
    route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v2

  # 规则2：已经标记为 v1 的用户，继续走 v1
  - match:
    - headers:
        cookie:
          regex: ".*app-version=v1.*"
    route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v1

  # 规则3：新用户，按权重分配，应用需要在响应中设置 Cookie
  - route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v1
      weight: 50
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v2
      weight: 50

应用端代码（Go 示例）：

func handler(w http.ResponseWriter, r *http.Request) {
    // 检查是否已有版本 Cookie
    cookie, err := r.Cookie("app-version")
    if err != nil {
        // 新用户，设置版本 Cookie
        version := os.Getenv("APP_VERSION") // v1 或 v2
        http.SetCookie(w, &http.Cookie{
            Name:     "app-version",
            Value:    version,
            Path:     "/",
            MaxAge:   86400 * 7, // 7 天
            HttpOnly: true,
        })
    }
    // 业务逻辑...
}

方案 3：基于用户 ID 的一致性哈希
#

如果请求中携带用户 ID，可以用一致性哈希保证同一用户始终去同一 subset。

apiVersion: networking.istio.io/v1beta1
kind: DestinationRule
metadata:
  name: myapp
  namespace: production
spec:
  host: myapp.production.svc.cluster.local
  trafficPolicy:
    loadBalancer:
      consistentHash:
        httpHeaderName: x-user-id
  subsets:
  - name: v1
    labels:
      version: v1
  - name: v2
    labels:
      version: v2

注意：一致性哈希是针对同一 subset 内的 Pod，不能跨 subset 保证粘性。要实现跨版本粘性，需要结合 VirtualService 的 match 规则。

方案 4：EnvoyFilter + Lua 实现版本级一致性哈希（推荐）
#

核心问题：Istio 的 consistentHash 只能在单个 subset 内部做 Pod 级别的哈希，无法跨版本（subset）保证一致性。

解决方案：在 Envoy 层面用 Lua 脚本计算用户 ID 的哈希值，根据哈希结果设置路由 Header，同时设置 Cookie 保证后续请求的粘性。

flowchart TB
    subgraph 首次访问
        A1[请求到达] --> B1{检查 Cookie}
        B1 -->|无 Cookie| C1[计算 user-id 哈希]
        C1 --> D1{哈希值 < 30%?}
        D1 -->|是| E1[设置 x-target-version: v2]
        D1 -->|否| F1[设置 x-target-version: v1]
        E1 --> G1[VirtualService 路由到 v2]
        F1 --> H1[VirtualService 路由到 v1]
        G1 --> I1[响应时设置 Cookie]
        H1 --> I1
    end
    
    subgraph 后续访问
        A2[请求到达] --> B2{检查 Cookie}
        B2 -->|有 Cookie| C2[读取版本]
        C2 --> D2[设置 x-target-version]
        D2 --> E2[路由到对应版本]
    end

EnvoyFilter 配置：

apiVersion: networking.istio.io/v1alpha3
kind: EnvoyFilter
metadata:
  name: version-consistent-hash
  namespace: istio-system
spec:
  workloadSelector:
    labels:
      istio: ingressgateway
  configPatches:
  - applyTo: HTTP_FILTER
    match:
      context: GATEWAY
      listener:
        filterChain:
          filter:
            name: envoy.filters.network.http_connection_manager
            subFilter:
              name: envoy.filters.http.router
    patch:
      operation: INSERT_BEFORE
      value:
        name: envoy.filters.http.lua
        typed_config:
          "@type": type.googleapis.com/envoy.extensions.filters.http.lua.v3.Lua
          inlineCode: |
            -- 版本级一致性哈希 + Cookie 粘性
            function envoy_on_request(handle)
              local headers = handle:headers()
              local cookie = headers:get("cookie") or ""
              
              -- 1. 检查是否已有粘性 Cookie
              local sticky_version = string.match(cookie, "x%-sticky%-version=([^;]+)")
              if sticky_version then
                -- 已有粘性，直接使用
                headers:add("x-target-version", sticky_version)
                return
              end
              
              -- 2. 新用户，获取用户标识
              local user_id = headers:get("x-user-id")
              if not user_id then
                -- 尝试从 session cookie 获取
                user_id = string.match(cookie, "session%-id=([^;]+)")
              end
              if not user_id then
                -- 兜底：使用请求 ID
                user_id = headers:get("x-request-id") or tostring(os.time())
              end
              
              -- 3. 计算哈希值
              local hash = 0
              for i = 1, #user_id do
                hash = (hash * 31 + string.byte(user_id, i)) % 2147483647
              end
              
              -- 4. 根据哈希值决定版本（30% 走 v2）
              local threshold = 0.3 * 2147483647
              local target_version = "v1"
              if hash < threshold then
                target_version = "v2"
              end
              
              -- 5. 设置路由 Header
              headers:add("x-target-version", target_version)
              
              -- 6. 标记需要设置 Cookie（供响应阶段使用）
              handle:streamInfo():dynamicMetadata():set(
                "envoy.filters.http.lua", 
                "set_version_cookie", 
                target_version
              )
            end
            
            function envoy_on_response(handle)
              -- 检查是否需要设置粘性 Cookie
              local metadata = handle:streamInfo():dynamicMetadata():get("envoy.filters.http.lua")
              if metadata and metadata["set_version_cookie"] then
                local version = metadata["set_version_cookie"]
                -- 设置 Cookie，有效期 7 天
                local cookie = string.format(
                  "x-sticky-version=%s; Path=/; Max-Age=604800; HttpOnly; SameSite=Lax",
                  version
                )
                handle:headers():add("set-cookie", cookie)
              end
            end

配套 VirtualService：

apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: myapp
  namespace: production
spec:
  hosts:
  - myapp.example.com
  gateways:
  - istio-system/istio-ingressgateway
  http:
  # Lua 脚本设置了 x-target-version: v2
  - match:
    - headers:
        x-target-version:
          exact: "v2"
    route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v2

  # 默认走 v1（兜底）
  - route:
    - destination:
        host: myapp.production.svc.cluster.local
        subset: v1

动态调整灰度比例：

修改 Lua 脚本中的 threshold 值即可调整比例：

-- 10% 走 v2
local threshold = 0.1 * 2147483647

-- 50% 走 v2
local threshold = 0.5 * 2147483647

-- 100% 走 v2（全量）
local threshold = 1.0 * 2147483647

方案对比：

方案	版本级一致性	需要应用改造	配置复杂度	推荐场景
DestinationRule consistentHash	❌ 仅 Pod 级	否	低	同版本内负载均衡
应用设置 Cookie	✅	是	中	应用可改造时
VirtualService regex	✅	否	低	简单场景
EnvoyFilter + Lua	✅	否	高	生产推荐

场景三：按调用方路由
#

需求：服务 A 和 B 都调用服务 C，希望只让 A 调用 C 的 v2 版本。

flowchart LR
    A[Service A] -->|走 v2| C
    B[Service B] -->|走 v1| C
    
    subgraph C["Service C"]
        C1[v1]
        C2[v2]
    end

方案 1：基于 Header 的调用方识别
#

Istio Sidecar 会自动注入 x-forwarded-client-cert 头，但更简单的方式是让调用方携带标识。

apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: service-c
  namespace: production
spec:
  hosts:
  - service-c.production.svc.cluster.local
  http:
  # Service A 调用走 v2（部分权重）
  - match:
    - headers:
        x-source-service:
          exact: "service-a"
    route:
    - destination:
        host: service-c.production.svc.cluster.local
        subset: v1
      weight: 70
    - destination:
        host: service-c.production.svc.cluster.local
        subset: v2
      weight: 30

  # Service B 调用全部走 v1
  - match:
    - headers:
        x-source-service:
          exact: "service-b"
    route:
    - destination:
        host: service-c.production.svc.cluster.local
        subset: v1

  # 默认走 v1
  - route:
    - destination:
        host: service-c.production.svc.cluster.local
        subset: v1

调用方配置（Service A）：

# Service A 的 Deployment
spec:
  template:
    spec:
      containers:
      - name: service-a
        env:
        - name: OUTBOUND_HEADER_SOURCE_SERVICE
          value: "service-a"

或者在代码中添加：

req.Header.Set("x-source-service", "service-a")

方案 2：基于 Source 的路由（需要 AuthorizationPolicy 配合）
#

利用 Istio 的 mTLS 身份识别调用方：

apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: service-c
  namespace: production
spec:
  hosts:
  - service-c.production.svc.cluster.local
  http:
  # 基于 source principal 路由
  - match:
    - sourceLabels:
        app: service-a
    route:
    - destination:
        host: service-c.production.svc.cluster.local
        subset: v2

  - route:
    - destination:
        host: service-c.production.svc.cluster.local
        subset: v1

注意：sourceLabels 匹配的是调用方 Pod 的标签，需要调用方也在 Istio 网格内。

场景四：生产泳道（Swimlane）
#

泳道是一种流量隔离机制，用于：

多环境复用同一集群（测试、预发、生产）
特定流量走特定版本链路（全链路灰度）
故障隔离（核心流量与非核心流量分离）

泳道架构
#

flowchart TB
    subgraph 入口["Ingress Gateway"]
        GW[Gateway]
    end
    
    subgraph 泳道1["泳道: production"]
        A1[Service A v1] --> B1[Service B v1] --> C1[Service C v1]
    end
    
    subgraph 泳道2["泳道: gray"]
        A2[Service A v2] --> B2[Service B v2] --> C2[Service C v2]
    end
    
    GW -->|x-swimlane: production| A1
    GW -->|x-swimlane: gray| A2

泳道实现：Header 染色 + 传播
#

核心原理：

入口处根据条件设置泳道 Header（染色）
服务间调用传播泳道 Header
各服务根据泳道 Header 路由到对应版本

Step 1：入口染色
#

apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: frontend
  namespace: production
spec:
  hosts:
  - "*.example.com"
  gateways:
  - istio-system/istio-ingressgateway
  http:
  # 已染色的请求，直接透传
  - match:
    - headers:
        x-swimlane:
          regex: ".+"
    route:
    - destination:
        host: frontend.production.svc.cluster.local
  
  # 灰度用户，染色为 gray 泳道
  - match:
    - headers:
        x-user-id:
          regex: ".*0$"  # 尾号 0 的用户
    headers:
      request:
        set:
          x-swimlane: "gray"
    route:
    - destination:
        host: frontend.production.svc.cluster.local
  
  # 默认用户，染色为 production 泳道
  - headers:
      request:
        set:
          x-swimlane: "production"
    route:
    - destination:
        host: frontend.production.svc.cluster.local

Step 2：服务间 Header 传播
#

确保应用代码传播 x-swimlane Header：

// Go 示例：HTTP 客户端传播 Header
func callDownstream(ctx context.Context, url string) (*http.Response, error) {
    req, _ := http.NewRequestWithContext(ctx, "GET", url, nil)
    
    // 从上游请求中获取泳道 Header
    if swimlane := ctx.Value("x-swimlane"); swimlane != nil {
        req.Header.Set("x-swimlane", swimlane.(string))
    }
    
    // 也传播其他追踪 Header
    propagateHeaders := []string{
        "x-request-id",
        "x-b3-traceid",
        "x-b3-spanid",
        "x-swimlane",
    }
    // ...
    
    return http.DefaultClient.Do(req)
}

Step 3：各服务根据泳道路由
#

# Service A 的路由
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: service-a
  namespace: production
spec:
  hosts:
  - service-a.production.svc.cluster.local
  http:
  - match:
    - headers:
        x-swimlane:
          exact: "gray"
    route:
    - destination:
        host: service-a.production.svc.cluster.local
        subset: v2  # gray 泳道走 v2
  - route:
    - destination:
        host: service-a.production.svc.cluster.local
        subset: v1  # production 泳道走 v1
---
# Service B 的路由（同样配置）
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: service-b
  namespace: production
spec:
  hosts:
  - service-b.production.svc.cluster.local
  http:
  - match:
    - headers:
        x-swimlane:
          exact: "gray"
    route:
    - destination:
        host: service-b.production.svc.cluster.local
        subset: v2
  - route:
    - destination:
        host: service-b.production.svc.cluster.local
        subset: v1
---
# Service C 的路由（同样配置）
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: service-c
  namespace: production
spec:
  hosts:
  - service-c.production.svc.cluster.local
  http:
  - match:
    - headers:
        x-swimlane:
          exact: "gray"
    route:
    - destination:
        host: service-c.production.svc.cluster.local
        subset: v2
  - route:
    - destination:
        host: service-c.production.svc.cluster.local
        subset: v1

泳道 vs 服务级灰度：关键区别
#

泳道和蓝绿/灰度发布是两个不同的概念，不应混用同一个 Header：

特性	泳道（Swimlane）	蓝绿/灰度
作用范围	全链路一致	仅当前服务
Header 传播	必须传播	不应传播
影响下游	是	否
目标	A→B→C 都走同一版本	只控制 A 的版本

问题场景：共用 Header 导致的问题
#

flowchart TB
    subgraph 错误场景["错误场景：Service A 做灰度发布"]
        A[请求进入
设置 x-version: v2] --> B["Service A (v2)
✅ 正确：走 v2"]
        B -->|传播 x-version: v2| C["Service B (v2)
❌ 错误！B 不应该受影响"]
        C -->|传播 x-version: v2| D["Service C (v2)
❌ 错误！C 不应该受影响"]
    end

预期行为：A 走 v2，但 B 和 C 应该走它们自己的默认版本。

正确设计：使用不同的 Header
#

# 泳道 Header（全链路传播）
x-swimlane: gray          # 全链路灰度，A/B/C 都走 gray 泳道

# 服务级版本（仅入口使用，不传播）
# 通过 sourceLabels 或 入口 match 实现，不依赖传播

服务级灰度实现（不传播 Header）
#

# Service A 的 VirtualService - 服务级灰度
apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: service-a
spec:
  hosts:
  - service-a.production.svc.cluster.local
  http:
  # 规则1：来自 Gateway 的流量，根据用户特征灰度
  - match:
    - headers:
        x-user-id:
          regex: ".*[0-2]$"
      sourceLabels:
        istio: ingressgateway  # 只匹配来自 Gateway 的流量
    route:
    - destination:
        host: service-a.production.svc.cluster.local
        subset: v2
  
  # 规则2：内部服务调用，走默认版本（不受灰度影响）
  - route:
    - destination:
        host: service-a.production.svc.cluster.local
        subset: v1

效果：

用户请求 → Gateway → Service A (v2) ✅ 灰度
Service B → Service A (v1) ✅ 内部调用走默认版本

Header 传播机制：为什么 Istio 无法自动传播
#

很多人会问：能否让 Istio/Envoy 自动传播泳道 Header，而不需要应用修改代码？

答案：纯 Istio/Envoy 无法完全自动传播自定义 Header。

原因分析
#

flowchart LR
    subgraph 请求["请求 x-swimlane: gray"]
        R[请求]
    end
    
    subgraph Pod["Pod: app1"]
        E1["Envoy
(入站)
✅ 能看到
x-swimlane"]
        A["App1
(处理)
❓ 是否 set?"]
        E2["Envoy
(出站)
❌ 不知道
原始 header"]
        E1 --> A --> E2
    end
    
    R --> E1
    E2 --> Q["?"]

问题在于：当 App1 发起新的 HTTP 请求调用 App2 时：

这是一个全新的 HTTP 请求
请求的 Headers 完全由 App1 的代码决定
Envoy (outbound) 无法知道"这个出站请求对应哪个入站请求"

// App1 必须主动 set header
func handleRequest(w http.ResponseWriter, r *http.Request) {
    swimlane := r.Header.Get("x-swimlane")  // 从入站请求读取
    
    // 调用 app2 时，必须主动 set
    req, _ := http.NewRequest("GET", "http://app2/api", nil)
    req.Header.Set("x-swimlane", swimlane)  // ← 必须！
    client.Do(req)
}

降低应用成本的传播方案
#

虽然无法完全自动化，但可以通过以下方案大幅降低应用改造成本：

方案 1：OpenTelemetry Baggage（推荐）
#

利用 OpenTelemetry 的 Baggage 机制，只需一次初始化，后续自动传播：

flowchart TB
    subgraph App1["App1 进程内部"]
        subgraph 入站["HTTP 请求进入"]
            H1["Headers:
traceparent: 00-abc123...
baggage: swimlane=haha"]
        end
        
        S["OTel HTTP Server Middleware
自动解析 header → 存入 Context"]
        C["Context
trace + baggage 都在这里"]
        T["OTel HTTP Client Transport
自动从 Context 读取 → 自动 set header"]
        
        subgraph 出站["HTTP 请求发出"]
            H2["Headers:
traceparent: 00-abc123... ✅自动
baggage: swimlane=haha ✅自动"]
        end
        
        H1 --> S --> C --> T --> H2
    end

Go 实现示例：

import (
    "go.opentelemetry.io/otel"
    "go.opentelemetry.io/otel/baggage"
    "go.opentelemetry.io/otel/propagation"
    "go.opentelemetry.io/contrib/instrumentation/net/http/otelhttp"
)

// 1. 初始化（一次性）
func init() {
    otel.SetTextMapPropagator(
        propagation.NewCompositeTextMapPropagator(
            propagation.TraceContext{},
            propagation.Baggage{},  // 启用 Baggage 传播
        ),
    )
}

// 2. HTTP Server middleware - 读取 x-swimlane 存入 Baggage
func swimlaneMiddleware(next http.Handler) http.Handler {
    return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
        ctx := r.Context()
        swimlane := r.Header.Get("x-swimlane")
        if swimlane != "" {
            member, _ := baggage.NewMember("swimlane", swimlane)
            bag, _ := baggage.New(member)
            ctx = baggage.ContextWithBaggage(ctx, bag)
        }
        next.ServeHTTP(w, r.WithContext(ctx))
    })
}

// 3. HTTP Client - 自动从 Baggage 添加 Header
var client = &http.Client{
    Transport: otelhttp.NewTransport(http.DefaultTransport),
}

// 4. 发请求时传 context，OTel 自动注入 baggage header
func callDownstream(ctx context.Context) {
    req, _ := http.NewRequestWithContext(ctx, "GET", "http://app2/api", nil)
    client.Do(req)  // baggage header 自动带上
}

W3C Baggage Header 格式：

baggage: swimlane=gray,user-id=12345,region=beijing

方案 2：轻量级 HTTP Client Wrapper
#

无需完整 OTel，只需包装 HTTP Client：

Go 示例：

type swimlaneTransport struct {
    base http.RoundTripper
}

func (t *swimlaneTransport) RoundTrip(req *http.Request) (*http.Response, error) {
    // 从 Context 获取 swimlane
    if swimlane, ok := req.Context().Value("x-swimlane").(string); ok {
        req.Header.Set("x-swimlane", swimlane)
    }
    return t.base.RoundTrip(req)
}

// 使用
var client = &http.Client{
    Transport: &swimlaneTransport{base: http.DefaultTransport},
}

Java Spring 示例：

@Bean
public RestTemplate restTemplate() {
    RestTemplate template = new RestTemplate();
    template.getInterceptors().add((request, body, execution) -> {
        String swimlane = MDC.get("x-swimlane");
        if (swimlane != null) {
            request.getHeaders().set("x-swimlane", swimlane);
        }
        return execution.execute(request, body);
    });
    return template;
}

// Servlet Filter 读取并存入 MDC
@Bean
public Filter swimlaneFilter() {
    return (req, res, chain) -> {
        String swimlane = ((HttpServletRequest)req).getHeader("x-swimlane");
        if (swimlane != null) {
            MDC.put("x-swimlane", swimlane);
        }
        try {
            chain.doFilter(req, res);
        } finally {
            MDC.remove("x-swimlane");
        }
    };
}

Python Flask 示例：

import threading
import requests

_swimlane = threading.local()

class SwimlaneSession(requests.Session):
    def request(self, method, url, **kwargs):
        headers = kwargs.setdefault('headers', {})
        if hasattr(_swimlane, 'value') and _swimlane.value:
            headers['x-swimlane'] = _swimlane.value
        return super().request(method, url, **kwargs)

# Flask middleware
@app.before_request
def extract_swimlane():
    _swimlane.value = request.headers.get('x-swimlane')

# 使用
session = SwimlaneSession()
session.get('http://app2/api')  # 自动带上 x-swimlane

方案 3：Envoy Lua + x-request-id 关联
#

如果应用已经传播了 x-request-id（很多框架默认支持），可以用 Lua 关联：

apiVersion: networking.istio.io/v1alpha3
kind: EnvoyFilter
metadata:
  name: swimlane-propagation
  namespace: istio-system
spec:
  configPatches:
  # Sidecar Inbound: 存储 request-id -> swimlane 映射
  - applyTo: HTTP_FILTER
    match:
      context: SIDECAR_INBOUND
    patch:
      operation: INSERT_BEFORE
      value:
        name: envoy.filters.http.lua
        typed_config:
          "@type": type.googleapis.com/envoy.extensions.filters.http.lua.v3.Lua
          inlineCode: |
            function envoy_on_request(handle)
              local swimlane = handle:headers():get("x-swimlane")
              local req_id = handle:headers():get("x-request-id")
              if swimlane and req_id then
                local cache = handle:sharedDict():get("swimlane_cache")
                if cache then
                  cache:set(req_id, swimlane, 10)  -- 10秒过期
                end
              end
            end

  # Sidecar Outbound: 从映射中恢复 swimlane
  - applyTo: HTTP_FILTER
    match:
      context: SIDECAR_OUTBOUND
    patch:
      operation: INSERT_BEFORE
      value:
        name: envoy.filters.http.lua
        typed_config:
          "@type": type.googleapis.com/envoy.extensions.filters.http.lua.v3.Lua
          inlineCode: |
            function envoy_on_request(handle)
              if handle:headers():get("x-swimlane") then
                return  -- 应用已经传了，不覆盖
              end
              local req_id = handle:headers():get("x-request-id")
              if req_id then
                local cache = handle:sharedDict():get("swimlane_cache")
                if cache then
                  local swimlane = cache:get(req_id)
                  if swimlane then
                    handle:headers():add("x-swimlane", swimlane)
                  end
                end
              end
            end

限制：需要应用传播 x-request-id，且有过期时间限制。

传播方案对比
#

方案	应用改动	可靠性	复杂度	推荐场景
OTel Baggage	初始化 + middleware	⭐⭐⭐⭐⭐	中	新应用/已用 OTel
HTTP Client Wrapper	包装 Client	⭐⭐⭐⭐	低	老应用改造
Lua + x-request-id	传播 x-request-id	⭐⭐⭐	高	临时方案
手动传播	每个调用点改代码	⭐⭐	低	少量服务

泳道回落（Fallback）
#

如果某个服务在灰度泳道没有 v2 版本，需要回落到 production 泳道。

重要提示：Istio 不会自动 fallback 到其他 subset。如果指定的 subset 没有健康实例，Envoy 会返回 503 错误，而不是尝试其他 subset。

错误示例（不会自动 fallback）
#

apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: service-b
spec:
  hosts:
  - service-b.production.svc.cluster.local
  http:
  - match:
    - headers:
        x-swimlane:
          exact: "gray"
    route:
    - destination:
        host: service-b.production.svc.cluster.local
        subset: v2  # 如果 v2 无实例，直接返回 503，不会 fallback！
  - route:
    - destination:
        host: service-b.production.svc.cluster.local
        subset: v1

正确方案 1：VirtualService 配置双目的地
#

通过权重配置，当 v2 无实例时流量自然走 v1：

apiVersion: networking.istio.io/v1beta1
kind: VirtualService
metadata:
  name: service-b
spec:
  hosts:
  - service-b.production.svc.cluster.local
  http:
  - match:
    - headers:
        x-swimlane:
          exact: "gray"
    route:
    - destination:
        host: service-b.production.svc.cluster.local
        subset: v2
      weight: 100
    - destination:
        host: service-b.production.svc.cluster.local
        subset: v1
      weight: 0    # 权重为 0，但作为 fallback 存在
    # 注意：这种方式在 v2 完全无实例时可能仍返回 503
  - route:
    - destination:
        host: service-b.production.svc.cluster.local
        subset: v1

正确方案 2：应用层 Fallback（推荐）
#

让调用方处理 503 错误并重试到 v1：

func callServiceB(ctx context.Context, swimlane string) (*Response, error) {
    // 首先尝试目标泳道
    resp, err := callWithHeader(ctx, "x-swimlane", swimlane)
    
    // 如果返回 503，fallback 到 production 泳道
    if err != nil || resp.StatusCode == 503 {
        if swimlane == "gray" {
            log.Warn("gray swimlane unavailable, fallback to production")
            return callWithHeader(ctx, "x-swimlane", "production")
        }
    }
    return resp, err
}

正确方案 3：使用 Flagger 自动回滚
#

Flagger 可以监控金丝雀版本的健康状态，自动回滚：

apiVersion: flagger.app/v1beta1
kind: Canary
metadata:
  name: service-b
spec:
  targetRef:
    apiVersion: apps/v1
    kind: Deployment
    name: service-b-v2
  service:
    port: 80
  analysis:
    threshold: 3           # 失败 3 次自动回滚
    metrics:
    - name: request-success-rate
      threshold: 99
    - name: request-duration
      threshold: 500

Istio 相关机制说明
#

机制	说明	是否跨 Subset Fallback
Panic Mode	健康实例 < 阈值时，流量发到所有实例（含不健康）	❌ 同 subset 内
Locality Failover	跨区域/可用区故障转移	❌ 同 subset 内
Retry	失败重试	❌ 同 subset 内
应用层 Fallback	捕获错误，重试其他版本	✅
Flagger	自动金丝雀分析和回滚	✅

最佳实践
#

1. 灰度发布检查清单
#

□ DestinationRule 定义了新旧版本 subset
□ VirtualService 配置了灰度规则
□ 灰度入口有鉴权（防止伪造 Header）
□ 监控告警已配置（新版本错误率、延迟）
□ 回滚方案已准备
□ 观察期计划已制定

2. Header 传播配置
#

在 Istio 配置中启用 Header 传播：

apiVersion: install.istio.io/v1alpha1
kind: IstioOperator
spec:
  meshConfig:
    defaultConfig:
      tracing:
        sampling: 100
      # 自定义传播的 Header
      proxyHeaders:
        forwardedClientCert: SANITIZE_SET

应用侧需要主动传播自定义 Header（如 x-swimlane）。

3. 灰度流量可观测
#

# 通过 EnvoyFilter 添加日志字段
apiVersion: networking.istio.io/v1alpha3
kind: EnvoyFilter
metadata:
  name: gray-logging
  namespace: istio-system
spec:
  configPatches:
  - applyTo: NETWORK_FILTER
    match:
      context: ANY
      listener:
        filterChain:
          filter:
            name: envoy.filters.network.http_connection_manager
    patch:
      operation: MERGE
      value:
        typed_config:
          "@type": type.googleapis.com/envoy.extensions.filters.network.http_connection_manager.v3.HttpConnectionManager
          access_log:
          - name: envoy.access_loggers.file
            typed_config:
              "@type": type.googleapis.com/envoy.extensions.access_loggers.file.v3.FileAccessLog
              path: /dev/stdout
              log_format:
                json_format:
                  swimlane: "%REQ(x-swimlane)%"
                  version: "%REQ(x-app-version)%"

4. 发布流程自动化
#

#!/bin/bash
# deploy.sh - 灰度发布脚本

PHASE=$1
VERSION=$2

case $PHASE in
  "internal")
    echo "阶段1: 内灰 - 仅内部测试流量进入 $VERSION"
    kubectl apply -f virtualservice-internal-gray.yaml
    ;;
  "external")
    echo "阶段2: 外灰 - 特定用户流量进入 $VERSION"
    kubectl apply -f virtualservice-external-gray.yaml
    ;;
  "canary")
    WEIGHT=$3
    echo "阶段3: 切流 - $WEIGHT% 流量进入 $VERSION"
    sed "s/WEIGHT_PLACEHOLDER/$WEIGHT/g" virtualservice-canary.yaml | kubectl apply -f -
    ;;
  "full")
    echo "阶段4: 全量 - 100% 流量进入 $VERSION"
    kubectl apply -f virtualservice-full.yaml
    ;;
  "rollback")
    echo "紧急回滚 - 流量切回 v1"
    kubectl apply -f virtualservice-rollback.yaml
    ;;
esac

总结
#

流量控制策略对比
#

场景	核心机制	关键配置
内灰	Header 匹配	`match.headers` + 鉴权 Token
外灰	用户特征匹配	`match.headers` + regex
切流	权重分配	`route.weight`
用户粘性	Cookie + 路由	应用设置 Cookie + `match.headers`
按调用方	Source 识别	`sourceLabels` 或自定义 Header
泳道	Header 染色 + 传播	入口染色 + 应用传播 + 服务路由

泳道 vs 服务级灰度
#

对比项	泳道（全链路）	服务级灰度
作用范围	A→B→C 全链路	仅当前服务
Header 传播	需要应用传播	不需要传播
实现方式	OTel Baggage / SDK	sourceLabels / 入口匹配
使用场景	全链路灰度、多环境复用	单服务发布、蓝绿部署

Header 传播方案选择
#

方案	改动成本	可靠性	推荐场景
OTel Baggage	初始化配置	高	新应用、已集成 OTel
HTTP Client Wrapper	包装 Client	高	老应用改造
Lua + x-request-id	无需改应用	中	临时方案

流量控制是 Istio 最强大的能力之一，合理运用可以实现精细化的发布管理，保障生产环境的稳定性。关键是根据场景选择合适的策略：服务级发布用 sourceLabels，全链路灰度用泳道 + Header 传播。

Istio 实战 - 这篇文章属于一个选集。

§ 2: 本文

§ 16: Istio 超时与重试机制详解

§ 17: Istio Wasm 插件开发指南

Istio Gateway 生产部署最佳实践

2025年12月27日·1141 字·6 分钟

云原生 Kubernetes Istio Ingress 网关最佳实践

Istio Gateway 部署方式深度分析

2025年12月27日·7011 字·33 分钟

云原生 Kubernetes Istio Ingress 网关架构

Linux 连接跟踪表 conntrack 详解

2025年12月28日·1746 字·9 分钟

网络 Linux 网络 Conntrack Iptables Kubernetes

概述#

基础配置#

场景一：完整灰度发布流程#

发布阶段流程#

阶段 1：内灰（0 线上流量）#

阶段 2：外灰（特定用户）#

阶段 3：按权重切流#

阶段 4：全量发布 + 观察期#

紧急回滚#

进阶方案：EnvoyFilter + Lua 实现灵活灰度控制#

架构设计#

灰度配置（ConfigMap）#

EnvoyFilter + Lua 实现#

简化的 VirtualService#

方案对比#

运维操作#

场景二：用户会话粘性#

方案 1：基于 Cookie 的粘性#

方案 2：服务端设置版本 Cookie（推荐）#

方案 3：基于用户 ID 的一致性哈希#

方案 4：EnvoyFilter + Lua 实现版本级一致性哈希（推荐）#

场景三：按调用方路由#

方案 1：基于 Header 的调用方识别#

方案 2：基于 Source 的路由（需要 AuthorizationPolicy 配合）#

场景四：生产泳道（Swimlane）#

泳道架构#

泳道实现：Header 染色 + 传播#

Step 1：入口染色#

Step 2：服务间 Header 传播#

Step 3：各服务根据泳道路由#

泳道 vs 服务级灰度：关键区别#

问题场景：共用 Header 导致的问题#

正确设计：使用不同的 Header#

服务级灰度实现（不传播 Header）#

Header 传播机制：为什么 Istio 无法自动传播#

原因分析#

降低应用成本的传播方案#

方案 1：OpenTelemetry Baggage（推荐）#

方案 2：轻量级 HTTP Client Wrapper#

方案 3：Envoy Lua + x-request-id 关联#

传播方案对比#

泳道回落（Fallback）#

错误示例（不会自动 fallback）#

正确方案 1：VirtualService 配置双目的地#

正确方案 2：应用层 Fallback（推荐）#

正确方案 3：使用 Flagger 自动回滚#

Istio 相关机制说明#

最佳实践#

1. 灰度发布检查清单#

2. Header 传播配置#

3. 灰度流量可观测#

4. 发布流程自动化#

总结#

流量控制策略对比#

泳道 vs 服务级灰度#

Header 传播方案选择#

相关文章