Docker Compose 配置详解

博主： shpym
发布时间：2026 年 02 月 25 日
117 次浏览
暂无评论
19127字数
分类：默认分类 Docker容器化

Docker Compose 配置详解

基于 docker-compose.yaml，逐块拆解各配置的含义与设计思路。

源码解析

docker-compose.yaml

# Docker Compose - 本地开发环境
# 一键启动所有依赖服务（PostgreSQL、Redis、Prometheus、Grafana）
# 使用方式：docker-compose up -d
#
# 这模拟了生产环境中各个组件的配合：
# - 应用服务（saas-shortener）
# - 数据库（PostgreSQL） → 生产环境用云数据库（如 AWS RDS）
# - 缓存（Redis）→ 生产环境用云 Redis（如 AWS ElastiCache）
# - 监控（Prometheus）→ 生产环境用 Prometheus Operator
# - 可视化（Grafana）→ 生产环境用 Grafana Cloud 或自建
#
# ==================== 资源规划（2C4G 服务器） ====================
# 组件          | 内存限制 | 说明
# Go 应用       | 128MB   | Go 二进制非常轻量
# PostgreSQL    | 512MB   | 适当限制 shared_buffers
# Redis         | 128MB   | 学习项目数据量小
# Prometheus    | 512MB   | 限制时序数据内存占用
# Grafana       | 256MB   | 前端服务
# 合计          | ~1.5GB  | 预留约 2.5GB 给系统和 Docker 引擎

services:
  # ==================== 应用服务 ====================
  app:
    build:
      context: ../../
      dockerfile: deploy/docker/Dockerfile
    container_name: saas-shortener
    ports:
      - "8080:8080"
    environment:
      # 12-Factor App: 所有配置通过环境变量注入
      - APP_ENV=development
      - SERVER_PORT=8080
      - DB_HOST=postgres
      - DB_PORT=5432
      - DB_USER=postgres
      - DB_PASSWORD=postgres
      - DB_NAME=saas_shortener
      - DB_SSLMODE=disable
      - REDIS_ADDR=redis:6379
      - REDIS_PASSWORD=
      - TENANT_DEFAULT_RATE_LIMIT=100
      - TENANT_MAX_URLS=1000
      # Go 运行时优化（小内存服务器）
      - GOMAXPROCS=2
      - GOMEMLIMIT=100MiB
    depends_on:
      postgres:
        condition: service_healthy
      redis:
        condition: service_healthy
    restart: unless-stopped
    # 资源限制 - 防止单个容器吃满内存导致 OOM
    deploy:
      resources:
        limits:
          cpus: '0.5'
          memory: 128M
        reservations:
          cpus: '0.1'
          memory: 32M
    networks:
      - saas-network

  # ==================== PostgreSQL 数据库 ====================
  postgres:
    image: postgres:16-alpine
    container_name: saas-postgres
    restart: unless-stopped
    ports:
      - "5432:5432"
    environment:
      - POSTGRES_USER=postgres
      - POSTGRES_PASSWORD=postgres
      - POSTGRES_DB=saas_shortener
    # PostgreSQL 内存优化（适配小内存服务器）
    command:
      - "postgres"
      - "-c" 
      - "shared_buffers=128MB"        # 默认128MB，小服务器够用
      - "-c"
      - "effective_cache_size=256MB"   # 告诉优化器可用缓存大小
      - "-c"
      - "work_mem=4MB"                # 每个排序/哈希操作的内存
      - "-c"
      - "max_connections=50"          # 限制最大连接数（默认100太多）
    volumes:
      - postgres_data:/var/lib/postgresql/data
    healthcheck:
      test: ["CMD-SHELL", "pg_isready -U postgres"]
      interval: 5s
      timeout: 5s
      retries: 5
    deploy:
      resources:
        limits:
          cpus: '0.8'
          memory: 512M
        reservations:
          cpus: '0.2'
          memory: 128M
    networks:
      - saas-network

  # ==================== Redis 缓存 ====================
  redis:
    image: redis:7-alpine
    container_name: saas-redis
    restart: unless-stopped
    ports:
      - "6379:6379"
    # Redis 内存限制：超过后使用 LRU 策略淘汰旧数据
    command: redis-server --maxmemory 64mb --maxmemory-policy allkeys-lru
    volumes:
      - redis_data:/data
    healthcheck:
      test: ["CMD", "redis-cli", "ping"]
      interval: 5s
      timeout: 5s
      retries: 5
    deploy:
      resources:
        limits:
          cpus: '0.3'
          memory: 128M
        reservations:
          cpus: '0.1'
          memory: 32M
    networks:
      - saas-network

  # ==================== Prometheus 监控 ====================
  # Prometheus 是云原生监控的事实标准
  # 它通过 Pull 模式定期从应用的 /metrics 端点拉取指标数据
  prometheus:
    image: prom/prometheus:v2.51.0
    container_name: saas-prometheus
    restart: unless-stopped
    ports:
      - "9090:9090"
    volumes:
      - ../k8s/monitoring/prometheus.yaml:/etc/prometheus/prometheus.yml
      - prometheus_data:/prometheus
    command:
      - '--config.file=/etc/prometheus/prometheus.yml'
      - '--storage.tsdb.path=/prometheus'
      - '--storage.tsdb.retention.time=7d'        # 只保留7天数据（节省磁盘）
      - '--storage.tsdb.retention.size=1GB'        # 最多占用1GB磁盘
      - '--web.console.libraries=/etc/prometheus/console_libraries'
      - '--web.console.templates=/etc/prometheus/consoles'
      - '--web.enable-lifecycle'
    deploy:
      resources:
        limits:
          cpus: '0.5'
          memory: 512M
        reservations:
          cpus: '0.1'
          memory: 128M
    networks:
      - saas-network

  # ==================== Grafana 可视化 ====================
  # Grafana 是最流行的监控可视化平台
  # 它从 Prometheus 读取指标数据，展示成漂亮的仪表盘
  grafana:
    image: grafana/grafana:10.4.0
    container_name: saas-grafana
    restart: unless-stopped
    ports:
      - "3000:3000"
    environment:
      - GF_SECURITY_ADMIN_USER=admin
      - GF_SECURITY_ADMIN_PASSWORD=admin
      - GF_USERS_ALLOW_SIGN_UP=false
    volumes:
      - grafana_data:/var/lib/grafana
      - ../k8s/monitoring/grafana-datasource.yaml:/etc/grafana/provisioning/datasources/datasource.yaml
    depends_on:
      - prometheus
    deploy:
      resources:
        limits:
          cpus: '0.5'
          memory: 256M
        reservations:
          cpus: '0.1'
          memory: 64M
    networks:
      - saas-network

# ==================== 持久化存储 ====================
volumes:
  postgres_data:
  redis_data:
  prometheus_data:
  grafana_data:

# ==================== 网络 ====================
networks:
  saas-network:
    driver: bridge

整体架构概览

本项目用 Docker Compose 一键启动完整的本地开发环境，包含 5 个服务：

┌──────────────────────────────────────────────────────────────┐
│                      saas-network (bridge)                    │
│                                                              │
│  ┌──────────┐    ┌──────────┐    ┌──────────┐               │
│  │ Postgres │    │  Redis   │    │Prometheus│               │
│  │  :5432   │    │  :6379   │    │  :9090   │               │
│  │  512M    │    │  128M    │    │  512M    │               │
│  └────┬─────┘    └────┬─────┘    └────┬─────┘               │
│       │               │               │                      │
│       │ depends_on    │ depends_on    │ depends_on           │
│       │ (healthy)     │ (healthy)     │                      │
│       ▼               ▼               ▼                      │
│  ┌─────────────────────────┐    ┌──────────┐                │
│  │          App            │    │ Grafana  │                │
│  │        :8080            │    │  :3000   │                │
│  │         128M            │    │  256M    │                │
│  └─────────────────────────┘    └──────────┘                │
│                                                              │
│  总计 ≈ 1.5GB，预留 ~2.5GB 给系统和 Docker 引擎               │
└──────────────────────────────────────────────────────────────┘

组件	镜像	端口	内存限制	角色
App	本地构建	8080	128M	Go 应用服务
PostgreSQL	postgres:16-alpine	5432	512M	关系型数据库
Redis	redis:7-alpine	6379	128M	缓存
Prometheus	prom/prometheus:v2.51.0	9090	512M	指标采集
Grafana	grafana/grafana:10.4.0	3000	256M	监控可视化

应用服务（App）

app:
  build:
    context: ../../
    dockerfile: deploy/docker/Dockerfile
  container_name: saas-shortener
  ports:
    - "8080:8080"
  environment:
    - APP_ENV=development
    - SERVER_PORT=8080
    - DB_HOST=postgres
    - DB_PORT=5432
    - DB_USER=postgres
    - DB_PASSWORD=postgres
    - DB_NAME=saas_shortener
    - DB_SSLMODE=disable
    - REDIS_ADDR=redis:6379
    - REDIS_PASSWORD=
    - TENANT_DEFAULT_RATE_LIMIT=100
    - TENANT_MAX_URLS=1000
    - GOMAXPROCS=2
    - GOMEMLIMIT=100MiB
  depends_on:
    postgres:
      condition: service_healthy
    redis:
      condition: service_healthy
  restart: unless-stopped

关键配置解读

build — 多阶段构建

context: ../../ 表示构建上下文是项目根目录
dockerfile 指向 deploy/docker/Dockerfile，使用多阶段构建生成轻量镜像

environment — 12-Factor App 原则

所有配置通过环境变量注入，而不是硬编码在代码中。好处：

同一份代码可以在开发、测试、生产环境运行，只需切换环境变量
敏感信息（密码等）不进入代码仓库
DB_HOST=postgres 这里直接写的是服务名，Docker 网络会自动把它解析为对应容器的 IP

Go 运行时优化（小内存服务器）

变量	值	作用
`GOMAXPROCS`	2	限制 Go 使用的 CPU 核心数，避免过度调度
`GOMEMLIMIT`	100MiB	Go 1.19+ 软内存上限，GC 会在接近此值时更积极回收

在容器环境中，Go 默认会检测到宿主机的全部 CPU 核心，而不是容器限制的核心数。手动设置 GOMAXPROCS 可以避免不必要的线程开销。

depends_on + condition: service_healthy

depends_on:
  postgres:
    condition: service_healthy
  redis:
    condition: service_healthy

这确保了启动顺序：App 会等 PostgreSQL 和 Redis 的健康检查通过后才启动。如果只写 depends_on: [postgres]，Docker 只保证容器启动，不保证服务就绪——数据库可能还在初始化，App 连接就会失败。

restart: unless-stopped

策略	行为
`no`	默认，不自动重启
`always`	总是重启，包括手动停止后重启 Docker 也会拉起
`unless-stopped`	自动重启，但手动 `docker stop` 后不会再拉起
`on-failure`	只在非零退出码时重启

unless-stopped 是开发环境的最佳选择——崩溃自动恢复，手动停止时不纠缠。

资源限制（deploy.resources）

deploy:
  resources:
    limits:
      cpus: '0.5'
      memory: 128M
    reservations:
      cpus: '0.1'
      memory: 32M

limits vs reservations

配置	含义	类比
`limits`	硬上限，超过会被 OOM Kill 或 CPU 节流	信用卡额度
`reservations`	预留资源，Docker 保证至少分配这么多	银行保底余额

为什么要设置资源限制？

在 2C4G 的小服务器上，如果不限制：

一个容器内存泄漏可能吃掉所有内存，导致其他容器被 OOM Kill
一个服务 CPU 飙升可能饿死其他服务

设置限制后，每个容器都在自己的"格子"里运行，互不影响。

资源规划

组件	CPU 限制	内存限制	CPU 预留	内存预留
App	0.5 核	128M	0.1 核	32M
PostgreSQL	0.8 核	512M	0.2 核	128M
Redis	0.3 核	128M	0.1 核	32M
Prometheus	0.5 核	512M	0.1 核	128M
Grafana	0.5 核	256M	0.1 核	64M
合计	2.6 核	1.5 GB	0.6 核	384M

limits 合计可以超过物理资源（超卖），因为不是每个服务都同时满载。reservations 合计不应超过物理资源。

PostgreSQL 数据库

postgres:
  image: postgres:16-alpine
  command:
    - "postgres"
    - "-c"
    - "shared_buffers=128MB"
    - "-c"
    - "effective_cache_size=256MB"
    - "-c"
    - "work_mem=4MB"
    - "-c"
    - "max_connections=50"

内存参数调优

参数	值	说明
`shared_buffers`	128MB	PostgreSQL 用于缓存数据页的共享内存，通常设为可用内存的 25%
`effective_cache_size`	256MB	告诉查询优化器"系统总共有多少缓存可用"，影响查询计划选择
`work_mem`	4MB	单个排序/哈希操作可用内存，注意是每个操作，并发高时实际占用 = work_mem × 并发数
`max_connections`	50	最大连接数，默认 100 太多，每个连接占用 ~5-10MB 内存

shared_buffers 和 effective_cache_size 的区别：前者是 PostgreSQL 自己管理的缓存，后者是告诉优化器"操作系统文件缓存 + shared_buffers 一共有多少"，帮助它决定用索引扫描还是全表扫描。

Redis 缓存

redis:
  image: redis:7-alpine
  command: redis-server --maxmemory 64mb --maxmemory-policy allkeys-lru

内存淘汰策略

参数	值	说明
`--maxmemory`	64mb	Redis 最大使用内存
`--maxmemory-policy`	allkeys-lru	内存满时的淘汰策略

常见淘汰策略对比：

策略	行为	适用场景
`noeviction`	内存满了直接报错	不允许丢数据
`allkeys-lru`	淘汰所有 key 中最近最少使用的	通用缓存（本项目选用）
`volatile-lru`	只淘汰设置了过期时间的 key 中最少使用的	部分 key 需要永久保留
`allkeys-random`	随机淘汰	无明显访问模式

allkeys-lru 是缓存场景的最佳选择——总是保留最"热"的数据，冷数据自动淘汰。

Prometheus 监控

prometheus:
  image: prom/prometheus:v2.51.0
  volumes:
    - ../k8s/monitoring/prometheus.yaml:/etc/prometheus/prometheus.yml
    - prometheus_data:/prometheus
  command:
    - '--config.file=/etc/prometheus/prometheus.yml'
    - '--storage.tsdb.path=/prometheus'
    - '--storage.tsdb.retention.time=7d'
    - '--storage.tsdb.retention.size=1GB'
    - '--web.enable-lifecycle'

存储参数说明

参数	值	说明
`storage.tsdb.retention.time`	7d	时序数据只保留 7 天
`storage.tsdb.retention.size`	1GB	最多占用 1GB 磁盘
`web.enable-lifecycle`	-	允许通过 HTTP API 热重载配置

Prometheus 采用 Pull 模式：它主动定期访问应用的 /metrics 端点拉取指标，而不是应用主动推送。这样应用不需要知道监控系统的存在，解耦更彻底。

Grafana 可视化

grafana:
  image: grafana/grafana:10.4.0
  environment:
    - GF_SECURITY_ADMIN_USER=admin
    - GF_SECURITY_ADMIN_PASSWORD=admin
    - GF_USERS_ALLOW_SIGN_UP=false
  volumes:
    - grafana_data:/var/lib/grafana
    - ../k8s/monitoring/grafana-datasource.yaml:/etc/grafana/provisioning/datasources/datasource.yaml
  depends_on:
    - prometheus

GF_USERS_ALLOW_SIGN_UP=false：禁止自助注册，开发环境用 admin/admin 登录即可
数据源配置通过文件挂载自动导入，启动即连接 Prometheus，无需手动配置
depends_on: prometheus 保证 Grafana 在 Prometheus 之后启动

网络（Networks）

networks:
  saas-network:
    driver: bridge

各部分含义

networks: — 定义自定义网络，让容器间安全通信
saas-network — 网络名称，仅在此 Docker Compose 项目内可见
driver: bridge — 桥接网络驱动，Docker 默认的网络类型，提供容器间隔离通信、端口映射和 DNS 解析

服务间通信示例

services:
  app:
    networks:
      - saas-network

  postgres:
    networks:
      - saas-network

这样配置后：

App 服务可以通过 postgres 主机名访问数据库（DB_HOST=postgres）
服务间通过服务名互相访问，Docker 内置 DNS 自动解析为容器 IP
外部无法直接访问这些服务（除非通过 ports 明确暴露）

2026-02-24T16:39:18.png

Docker 网络自动分配机制

Docker Engine 自动处理以下过程：

创建 saas-network 网络（通常分配 172.x.x.0/16 网段）
每个容器加入网络时，自动分配唯一 IP 地址
建立容器名 → IP 的 DNS 映射

# 查看容器实际分配的 IP
docker exec saas-postgres hostname -i

为什么不需要手动指定 IP？

动态分配：IP 地址根据可用范围自动分配，避免冲突
DNS 解析：通过服务名 postgres 自动解析，不依赖具体 IP
这就是为什么配置中写 DB_HOST=postgres 而不是具体 IP 地址——Docker 网络系统自动处理底层细节

健康检查（Healthcheck）

healthcheck:
  test: ["CMD-SHELL", "pg_isready -U postgres"]
  interval: 5s
  timeout: 5s
  retries: 5

参数详解

参数	值	含义
`test`	`pg_isready -U postgres`	PostgreSQL 内置检查工具，验证数据库是否接受连接
`interval`	5s	每 5 秒执行一次检查
`timeout`	5s	单次检查最多等待 5 秒
`retries`	5	连续失败 5 次才标记为 unhealthy

各服务的健康检查方式

服务	检查命令	原理
PostgreSQL	`pg_isready -U postgres`	专用工具检测数据库连接
Redis	`redis-cli ping`	发送 PING 命令，期望返回 PONG
App	`wget --spider http://localhost:8080/healthz`	HTTP 请求健康检查端点

健康状态流转

容器启动
   │
   ▼
starting ──(interval)──► 执行 test 命令
                              │
                    ┌─────────┴─────────┐
                    ▼                   ▼
                  成功               失败
                    │                   │
                    ▼                   ▼
               healthy          重试（最多 retries 次）
                                        │
                                        ▼
                                   unhealthy

为什么重要？

服务依赖管理：depends_on + condition: service_healthy 确保数据库就绪后再启动 App
故障自动恢复：配合 restart: unless-stopped，不健康的容器会被自动重启
避免级联故障：防止 App 向还未就绪的数据库发送请求导致启动失败

持久化存储（Volumes）

volumes:
  postgres_data:
  redis_data:
  prometheus_data:
  grafana_data:

命名卷 vs 匿名卷

命名卷（Named Volumes）— 本项目使用的方式：

volumes:
  postgres_data:                    # 顶层声明

services:
  postgres:
    volumes:
      - postgres_data:/var/lib/postgresql/data  # 引用命名卷

匿名卷（Anonymous Volumes）：

services:
  app:
    volumes:
      - /app/data    # 只有容器内路径，没有名称

两者对比：

特性	命名卷	匿名卷
有明确名称	✅	❌
`docker-compose down` 时保留	✅ 默认保留	❌ 默认删除
易于管理和备份	✅	❌
可在多个服务间共享	✅	❌

各服务的持久化内容

卷名	挂载路径	存储内容
`postgres_data`	`/var/lib/postgresql/data`	数据库文件
`redis_data`	`/data`	Redis RDB/AOF 持久化文件
`prometheus_data`	`/prometheus`	时序指标数据
`grafana_data`	`/var/lib/grafana`	仪表盘配置、用户数据

查看和管理卷

# 查看所有命名卷
docker volume ls

# 查看某个卷的详细信息（存储位置等）
docker volume inspect saas-shortener_postgres_data

停止 Docker Compose 环境

.PHONY: docker-down
docker-down:
	docker compose -f $(DOCKER_COMPOSE_LOCAL) down

停止过程

向所有容器发送 SIGTERM 信号（通知优雅退出）
等待超时时间（默认 10 秒）
对仍在运行的容器发送 SIGKILL 强制终止

注意：所有容器几乎同时收到停止信号，不会按照 depends_on 的反向顺序停止。

自定义优雅停止时间

services:
  app:
    stop_grace_period: 30s    # 给应用更多时间处理完当前请求

控制删除行为

# 默认：停止容器、移除网络，保留命名卷
docker-compose down

# 删除所有卷（包括命名卷，数据库数据会丢失！）
docker-compose down -v

# 移除孤立容器（Compose 文件中已删除但仍在运行的服务）
docker-compose down --remove-orphans

⚠️ 生产环境慎用 -v 参数，它会删除包括数据库在内的所有数据！

最后修改：2026 年 02 月 25 日

如果觉得我的文章对你有用，请随意赞赏

发表评论取消回复
使用cookie技术保留您的个人信息以便您下次快速评论，继续评论表示您已同意该条款

评论 *

私密评论

名称 *

🎲

邮箱 *

地址

Docker Compose 配置详解

shpym • 2026 年 02 月 25 日

<h1><a id="content-docker-compose-配置详解" href="#content-docker-compose-配置详解" class="heading-permalink" aria-hidden="true" title="Permalink"></a>Docker Compose 配置详解</h1>
<blockquote>
<p>基于 <code>docker-compose.yaml</code>，逐块拆解各配置的含义与设计思路。</p>
</blockquote>
<hr />
<h2><a id="content-源码解析" href="#content-源码解析" class="heading-permalink" aria-hidden="true" title="Permalink"></a>源码解析</h2>
<p><div class="panel panel-default collapse-panel box-shadow-wrap-lg"><div class="panel-heading panel-collapse" data-toggle="collapse" data-target="#collapse-9669264d91c56e53b91c074b6268fdaa28" aria-expanded="true"><div class="accordion-toggle"><span style="">docker-compose.yaml</span>
<i class="pull-right fontello icon-fw fontello-angle-right"></i>
</div>
</div>
<div class="panel-body collapse-panel-body">
<div id="collapse-9669264d91c56e53b91c074b6268fdaa28" class="collapse collapse-content"><p></p></p>
<pre><code># Docker Compose - 本地开发环境
# 一键启动所有依赖服务（PostgreSQL、Redis、Prometheus、Grafana）
# 使用方式：docker-compose up -d
#
# 这模拟了生产环境中各个组件的配合：
# - 应用服务（saas-shortener）
# - 数据库（PostgreSQL） → 生产环境用云数据库（如 AWS RDS）
# - 缓存（Redis）→ 生产环境用云 Redis（如 AWS ElastiCache）
# - 监控（Prometheus）→ 生产环境用 Prometheus Operator
# - 可视化（Grafana）→ 生产环境用 Grafana Cloud 或自建
#
# ==================== 资源规划（2C4G 服务器） ====================
# 组件          | 内存限制 | 说明
# Go 应用       | 128MB   | Go 二进制非常轻量
# PostgreSQL    | 512MB   | 适当限制 shared_buffers
# Redis         | 128MB   | 学习项目数据量小
# Prometheus    | 512MB   | 限制时序数据内存占用
# Grafana       | 256MB   | 前端服务
# 合计          | ~1.5GB  | 预留约 2.5GB 给系统和 Docker 引擎

services:
  # ==================== 应用服务 ====================
  app:
    build:
      context: ../../
      dockerfile: deploy/docker/Dockerfile
    container_name: saas-shortener
    ports:
      - &quot;8080:8080&quot;
    environment:
      # 12-Factor App: 所有配置通过环境变量注入
      - APP_ENV=development
      - SERVER_PORT=8080
      - DB_HOST=postgres
      - DB_PORT=5432
      - DB_USER=postgres
      - DB_PASSWORD=postgres
      - DB_NAME=saas_shortener
      - DB_SSLMODE=disable
      - REDIS_ADDR=redis:6379
      - REDIS_PASSWORD=
      - TENANT_DEFAULT_RATE_LIMIT=100
      - TENANT_MAX_URLS=1000
      # Go 运行时优化（小内存服务器）
      - GOMAXPROCS=2
      - GOMEMLIMIT=100MiB
    depends_on:
      postgres:
        condition: service_healthy
      redis:
        condition: service_healthy
    restart: unless-stopped
    # 资源限制 - 防止单个容器吃满内存导致 OOM
    deploy:
      resources:
        limits:
          cpus: '0.5'
          memory: 128M
        reservations:
          cpus: '0.1'
          memory: 32M
    networks:
      - saas-network

# ==================== PostgreSQL 数据库 ====================
  postgres:
    image: postgres:16-alpine
    container_name: saas-postgres
    restart: unless-stopped
    ports:
      - &quot;5432:5432&quot;
    environment:
      - POSTGRES_USER=postgres
      - POSTGRES_PASSWORD=postgres
      - POSTGRES_DB=saas_shortener
    # PostgreSQL 内存优化（适配小内存服务器）
    command:
      - &quot;postgres&quot;
      - &quot;-c&quot; 
      - &quot;shared_buffers=128MB&quot;        # 默认128MB，小服务器够用
      - &quot;-c&quot;
      - &quot;effective_cache_size=256MB&quot;   # 告诉优化器可用缓存大小
      - &quot;-c&quot;
      - &quot;work_mem=4MB&quot;                # 每个排序/哈希操作的内存
      - &quot;-c&quot;
      - &quot;max_connections=50&quot;          # 限制最大连接数（默认100太多）
    volumes:
      - postgres_data:/var/lib/postgresql/data
    healthcheck:
      test: [&quot;CMD-SHELL&quot;, &quot;pg_isready -U postgres&quot;]
      interval: 5s
      timeout: 5s
      retries: 5
    deploy:
      resources:
        limits:
          cpus: '0.8'
          memory: 512M
        reservations:
          cpus: '0.2'
          memory: 128M
    networks:
      - saas-network

# ==================== Redis 缓存 ====================
  redis:
    image: redis:7-alpine
    container_name: saas-redis
    restart: unless-stopped
    ports:
      - &quot;6379:6379&quot;
    # Redis 内存限制：超过后使用 LRU 策略淘汰旧数据
    command: redis-server --maxmemory 64mb --maxmemory-policy allkeys-lru
    volumes:
      - redis_data:/data
    healthcheck:
      test: [&quot;CMD&quot;, &quot;redis-cli&quot;, &quot;ping&quot;]
      interval: 5s
      timeout: 5s
      retries: 5
    deploy:
      resources:
        limits:
          cpus: '0.3'
          memory: 128M
        reservations:
          cpus: '0.1'
          memory: 32M
    networks:
      - saas-network

# ==================== Prometheus 监控 ====================
  # Prometheus 是云原生监控的事实标准
  # 它通过 Pull 模式定期从应用的 /metrics 端点拉取指标数据
  prometheus:
    image: prom/prometheus:v2.51.0
    container_name: saas-prometheus
    restart: unless-stopped
    ports:
      - &quot;9090:9090&quot;
    volumes:
      - ../k8s/monitoring/prometheus.yaml:/etc/prometheus/prometheus.yml
      - prometheus_data:/prometheus
    command:
      - '--config.file=/etc/prometheus/prometheus.yml'
      - '--storage.tsdb.path=/prometheus'
      - '--storage.tsdb.retention.time=7d'        # 只保留7天数据（节省磁盘）
      - '--storage.tsdb.retention.size=1GB'        # 最多占用1GB磁盘
      - '--web.console.libraries=/etc/prometheus/console_libraries'
      - '--web.console.templates=/etc/prometheus/consoles'
      - '--web.enable-lifecycle'
    deploy:
      resources:
        limits:
          cpus: '0.5'
          memory: 512M
        reservations:
          cpus: '0.1'
          memory: 128M
    networks:
      - saas-network

# ==================== Grafana 可视化 ====================
  # Grafana 是最流行的监控可视化平台
  # 它从 Prometheus 读取指标数据，展示成漂亮的仪表盘
  grafana:
    image: grafana/grafana:10.4.0
    container_name: saas-grafana
    restart: unless-stopped
    ports:
      - &quot;3000:3000&quot;
    environment:
      - GF_SECURITY_ADMIN_USER=admin
      - GF_SECURITY_ADMIN_PASSWORD=admin
      - GF_USERS_ALLOW_SIGN_UP=false
    volumes:
      - grafana_data:/var/lib/grafana
      - ../k8s/monitoring/grafana-datasource.yaml:/etc/grafana/provisioning/datasources/datasource.yaml
    depends_on:
      - prometheus
    deploy:
      resources:
        limits:
          cpus: '0.5'
          memory: 256M
        reservations:
          cpus: '0.1'
          memory: 64M
    networks:
      - saas-network

# ==================== 持久化存储 ====================
volumes:
  postgres_data:
  redis_data:
  prometheus_data:
  grafana_data:

# ==================== 网络 ====================
networks:
  saas-network:
    driver: bridge

</code></pre>
<p><p></p></div></div></div></p>
<hr />
<h2><a id="content-整体架构概览" href="#content-整体架构概览" class="heading-permalink" aria-hidden="true" title="Permalink"></a>整体架构概览</h2>
<p>本项目用 Docker Compose 一键启动完整的本地开发环境，包含 5 个服务：</p>
<pre><code>┌──────────────────────────────────────────────────────────────┐
│                      saas-network (bridge)                    │
│                                                              │
│  ┌──────────┐    ┌──────────┐    ┌──────────┐               │
│  │ Postgres │    │  Redis   │    │Prometheus│               │
│  │  :5432   │    │  :6379   │    │  :9090   │               │
│  │  512M    │    │  128M    │    │  512M    │               │
│  └────┬─────┘    └────┬─────┘    └────┬─────┘               │
│       │               │               │                      │
│       │ depends_on    │ depends_on    │ depends_on           │
│       │ (healthy)     │ (healthy)     │                      │
│       ▼               ▼               ▼                      │
│  ┌─────────────────────────┐    ┌──────────┐                │
│  │          App            │    │ Grafana  │                │
│  │        :8080            │    │  :3000   │                │
│  │         128M            │    │  256M    │                │
│  └─────────────────────────┘    └──────────┘                │
│                                                              │
│  总计 ≈ 1.5GB，预留 ~2.5GB 给系统和 Docker 引擎               │
└──────────────────────────────────────────────────────────────┘
</code></pre>
<table>
<thead>
<tr>
<th>组件</th>
<th>镜像</th>
<th>端口</th>
<th>内存限制</th>
<th>角色</th>
</tr>
</thead>
<tbody>
<tr>
<td>App</td>
<td>本地构建</td>
<td>8080</td>
<td>128M</td>
<td>Go 应用服务</td>
</tr>
<tr>
<td>PostgreSQL</td>
<td>postgres:16-alpine</td>
<td>5432</td>
<td>512M</td>
<td>关系型数据库</td>
</tr>
<tr>
<td>Redis</td>
<td>redis:7-alpine</td>
<td>6379</td>
<td>128M</td>
<td>缓存</td>
</tr>
<tr>
<td>Prometheus</td>
<td>prom/prometheus:v2.51.0</td>
<td>9090</td>
<td>512M</td>
<td>指标采集</td>
</tr>
<tr>
<td>Grafana</td>
<td>grafana/grafana:10.4.0</td>
<td>3000</td>
<td>256M</td>
<td>监控可视化</td>
</tr>
</tbody>
</table>
<hr />
<h2><a id="content-应用服务app" href="#content-应用服务app" class="heading-permalink" aria-hidden="true" title="Permalink"></a>应用服务（App）</h2>
<pre><code class="language-yaml">app:
  build:
    context: ../../
    dockerfile: deploy/docker/Dockerfile
  container_name: saas-shortener
  ports:
    - &quot;8080:8080&quot;
  environment:
    - APP_ENV=development
    - SERVER_PORT=8080
    - DB_HOST=postgres
    - DB_PORT=5432
    - DB_USER=postgres
    - DB_PASSWORD=postgres
    - DB_NAME=saas_shortener
    - DB_SSLMODE=disable
    - REDIS_ADDR=redis:6379
    - REDIS_PASSWORD=
    - TENANT_DEFAULT_RATE_LIMIT=100
    - TENANT_MAX_URLS=1000
    - GOMAXPROCS=2
    - GOMEMLIMIT=100MiB
  depends_on:
    postgres:
      condition: service_healthy
    redis:
      condition: service_healthy
  restart: unless-stopped
</code></pre>
<h3><a id="content-关键配置解读" href="#content-关键配置解读" class="heading-permalink" aria-hidden="true" title="Permalink"></a>关键配置解读</h3>
<p><strong><code>build</code> — 多阶段构建</strong></p>
<ul>
<li><code>context: ../../</code> 表示构建上下文是项目根目录</li>
<li><code>dockerfile</code> 指向 <code>deploy/docker/Dockerfile</code>，使用多阶段构建生成轻量镜像</li>
</ul>
<p><strong><code>environment</code> — 12-Factor App 原则</strong></p>
<p>所有配置通过环境变量注入，而不是硬编码在代码中。好处：</p>
<ul>
<li>同一份代码可以在开发、测试、生产环境运行，只需切换环境变量</li>
<li>敏感信息（密码等）不进入代码仓库</li>
<li><code>DB_HOST=postgres</code> 这里直接写的是<strong>服务名</strong>，Docker 网络会自动把它解析为对应容器的 IP</li>
</ul>
<p><strong>Go 运行时优化（小内存服务器）</strong></p>
<table>
<thead>
<tr>
<th>变量</th>
<th>值</th>
<th>作用</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>GOMAXPROCS</code></td>
<td>2</td>
<td>限制 Go 使用的 CPU 核心数，避免过度调度</td>
</tr>
<tr>
<td><code>GOMEMLIMIT</code></td>
<td>100MiB</td>
<td>Go 1.19+ 软内存上限，GC 会在接近此值时更积极回收</td>
</tr>
</tbody>
</table>
<blockquote>
<p>在容器环境中，Go 默认会检测到宿主机的全部 CPU 核心，而不是容器限制的核心数。手动设置 <code>GOMAXPROCS</code> 可以避免不必要的线程开销。</p>
</blockquote>
<p><strong><code>depends_on</code> + <code>condition: service_healthy</code></strong></p>
<pre><code class="language-yaml">depends_on:
  postgres:
    condition: service_healthy
  redis:
    condition: service_healthy
</code></pre>
<p>这确保了<strong>启动顺序</strong>：App 会等 PostgreSQL 和 Redis 的健康检查通过后才启动。如果只写 <code>depends_on: [postgres]</code>，Docker 只保证容器启动，不保证服务就绪——数据库可能还在初始化，App 连接就会失败。</p>
<p><strong><code>restart: unless-stopped</code></strong></p>
<table>
<thead>
<tr>
<th>策略</th>
<th>行为</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>no</code></td>
<td>默认，不自动重启</td>
</tr>
<tr>
<td><code>always</code></td>
<td>总是重启，包括手动停止后重启 Docker 也会拉起</td>
</tr>
<tr>
<td><code>unless-stopped</code></td>
<td>自动重启，但手动 <code>docker stop</code> 后不会再拉起</td>
</tr>
<tr>
<td><code>on-failure</code></td>
<td>只在非零退出码时重启</td>
</tr>
</tbody>
</table>
<p><code>unless-stopped</code> 是开发环境的最佳选择——崩溃自动恢复，手动停止时不纠缠。</p>
<hr />
<h2><a id="content-资源限制deployresources" href="#content-资源限制deployresources" class="heading-permalink" aria-hidden="true" title="Permalink"></a>资源限制（deploy.resources）</h2>
<pre><code class="language-yaml">deploy:
  resources:
    limits:
      cpus: '0.5'
      memory: 128M
    reservations:
      cpus: '0.1'
      memory: 32M
</code></pre>
<h3><a id="content-limits-vs-reservations" href="#content-limits-vs-reservations" class="heading-permalink" aria-hidden="true" title="Permalink"></a>limits vs reservations</h3>
<table>
<thead>
<tr>
<th>配置</th>
<th>含义</th>
<th>类比</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>limits</code></td>
<td><strong>硬上限</strong>，超过会被 OOM Kill 或 CPU 节流</td>
<td>信用卡额度</td>
</tr>
<tr>
<td><code>reservations</code></td>
<td><strong>预留资源</strong>，Docker 保证至少分配这么多</td>
<td>银行保底余额</td>
</tr>
</tbody>
</table>
<h3><a id="content-为什么要设置资源限制" href="#content-为什么要设置资源限制" class="heading-permalink" aria-hidden="true" title="Permalink"></a>为什么要设置资源限制？</h3>
<p>在 2C4G 的小服务器上，如果不限制：</p>
<ul>
<li>一个容器内存泄漏可能吃掉所有内存，导致其他容器被 OOM Kill</li>
<li>一个服务 CPU 飙升可能饿死其他服务</li>
</ul>
<p>设置限制后，<strong>每个容器都在自己的&quot;格子&quot;里运行</strong>，互不影响。</p>
<h3><a id="content-资源规划" href="#content-资源规划" class="heading-permalink" aria-hidden="true" title="Permalink"></a>资源规划</h3>
<table>
<thead>
<tr>
<th>组件</th>
<th>CPU 限制</th>
<th>内存限制</th>
<th>CPU 预留</th>
<th>内存预留</th>
</tr>
</thead>
<tbody>
<tr>
<td>App</td>
<td>0.5 核</td>
<td>128M</td>
<td>0.1 核</td>
<td>32M</td>
</tr>
<tr>
<td>PostgreSQL</td>
<td>0.8 核</td>
<td>512M</td>
<td>0.2 核</td>
<td>128M</td>
</tr>
<tr>
<td>Redis</td>
<td>0.3 核</td>
<td>128M</td>
<td>0.1 核</td>
<td>32M</td>
</tr>
<tr>
<td>Prometheus</td>
<td>0.5 核</td>
<td>512M</td>
<td>0.1 核</td>
<td>128M</td>
</tr>
<tr>
<td>Grafana</td>
<td>0.5 核</td>
<td>256M</td>
<td>0.1 核</td>
<td>64M</td>
</tr>
<tr>
<td><strong>合计</strong></td>
<td><strong>2.6 核</strong></td>
<td><strong>1.5 GB</strong></td>
<td><strong>0.6 核</strong></td>
<td><strong>384M</strong></td>
</tr>
</tbody>
</table>
<blockquote>
<p>limits 合计可以超过物理资源（超卖），因为不是每个服务都同时满载。reservations 合计不应超过物理资源。</p>
</blockquote>
<hr />
<h2><a id="content-postgresql-数据库" href="#content-postgresql-数据库" class="heading-permalink" aria-hidden="true" title="Permalink"></a>PostgreSQL 数据库</h2>
<pre><code class="language-yaml">postgres:
  image: postgres:16-alpine
  command:
    - &quot;postgres&quot;
    - &quot;-c&quot;
    - &quot;shared_buffers=128MB&quot;
    - &quot;-c&quot;
    - &quot;effective_cache_size=256MB&quot;
    - &quot;-c&quot;
    - &quot;work_mem=4MB&quot;
    - &quot;-c&quot;
    - &quot;max_connections=50&quot;
</code></pre>
<h3><a id="content-内存参数调优" href="#content-内存参数调优" class="heading-permalink" aria-hidden="true" title="Permalink"></a>内存参数调优</h3>
<table>
<thead>
<tr>
<th>参数</th>
<th>值</th>
<th>说明</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>shared_buffers</code></td>
<td>128MB</td>
<td>PostgreSQL 用于缓存数据页的共享内存，通常设为可用内存的 25%</td>
</tr>
<tr>
<td><code>effective_cache_size</code></td>
<td>256MB</td>
<td>告诉查询优化器&quot;系统总共有多少缓存可用&quot;，影响查询计划选择</td>
</tr>
<tr>
<td><code>work_mem</code></td>
<td>4MB</td>
<td>单个排序/哈希操作可用内存，注意是<strong>每个操作</strong>，并发高时实际占用 = work_mem × 并发数</td>
</tr>
<tr>
<td><code>max_connections</code></td>
<td>50</td>
<td>最大连接数，默认 100 太多，每个连接占用 ~5-10MB 内存</td>
</tr>
</tbody>
</table>
<blockquote>
<p><code>shared_buffers</code> 和 <code>effective_cache_size</code> 的区别：前者是 PostgreSQL 自己管理的缓存，后者是告诉优化器&quot;操作系统文件缓存 + shared_buffers 一共有多少&quot;，帮助它决定用索引扫描还是全表扫描。</p>
</blockquote>
<hr />
<h2><a id="content-redis-缓存" href="#content-redis-缓存" class="heading-permalink" aria-hidden="true" title="Permalink"></a>Redis 缓存</h2>
<pre><code class="language-yaml">redis:
  image: redis:7-alpine
  command: redis-server --maxmemory 64mb --maxmemory-policy allkeys-lru
</code></pre>
<h3><a id="content-内存淘汰策略" href="#content-内存淘汰策略" class="heading-permalink" aria-hidden="true" title="Permalink"></a>内存淘汰策略</h3>
<table>
<thead>
<tr>
<th>参数</th>
<th>值</th>
<th>说明</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>--maxmemory</code></td>
<td>64mb</td>
<td>Redis 最大使用内存</td>
</tr>
<tr>
<td><code>--maxmemory-policy</code></td>
<td>allkeys-lru</td>
<td>内存满时的淘汰策略</td>
</tr>
</tbody>
</table>
<p>常见淘汰策略对比：</p>
<table>
<thead>
<tr>
<th>策略</th>
<th>行为</th>
<th>适用场景</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>noeviction</code></td>
<td>内存满了直接报错</td>
<td>不允许丢数据</td>
</tr>
<tr>
<td><code>allkeys-lru</code></td>
<td>淘汰<strong>所有 key</strong> 中最近最少使用的</td>
<td><strong>通用缓存（本项目选用）</strong></td>
</tr>
<tr>
<td><code>volatile-lru</code></td>
<td>只淘汰设置了过期时间的 key 中最少使用的</td>
<td>部分 key 需要永久保留</td>
</tr>
<tr>
<td><code>allkeys-random</code></td>
<td>随机淘汰</td>
<td>无明显访问模式</td>
</tr>
</tbody>
</table>
<p><code>allkeys-lru</code> 是缓存场景的最佳选择——总是保留最&quot;热&quot;的数据，冷数据自动淘汰。</p>
<hr />
<h2><a id="content-prometheus-监控" href="#content-prometheus-监控" class="heading-permalink" aria-hidden="true" title="Permalink"></a>Prometheus 监控</h2>
<pre><code class="language-yaml">prometheus:
  image: prom/prometheus:v2.51.0
  volumes:
    - ../k8s/monitoring/prometheus.yaml:/etc/prometheus/prometheus.yml
    - prometheus_data:/prometheus
  command:
    - '--config.file=/etc/prometheus/prometheus.yml'
    - '--storage.tsdb.path=/prometheus'
    - '--storage.tsdb.retention.time=7d'
    - '--storage.tsdb.retention.size=1GB'
    - '--web.enable-lifecycle'
</code></pre>
<h3><a id="content-存储参数说明" href="#content-存储参数说明" class="heading-permalink" aria-hidden="true" title="Permalink"></a>存储参数说明</h3>
<table>
<thead>
<tr>
<th>参数</th>
<th>值</th>
<th>说明</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>storage.tsdb.retention.time</code></td>
<td>7d</td>
<td>时序数据只保留 7 天</td>
</tr>
<tr>
<td><code>storage.tsdb.retention.size</code></td>
<td>1GB</td>
<td>最多占用 1GB 磁盘</td>
</tr>
<tr>
<td><code>web.enable-lifecycle</code></td>
<td>-</td>
<td>允许通过 HTTP API 热重载配置</td>
</tr>
</tbody>
</table>
<blockquote>
<p>Prometheus 采用 <strong>Pull 模式</strong>：它主动定期访问应用的 <code>/metrics</code> 端点拉取指标，而不是应用主动推送。这样应用不需要知道监控系统的存在，解耦更彻底。</p>
</blockquote>
<hr />
<h2><a id="content-grafana-可视化" href="#content-grafana-可视化" class="heading-permalink" aria-hidden="true" title="Permalink"></a>Grafana 可视化</h2>
<pre><code class="language-yaml">grafana:
  image: grafana/grafana:10.4.0
  environment:
    - GF_SECURITY_ADMIN_USER=admin
    - GF_SECURITY_ADMIN_PASSWORD=admin
    - GF_USERS_ALLOW_SIGN_UP=false
  volumes:
    - grafana_data:/var/lib/grafana
    - ../k8s/monitoring/grafana-datasource.yaml:/etc/grafana/provisioning/datasources/datasource.yaml
  depends_on:
    - prometheus
</code></pre>
<ul>
<li><code>GF_USERS_ALLOW_SIGN_UP=false</code>：禁止自助注册，开发环境用 admin/admin 登录即可</li>
<li>数据源配置通过文件挂载自动导入，启动即连接 Prometheus，无需手动配置</li>
<li><code>depends_on: prometheus</code> 保证 Grafana 在 Prometheus 之后启动</li>
</ul>
<hr />
<h2><a id="content-网络networks" href="#content-网络networks" class="heading-permalink" aria-hidden="true" title="Permalink"></a>网络（Networks）</h2>
<pre><code class="language-yaml">networks:
  saas-network:
    driver: bridge
</code></pre>
<h3><a id="content-各部分含义" href="#content-各部分含义" class="heading-permalink" aria-hidden="true" title="Permalink"></a>各部分含义</h3>
<ul>
<li><strong><code>networks:</code></strong> — 定义自定义网络，让容器间安全通信</li>
<li><strong><code>saas-network</code></strong> — 网络名称，仅在此 Docker Compose 项目内可见</li>
<li><strong><code>driver: bridge</code></strong> — 桥接网络驱动，Docker 默认的网络类型，提供容器间隔离通信、端口映射和 DNS 解析</li>
</ul>
<h3><a id="content-服务间通信示例" href="#content-服务间通信示例" class="heading-permalink" aria-hidden="true" title="Permalink"></a>服务间通信示例</h3>
<pre><code class="language-yaml">services:
  app:
    networks:
      - saas-network

postgres:
    networks:
      - saas-network
</code></pre>
<p>这样配置后：</p>
<ul>
<li>App 服务可以通过 <code>postgres</code> 主机名访问数据库（<code>DB_HOST=postgres</code>）</li>
<li>服务间通过<strong>服务名</strong>互相访问，Docker 内置 DNS 自动解析为容器 IP</li>
<li>外部无法直接访问这些服务（除非通过 <code>ports</code> 明确暴露）</li>
</ul>
<p><img src="https://blog.shpym.cn/usr/uploads/2026/02/2567232500.png" alt="2026-02-24T16:39:18.png" loading="lazy"  style=""></p>
<h3><a id="content-docker-网络自动分配机制" href="#content-docker-网络自动分配机制" class="heading-permalink" aria-hidden="true" title="Permalink"></a>Docker 网络自动分配机制</h3>
<p>Docker Engine 自动处理以下过程：</p>
<ol>
<li>创建 <code>saas-network</code> 网络（通常分配 <code>172.x.x.0/16</code> 网段）</li>
<li>每个容器加入网络时，自动分配唯一 IP 地址</li>
<li>建立<strong>容器名 → IP</strong> 的 DNS 映射</li>
</ol>
<pre><code class="language-bash"># 查看容器实际分配的 IP
docker exec saas-postgres hostname -i
</code></pre>
<p><strong>为什么不需要手动指定 IP？</strong></p>
<ul>
<li><strong>动态分配</strong>：IP 地址根据可用范围自动分配，避免冲突</li>
<li><strong>DNS 解析</strong>：通过服务名 <code>postgres</code> 自动解析，不依赖具体 IP</li>
<li>这就是为什么配置中写 <code>DB_HOST=postgres</code> 而不是具体 IP 地址——Docker 网络系统自动处理底层细节</li>
</ul>
<hr />
<h2><a id="content-健康检查healthcheck" href="#content-健康检查healthcheck" class="heading-permalink" aria-hidden="true" title="Permalink"></a>健康检查（Healthcheck）</h2>
<pre><code class="language-yaml">healthcheck:
  test: [&quot;CMD-SHELL&quot;, &quot;pg_isready -U postgres&quot;]
  interval: 5s
  timeout: 5s
  retries: 5
</code></pre>
<h3><a id="content-参数详解" href="#content-参数详解" class="heading-permalink" aria-hidden="true" title="Permalink"></a>参数详解</h3>
<table>
<thead>
<tr>
<th>参数</th>
<th>值</th>
<th>含义</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>test</code></td>
<td><code>pg_isready -U postgres</code></td>
<td>PostgreSQL 内置检查工具，验证数据库是否接受连接</td>
</tr>
<tr>
<td><code>interval</code></td>
<td>5s</td>
<td>每 5 秒执行一次检查</td>
</tr>
<tr>
<td><code>timeout</code></td>
<td>5s</td>
<td>单次检查最多等待 5 秒</td>
</tr>
<tr>
<td><code>retries</code></td>
<td>5</td>
<td>连续失败 5 次才标记为 unhealthy</td>
</tr>
</tbody>
</table>
<h3><a id="content-各服务的健康检查方式" href="#content-各服务的健康检查方式" class="heading-permalink" aria-hidden="true" title="Permalink"></a>各服务的健康检查方式</h3>
<table>
<thead>
<tr>
<th>服务</th>
<th>检查命令</th>
<th>原理</th>
</tr>
</thead>
<tbody>
<tr>
<td>PostgreSQL</td>
<td><code>pg_isready -U postgres</code></td>
<td>专用工具检测数据库连接</td>
</tr>
<tr>
<td>Redis</td>
<td><code>redis-cli ping</code></td>
<td>发送 PING 命令，期望返回 PONG</td>
</tr>
<tr>
<td>App</td>
<td><code>wget --spider http://localhost:8080/healthz</code></td>
<td>HTTP 请求健康检查端点</td>
</tr>
</tbody>
</table>
<h3><a id="content-健康状态流转" href="#content-健康状态流转" class="heading-permalink" aria-hidden="true" title="Permalink"></a>健康状态流转</h3>
<pre><code>容器启动
   │
   ▼
starting ──(interval)──► 执行 test 命令
                              │
                    ┌─────────┴─────────┐
                    ▼                   ▼
                  成功               失败
                    │                   │
                    ▼                   ▼
               healthy          重试（最多 retries 次）
                                        │
                                        ▼
                                   unhealthy
</code></pre>
<h3><a id="content-为什么重要" href="#content-为什么重要" class="heading-permalink" aria-hidden="true" title="Permalink"></a>为什么重要？</h3>
<ul>
<li><strong>服务依赖管理</strong>：<code>depends_on + condition: service_healthy</code> 确保数据库就绪后再启动 App</li>
<li><strong>故障自动恢复</strong>：配合 <code>restart: unless-stopped</code>，不健康的容器会被自动重启</li>
<li><strong>避免级联故障</strong>：防止 App 向还未就绪的数据库发送请求导致启动失败</li>
</ul>
<hr />
<h2><a id="content-持久化存储volumes" href="#content-持久化存储volumes" class="heading-permalink" aria-hidden="true" title="Permalink"></a>持久化存储（Volumes）</h2>
<pre><code class="language-yaml">volumes:
  postgres_data:
  redis_data:
  prometheus_data:
  grafana_data:
</code></pre>
<h3><a id="content-命名卷-vs-匿名卷" href="#content-命名卷-vs-匿名卷" class="heading-permalink" aria-hidden="true" title="Permalink"></a>命名卷 vs 匿名卷</h3>
<p><strong>命名卷（Named Volumes）— 本项目使用的方式：</strong></p>
<pre><code class="language-yaml">volumes:
  postgres_data:                    # 顶层声明

services:
  postgres:
    volumes:
      - postgres_data:/var/lib/postgresql/data  # 引用命名卷
</code></pre>
<p><strong>匿名卷（Anonymous Volumes）：</strong></p>
<pre><code class="language-yaml">services:
  app:
    volumes:
      - /app/data    # 只有容器内路径，没有名称
</code></pre>
<p>两者对比：</p>
<table>
<thead>
<tr>
<th>特性</th>
<th>命名卷</th>
<th>匿名卷</th>
</tr>
</thead>
<tbody>
<tr>
<td>有明确名称</td>
<td>✅</td>
<td>❌</td>
</tr>
<tr>
<td><code>docker-compose down</code> 时保留</td>
<td>✅ 默认保留</td>
<td>❌ 默认删除</td>
</tr>
<tr>
<td>易于管理和备份</td>
<td>✅</td>
<td>❌</td>
</tr>
<tr>
<td>可在多个服务间共享</td>
<td>✅</td>
<td>❌</td>
</tr>
</tbody>
</table>
<h3><a id="content-各服务的持久化内容" href="#content-各服务的持久化内容" class="heading-permalink" aria-hidden="true" title="Permalink"></a>各服务的持久化内容</h3>
<table>
<thead>
<tr>
<th>卷名</th>
<th>挂载路径</th>
<th>存储内容</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>postgres_data</code></td>
<td><code>/var/lib/postgresql/data</code></td>
<td>数据库文件</td>
</tr>
<tr>
<td><code>redis_data</code></td>
<td><code>/data</code></td>
<td>Redis RDB/AOF 持久化文件</td>
</tr>
<tr>
<td><code>prometheus_data</code></td>
<td><code>/prometheus</code></td>
<td>时序指标数据</td>
</tr>
<tr>
<td><code>grafana_data</code></td>
<td><code>/var/lib/grafana</code></td>
<td>仪表盘配置、用户数据</td>
</tr>
</tbody>
</table>
<h3><a id="content-查看和管理卷" href="#content-查看和管理卷" class="heading-permalink" aria-hidden="true" title="Permalink"></a>查看和管理卷</h3>
<pre><code class="language-bash"># 查看所有命名卷
docker volume ls

# 查看某个卷的详细信息（存储位置等）
docker volume inspect saas-shortener_postgres_data
</code></pre>
<hr />
<h2><a id="content-停止-docker-compose-环境" href="#content-停止-docker-compose-环境" class="heading-permalink" aria-hidden="true" title="Permalink"></a>停止 Docker Compose 环境</h2>
<pre><code class="language-makefile">.PHONY: docker-down
docker-down:
	docker compose -f $(DOCKER_COMPOSE_LOCAL) down
</code></pre>
<h3><a id="content-停止过程" href="#content-停止过程" class="heading-permalink" aria-hidden="true" title="Permalink"></a>停止过程</h3>
<ol>
<li>向所有容器发送 <code>SIGTERM</code> 信号（通知优雅退出）</li>
<li>等待超时时间（默认 10 秒）</li>
<li>对仍在运行的容器发送 <code>SIGKILL</code> 强制终止</li>
</ol>
<blockquote>
<p>注意：所有容器<strong>几乎同时</strong>收到停止信号，不会按照 <code>depends_on</code> 的反向顺序停止。</p>
</blockquote>
<h3><a id="content-自定义优雅停止时间" href="#content-自定义优雅停止时间" class="heading-permalink" aria-hidden="true" title="Permalink"></a>自定义优雅停止时间</h3>
<pre><code class="language-yaml">services:
  app:
    stop_grace_period: 30s    # 给应用更多时间处理完当前请求
</code></pre>
<h3><a id="content-控制删除行为" href="#content-控制删除行为" class="heading-permalink" aria-hidden="true" title="Permalink"></a>控制删除行为</h3>
<pre><code class="language-bash"># 默认：停止容器、移除网络，保留命名卷
docker-compose down

# 删除所有卷（包括命名卷，数据库数据会丢失！）
docker-compose down -v

# 移除孤立容器（Compose 文件中已删除但仍在运行的服务）
docker-compose down --remove-orphans
</code></pre>
<blockquote>
<p>⚠️ <strong>生产环境慎用 <code>-v</code> 参数</strong>，它会删除包括数据库在内的所有数据！</p>
</blockquote>

Docker Compose 配置详解

源码解析

整体架构概览

应用服务（App）

关键配置解读

资源限制（deploy.resources）

limits vs reservations

为什么要设置资源限制？

资源规划

PostgreSQL 数据库

内存参数调优

Redis 缓存

内存淘汰策略

Prometheus 监控

存储参数说明

Grafana 可视化

网络（Networks）

各部分含义

服务间通信示例

Docker 网络自动分配机制

健康检查（Healthcheck）

参数详解

各服务的健康检查方式

健康状态流转

为什么重要？

持久化存储（Volumes）

命名卷 vs 匿名卷

各服务的持久化内容

查看和管理卷

停止 Docker Compose 环境

停止过程

自定义优雅停止时间

控制删除行为

发表评论 取消回复 使用cookie技术保留您的个人信息以便您下次快速评论，继续评论表示您已同意该条款

Docker Compose 配置详解

发表评论取消回复
使用cookie技术保留您的个人信息以便您下次快速评论，继续评论表示您已同意该条款