Milestone: M1: AI / RAG runtime and retrieval foundation (P0~P1)
Reference: wangdl/api-server#15
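Goal
Building on the M0-08 AI Gateway base version, deepen the AI gateway capabilities: implement basic primary/backup fallback, Admin model route management, JSON Schema validation, and a closed loop for cost logging.
This issue covers only the deepening design; the base capabilities (unified call wrapper, Prompt management, token accounting, retries, timeouts) were completed in M0-08.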
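Background
M0-08 established the unified call framework of the AI Gateway. This phase adds production-grade capabilities on top of it: automatic fallback to a backup model when the primary DeepSeek model is unavailable, model routing rules that Admins can manage from a page, and JSON Schema validation of AI output to ensure the format is correct.
Note: this phase implements only basic primary/backup fallback, not a production-grade automatic circuit-breaker system (complex circuit-breaker windows, automatic health scoring, and dynamic weighted scheduling are out of scope for M1).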
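The basic primary/backup fallback described above could be sketched roughly as follows. This is a minimal illustration, not the code in ai-gateway.service.ts: the names `ModelCall`, `FallbackResult`, and `callWithFallback` are hypothetical, and the sketch assumes that any error thrown by the primary call counts as "unavailable".

```typescript
// Hypothetical types for illustration; none of these names come from the issue.
type ModelCall<T> = (prompt: string) => Promise<T>;

interface FallbackResult<T> {
  output: T;
  servedBy: "primary" | "backup";
  // Set only when the primary failed and the backup answered.
  primaryError?: string;
}

// Try the primary model once; on any error, try the backup once.
// There is no circuit-breaker window, health score, or weighting here,
// which matches the "basic fallback only" scope stated above.
async function callWithFallback<T>(
  prompt: string,
  primary: ModelCall<T>,
  backup: ModelCall<T>,
): Promise<FallbackResult<T>> {
  try {
    return { output: await primary(prompt), servedBy: "primary" };
  } catch (err) {
    const primaryError = err instanceof Error ? err.message : String(err);
    // If the backup also throws, the error propagates to the caller.
    return { output: await backup(prompt), servedBy: "backup", primaryError };
  }
}
```

The `primaryError` field is one place where a fallback event could be recorded, under the assumption that the caller is responsible for persisting it.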
Module enhancements
Model fallback strategy:
Admin model route management:
JSON Schema validation:
Cost log closed loop:
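As an illustration of the JSON Schema validation item above, the sketch below checks a model's raw text output against a small schema before it is accepted. It is a simplified stand-in, assuming only the `type`, `required`, and `properties` keywords; a real gateway would more likely use a full JSON Schema validator library. The names `MiniSchema`, `validate`, and `parseModelOutput`, and the `question` field in the usage note, are hypothetical.

```typescript
// Hypothetical, heavily reduced subset of JSON Schema for illustration only.
type JsonType = "string" | "number" | "boolean" | "object" | "array";

interface MiniSchema {
  type: JsonType;
  required?: string[];
  properties?: Record<string, MiniSchema>;
}

// Map a parsed JSON value to its JSON type name.
function jsonTypeOf(value: unknown): string {
  if (value === null) return "null";
  if (Array.isArray(value)) return "array";
  return typeof value;
}

// Return a list of human-readable errors; an empty list means the value is valid.
function validate(value: unknown, schema: MiniSchema, path = "$"): string[] {
  const actual = jsonTypeOf(value);
  if (actual !== schema.type) {
    return [`${path}: expected ${schema.type}, got ${actual}`];
  }
  const errors: string[] = [];
  if (schema.type === "object") {
    const obj = value as Record<string, unknown>;
    for (const key of schema.required ?? []) {
      if (!(key in obj)) errors.push(`${path}.${key}: missing required property`);
    }
    for (const [key, sub] of Object.entries(schema.properties ?? {})) {
      if (key in obj) errors.push(...validate(obj[key], sub, `${path}.${key}`));
    }
  }
  return errors;
}

// Parse the raw model output and validate it in one step.
function parseModelOutput(
  raw: string,
  schema: MiniSchema,
): { ok: boolean; errors: string[]; data?: unknown } {
  let data: unknown;
  try {
    data = JSON.parse(raw);
  } catch (_err) {
    return { ok: false, errors: ["$: not valid JSON"] };
  }
  const errors = validate(data, schema);
  return errors.length === 0 ? { ok: true, errors, data } : { ok: false, errors };
}
```

For example, with a schema of `{ type: "object", required: ["question"], properties: { question: { type: "string" } } }`, the output `{"question":1}` is rejected with the error `$.question: expected string, got number`.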
Infrastructure dependency changes
New dependencies compared with the M0-08 base version:
Interface design (new parts)
AAPI additions/enhancements:
Domain Event (new)
Delivery checklist
Acceptance criteria
Prohibited items
Not recommended for the current phase
✅ M1-01 implementation complete
Deliverables
prisma/schema.prisma (ModelRoute/ProviderConfig/FallbackEvent)
20260524000000_add_model_route
src/modules/ai/model-router.ts: loadFromDb() hot reload
src/modules/ai/gateway/ai-gateway.service.ts
src/modules/ai/gateway/ai-gateway.service.ts
src/modules/ai/ai.controller.ts
E2E tests
All 8 tests pass (test/m1.e2e-spec.ts, M1-01 section):
GET /admin-api/ai-gateway/status
GET /admin-api/ai-gateway/routes
POST /admin-api/ai-gateway/routes
PUT /admin-api/ai-gateway/routes/:id
GET /admin-api/ai-gateway/providers
PUT /admin-api/ai-gateway/providers/:name
GET /admin-api/ai-gateway/fallback-events
Run
wangdl referenced this issue 2026-06-05 19:34:43 +08:00
wangdl referenced this issue 2026-06-05 19:36:08 +08:00
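Closed
The architecture design phase is complete. The concrete implementation has been delivered through issues in the subsequent M1-M7 milestones. This issue is retained as a design document.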