H0-14 P1 | RAG Chat SSE 流式输出 + DeepSeek V4 Pro 思考过程展示 #71
Closed
opened 2026-06-06 14:41:33 +08:00 by wangdl
·
2 comments
Labels
Clear labels
area:activity
活动/统计
area:admin
管理后台
area:admin-api
area:ai
AI/RAG
area:ai-runtime
AI Runtime / AI 分析体系相关
area:analytics
area:api
API 接口
area:auth
认证与授权
area:cos
对象存储
area:database
数据库/Migration
area:import
文件导入/解析
area:knowledge
知识库/知识点
area:learning-info
area:learning-session
area:quiz
测验/自测
area:reading-event
area:reading-progress
area:review
复习系统
area:security
安全相关
audit:api-admin-info
audit:api-info
audit:planned
已完成宏观规划,尚未代码审查
audit:reviewed
blocked-by:api-info-aggregation
blocked-by:api-info-core
blocked-by:api-info-ops
blocked-by:api-info-schema
blocked-by:processor
blocked-by:schema
priority:p0
最高优先级,阻塞发布
priority:p1
高优先级,里程碑必需
priority:p2
中优先级,后续版本
repo:api
API 仓库 Issue
status:blocked
被阻塞
status:done
已完成
status:partial
status:todo
type:aggregation
type:bug
缺陷修复
type:design
设计
type:docs
文档
type:feature
新功能
type:migration
type:refactor
重构
type:test
work:admin-api
work:aggregation
work:api
work:artifact
题目/卡片产物
work:audit
work:circuit-breaker
熔断
work:contract
work:design
架构/协议设计工作
work:docs
work:export
work:extend-existing
work:internal-api
Runtime 内部接口
work:job
Job 调度相关
work:new-module
work:new-table
work:ops
work:query
work:quota
额度/限流
work:schema
Prisma Schema 设计
work:security
work:service
Service 层实现
work:snapshot
Snapshot 构建
work:test
No Label
Milestone
No items
No Milestone
Projects
Clear projects
No project
No Assignees
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: wangdl/api-server#71
Reference in New Issue
Block a user
Blocking a user prevents them from interacting with repositories, such as opening or commenting on pull requests or issues. Learn more about blocking a user.
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
背景
当前 AI 对话使用同步
fetch()一次性取结果,response_format: json_object模式下 DeepSeek V4 Pro 不输出reasoning_content(思考过程)。iOS 端全链路阻塞等待,无流式效果。需求
后端
DeepSeekProvider新增generateStream():stream: true,去掉response_format,ReadableStream 逐 chunk yieldAiProvider接口新增generateStream()方法AiGatewayService新增generateStream():透传 provider stream,流结束后汇总 usageRagChatService新增sendMessageStream():调 gateway stream,保存最终 AI 回复到 DBPOST /api/rag-chat/sessions/:id/streamSSE endpoint数据流
关联
后端实现完成 (2026-06-06)
改动文件
ai-provider.interface.tsStreamChunk类型 +generateStream()方法deepseek.provider.tsgenerateStream():stream: true,ReadableStream 逐 chunk yield,reasoning_content → thinkingai-gateway.service.tsgenerateStream():透传 provider stream + 记录用量rag-chat.service.tssendMessageStream():流式调用 + 保存最终回复到 DBrag-chat.controller.tsPOST /api/rag-chat/sessions/:id/streamSSE endpointSSE 数据格式
待完成
iOS 端 SSE 接收 + 思考过程 UI (ios-projects #38)
后端完成 (2026-06-06)
新增端点
POST /api/rag-chat/sessions/:id/stream— SSE 流式输出改动文件
ai-provider.interface.ts— StreamChunk 类型 + generateStream()deepseek.provider.ts— stream:true,ReadableStream yield thinking/contentai-gateway.service.ts— generateStream() 透传 + 记录用量rag-chat.service.ts— sendMessageStream() 流式调用rag-chat.controller.ts— SSE endpoint状态
✅ 完成。