Go to file

User e145f1d97e feat(s2s-text): dedicated text-mode prompt + Markdown rendering

Architecture fix: voice and text mode now have completely separate prompts.

Backend:
- VoiceAssistantProfileSupport.buildTextSystemRole: dedicated text-mode system
  role that inherits all business rules (identity, KB-first, sensitive topics,
  sales guidance, personal info) but removes voice-specific constraints (short
  sentences, colloquial, single-line conclusion).
- DEFAULT_TEXT_SPEAKING_STYLE: text-specific style demanding detailed,
  structured, Markdown-formatted answers with complete information.
- VoiceGatewayService.handleStart: switch between voice/text system role and
  speaking style based on state.textMode.
- VoiceGatewayService.buildStartSessionPayload: preserve Markdown in text mode
  (voice mode still strips asterisks/backticks via normalizeTextForSpeech to
  avoid TTS pronouncing format chars).

Frontend:
- Added react-markdown@9 + remark-gfm@4 dependencies.
- ChatPanel renders assistant messages (non-voice) with ReactMarkdown:
  headings, lists (ul/ol), bold, italic, inline/block code, tables, blockquote,
  links, horizontal rules — all styled with Tailwind classes matching the dark
  theme.
- User messages and voice-handoff messages remain plain text.

Verification: mvn test VoiceGatewaySmokeTest 20/20 pass, vite build succeeds.

2026-04-17 10:10:20 +08:00

2026火山知识库用/2026火山知识库用

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

ai-knowledge-splitter @ 92e7fc5bda

Update code

2026-03-12 12:47:56 +08:00

codechat

fix: 品牌保护+知识库全量覆盖 - 6层防御解决传销问题 + 30+产品关键词补全

2026-03-17 11:00:09 +08:00

coze_api

Update code

2026-03-12 12:47:56 +08:00

coze_config

Update code

2026-03-12 12:47:56 +08:00

delivery/client

feat(s2s): add S2S text dialog via /ws/realtime-text + event 501 ChatTextQuery

2026-04-17 09:33:56 +08:00

dev-assistant-mcp

Update code

2026-03-12 12:47:56 +08:00

java-server/src

feat(s2s-text): dedicated text-mode prompt + Markdown rendering

2026-04-17 10:10:20 +08:00

mcp-server-ssh

fix(voice-kb): sync assistant profile and stabilize reply flow

2026-03-23 13:58:41 +08:00

parsers

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

realtime_dialog

feat: 添加realtime_dialog和realtime_dialog_external_rag_test项目，更新test2项目

2026-03-13 13:06:46 +08:00

realtime_dialog_external_rag_test

feat: 添加realtime_dialog和realtime_dialog_external_rag_test项目，更新test2项目

2026-03-13 13:06:46 +08:00

test2

feat(s2s-text): dedicated text-mode prompt + Markdown rendering

2026-04-17 10:10:20 +08:00

test_results

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

tests

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

.windsurfrules

Update code

2026-03-12 12:47:56 +08:00

agents_prompts.txt

Update code

2026-03-12 12:47:56 +08:00

api_client.py

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

batch.py

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

chunker.py

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

conftest.py

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

exceptions.py

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

main.py

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

models.py

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

OPTIMIZATION.md

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

PROJECT_STRUCTURE.md

Update code

2026-03-12 12:47:56 +08:00

prompts.py

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

README.md

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

requirements.txt

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

splitter.py

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

test_final.md

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

test_output.md

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

writer.py

Initial commit: AI 知识库文档智能分块工具

2026-03-02 17:38:28 +08:00

大沃_clean.txt

Update code

2026-03-12 12:47:56 +08:00

大沃_content.txt

Update code

2026-03-12 12:47:56 +08:00

大沃.doc

Update code

2026-03-12 12:47:56 +08:00

README.md

AI 知识库文档智能分块工具

将多种格式文档解析为文本，通过 DeepSeek API 进行语义级智能分块，输出为 Markdown 文件。

支持格式

PDF、Word (.docx)、Excel (.xlsx/.xls)、CSV、HTML、TXT/MD、图片 (PNG/JPG/BMP/GIF/WEBP)

安装

cd ai-knowledge-splitter
pip install -r requirements.txt

使用

python main.py <输入文件> -k <DeepSeek API Key> [-o 输出路径] [-d 分隔符]

示例：

# 基本用法（输出为同名 .md 文件）
python main.py report.pdf -k sk-xxxxxxxx

# 指定输出路径
python main.py data.docx -k sk-xxxxxxxx -o output/result.md

# 自定义分隔符
python main.py notes.txt -k sk-xxxxxxxx -d "==="

参数说明

参数	必需	说明
`input_file`	是	输入文件路径
`-k, --api-key`	是	DeepSeek API Key
`-o, --output`	否	输出文件路径（默认：同名 .md）
`-d, --delimiter`	否	分块分隔符（默认：`---`）

运行测试

cd ai-knowledge-splitter
pytest tests/ -v

README.md Unescape Escape

AI 知识库文档智能分块工具

支持格式

安装

使用

参数说明

运行测试

README.md