
Commit c445126

Fix YAML formatting error: escape the single quote in the title
1 parent 83b59cb commit c445126

3 files changed: 1 addition, 4 deletions

content/publications/step-audio-r1/index.md

Lines changed: 1 addition & 2 deletions
@@ -1,5 +1,5 @@
 ---
-title: 'Step-Audio R1: China's First Speech Reasoning Model'
+title: 'Step-Audio R1: China''s First Speech Reasoning Model'
 
 authors:
 - admin
@@ -24,4 +24,3 @@ pager: false
 **Role: Project Lead**
 
 Step-Audio R1 represents the Deepseek R1 moment for speech large models, creating China's first leading speech reasoning model with perception and reasoning capabilities that fully match Gemini 2.5 Pro. By integrating our proprietary Step MPS framework, we have achieved a world-first innovation: endowing the model with sophisticated reasoning capabilities and highly human-like interactive intelligence without adding any additional latency, truly realizing zero time gap between thinking and responding.
-
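
The title fix in this file relies on the YAML rule that, inside a single-quoted scalar, a literal single quote is written as two single quotes. A minimal sketch of that rule follows; it is not part of the commit, and the helper names `quote_yaml_single` and `unquote_yaml_single` are hypothetical, introduced only for illustration.

```python
def quote_yaml_single(value: str) -> str:
    """Wrap value in a YAML single-quoted scalar, doubling embedded quotes.

    Hypothetical helper for illustration; not part of the commit.
    """
    return "'" + value.replace("'", "''") + "'"


def unquote_yaml_single(scalar: str) -> str:
    """Undo quote_yaml_single for a well-formed single-quoted scalar."""
    if len(scalar) < 2 or scalar[0] != "'" or scalar[-1] != "'":
        raise ValueError("not a single-quoted scalar")
    # Inside the quotes, every pair of single quotes stands for one quote.
    return scalar[1:-1].replace("''", "'")


title = "Step-Audio R1: China's First Speech Reasoning Model"
quoted = quote_yaml_single(title)
print(f"title: {quoted}")
# prints: title: 'Step-Audio R1: China''s First Speech Reasoning Model'
```

Wrapping the title in double quotes would presumably also have worked, since an apostrophe needs no escaping there; the commit keeps the single-quoted style and doubles the quote instead.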

content/publications/step-editx/index.md

Lines changed: 0 additions & 1 deletion
@@ -24,4 +24,3 @@ pager: false
 **Role: Co-Project Lead**
 
 Step EditX is a groundbreaking next-generation speech editing model that completely transforms traditional tool-based audio post-processing into natural language instruction-based "conversational creation." Users can perform comprehensive intelligent editing of audio—from content to style, from emotion to coloring—simply through text prompts. Step EditX not only possesses powerful zero-shot TTS capabilities, over 14 types of emotion enhancement, and more than 30 style transfer options, but also features precise "one-click audio enhancement" functionality that intelligently repairs various audio imperfections and extracts target voices. Its most significant breakthrough lies in the model's deep understanding of text-level addition, deletion, and modification instructions, enabling context-aware speech regeneration that corrects content while perfectly preserving the speaker's timbre and prosody. This marks the first entry of speech editing into the era of true "semantic-level" intelligent operations.
-

content/publications/step-mps/index.md

Lines changed: 0 additions & 1 deletion
@@ -24,4 +24,3 @@ pager: false
 **Role: Project Lead**
 
 Step MPS (Mind-Paced Speaking) is a revolutionary brain-inspired proprietary framework designed to endow speech large models with truly human-like abilities to "think while speaking." Its core innovation lies in the "dual-brain" architecture: a "planning brain" responsible for high-level logical reasoning that guides a separate "expression brain" in real-time for fluent speech generation. This collaborative division of labor represents the world's first solution to the fundamental contradiction between complex "chain-of-thought" reasoning and real-time interaction. It achieves zero latency increase while maintaining virtually full reasoning accuracy, thereby granting the model genuine advanced logical intelligence and empathetic interactive capabilities, ultimately achieving seamless synchronization between thinking and expression.
-
