Skip to content

Commit 283e38e

Browse files
committed
Simplify publications layout: keep only Abstract and PDF button, add beautiful abstract cover images
1 parent 14497ba commit 283e38e

File tree

4 files changed

+65
-74
lines changed

4 files changed

+65
-74
lines changed

content/publications/chronological-thinking/index.md

Lines changed: 29 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -13,29 +13,39 @@ publication_short: arXiv
1313

1414
abstract: This work explores chronological thinking capabilities in full-duplex spoken dialogue language models, enabling more natural and coherent conversational interactions.
1515

16-
summary: Exploring chronological thinking in full-duplex spoken dialogue language models.
17-
18-
tags:
19-
- Audio LLM
20-
- Full-Duplex
21-
- Spoken Dialogue
22-
- Language Models
23-
2416
featured: true
2517

18+
# Featured image - 优美抽象画面封面建议:
19+
# Option 1: 彩色流体艺术(蓝紫金色)
20+
image:
21+
url: 'https://images.unsplash.com/photo-1541701494587-cb58502866ab?w=1200&q=80'
22+
caption: 'Abstract Art'
23+
# Option 2: 渐变水彩质感(粉紫色系)
24+
# image:
25+
# url: 'https://images.unsplash.com/photo-1557672172-298e090bd0f1?w=1200&q=80'
26+
# caption: 'Abstract Watercolor'
27+
# Option 3: 抽象光影纹理(暖色调)
28+
# image:
29+
# url: 'https://images.unsplash.com/photo-1550859492-d5da9d8e45f3?w=1200&q=80'
30+
# caption: 'Abstract Light'
31+
# Option 4: 流动色彩(多彩渐变)
32+
# image:
33+
# url: 'https://images.unsplash.com/photo-1506259091721-347e791bab0f?w=1200&q=80'
34+
# caption: 'Fluid Colors'
35+
# Option 5: 抽象烟雾质感(蓝橙色)
36+
# image:
37+
# url: 'https://images.unsplash.com/photo-1553356084-58ef4a67b2a7?w=1200&q=80'
38+
# caption: 'Abstract Smoke'
39+
2640
links:
2741
- type: pdf
28-
url: https://arxiv.org/abs/2510.05150
29-
- type: source
30-
url: https://arxiv.org/abs/2510.05150
42+
url: https://arxiv.org/pdf/2510.05150
3143

3244
url_pdf: 'https://arxiv.org/pdf/2510.05150'
33-
url_code: ''
34-
url_dataset: ''
35-
url_poster: ''
36-
url_project: ''
37-
url_slides: ''
38-
url_source: 'https://arxiv.org/abs/2510.05150'
39-
url_video: ''
40-
---
4145

46+
# Hide page metadata
47+
share: false
48+
show_date: false
49+
profile: false
50+
pager: false
51+
---

content/publications/step-audio-2/index.md

Lines changed: 12 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -13,29 +13,22 @@ publication_short: arXiv
1313

1414
abstract: Step-Audio 2 is the world's first industrial-grade end-to-end audio LLM with deep thinking capabilities, introducing Chain-of-Thought reasoning and audio reinforcement learning into speech models for the first time.
1515

16-
summary: The world's first industrial-grade end-to-end audio LLM with CoT reasoning and reinforcement learning.
17-
18-
tags:
19-
- Audio LLM
20-
- Chain-of-Thought
21-
- Reinforcement Learning
22-
- Speech Understanding
23-
2416
featured: true
2517

18+
# Featured image - 优美抽象画面
19+
image:
20+
url: 'https://images.unsplash.com/photo-1550859492-d5da9d8e45f3?w=1200&q=80'
21+
caption: 'Abstract Art'
22+
2623
links:
2724
- type: pdf
28-
url: https://arxiv.org/abs/2507.16632
29-
- type: source
30-
url: https://arxiv.org/abs/2507.16632
25+
url: https://arxiv.org/pdf/2507.16632
3126

3227
url_pdf: 'https://arxiv.org/pdf/2507.16632'
33-
url_code: ''
34-
url_dataset: ''
35-
url_poster: ''
36-
url_project: ''
37-
url_slides: ''
38-
url_source: 'https://arxiv.org/abs/2507.16632'
39-
url_video: ''
40-
---
4128

29+
# Hide page metadata
30+
share: false
31+
show_date: false
32+
profile: false
33+
pager: false
34+
---

content/publications/step-audio-aqaa/index.md

Lines changed: 12 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -13,28 +13,22 @@ publication_short: arXiv
1313

1414
abstract: Step-Audio-AQAA presents a fully end-to-end expressive large audio language model, pushing the boundaries of expressive speech synthesis and understanding.
1515

16-
summary: A fully end-to-end expressive large audio language model.
17-
18-
tags:
19-
- Audio LLM
20-
- Expressive Speech
21-
- End-to-End Model
22-
2316
featured: true
2417

18+
# Featured image - 优美抽象画面
19+
image:
20+
url: 'https://images.unsplash.com/photo-1506259091721-347e791bab0f?w=1200&q=80'
21+
caption: 'Abstract Art'
22+
2523
links:
2624
- type: pdf
27-
url: https://arxiv.org/abs/2506.08967
28-
- type: source
29-
url: https://arxiv.org/abs/2506.08967
25+
url: https://arxiv.org/pdf/2506.08967
3026

3127
url_pdf: 'https://arxiv.org/pdf/2506.08967'
32-
url_code: ''
33-
url_dataset: ''
34-
url_poster: ''
35-
url_project: ''
36-
url_slides: ''
37-
url_source: 'https://arxiv.org/abs/2506.08967'
38-
url_video: ''
39-
---
4028

29+
# Hide page metadata
30+
share: false
31+
show_date: false
32+
profile: false
33+
pager: false
34+
---

content/publications/step-audio/index.md

Lines changed: 12 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -13,28 +13,22 @@ publication_short: arXiv
1313

1414
abstract: Step-Audio represents a unified approach to understanding and generation in intelligent speech interaction systems, advancing the capabilities of audio language models.
1515

16-
summary: A unified framework for speech understanding and generation in intelligent interaction systems.
17-
18-
tags:
19-
- Audio LLM
20-
- Speech Understanding
21-
- Speech Generation
22-
2316
featured: true
2417

18+
# Featured image - 优美抽象画面
19+
image:
20+
url: 'https://images.unsplash.com/photo-1557672172-298e090bd0f1?w=1200&q=80'
21+
caption: 'Abstract Art'
22+
2523
links:
2624
- type: pdf
27-
url: https://arxiv.org/abs/2502.11946
28-
- type: source
29-
url: https://arxiv.org/abs/2502.11946
25+
url: https://arxiv.org/pdf/2502.11946
3026

3127
url_pdf: 'https://arxiv.org/pdf/2502.11946'
32-
url_code: ''
33-
url_dataset: ''
34-
url_poster: ''
35-
url_project: ''
36-
url_slides: ''
37-
url_source: 'https://arxiv.org/abs/2502.11946'
38-
url_video: ''
39-
---
4028

29+
# Hide page metadata
30+
share: false
31+
show_date: false
32+
profile: false
33+
pager: false
34+
---

0 commit comments

Comments
 (0)