Add blog post on Asturian TTS voice cloning methodology #25
base: master
Walkthrough
New blog post added documenting a hybrid low-resource Asturian TTS build using donor voice cloning, covering data sources, preprocessing, WPM filtering, zero-shot cloning, and released artifacts; no code or exported/public declarations changed.
Estimated code review effort: 🎯 2 (Simple) | ⏱️ ~8 minutes
Pre-merge checks and finishing touches: ✅ Passed checks (3 passed)
Actionable comments posted: 1
🧹 Nitpick comments (2)
_posts/2025-11-30-ast.md (2)
17-17: Consider rephrasing to avoid overused word. Line 17 uses "exactly," which is flagged as overused. Minor style improvement.
- That's exactly the challenge we tackled for **Asturian (ast)**, a beautiful Romance language spoken in Asturias, Spain.
+ That's precisely the challenge we tackled for **Asturian (ast)**, a beautiful Romance language spoken in Asturias, Spain.
42-44: Reduce repetitive sentence starters. Three consecutive sentences begin with "We." Consider combining or rewording for better flow.
- We converted all audio to a standard format and ensured the volume was consistent across all recordings (normalization).
- We trimmed silence from the beginning and end of each recording.
- We filtered out recordings where people spoke too fast or too slow (outliers based on **Words-Per-Minute**), keeping only the most natural and consistent segments. This focused our dataset on the best quality transcripts.
+ We converted all audio to a standard format, ensured consistent volume (normalization), and trimmed silence from the beginning and end of each recording. Finally, we filtered out recordings where people spoke too fast or too slow (outliers based on **Words-Per-Minute**), keeping only the most natural and consistent segments to focus on the highest quality transcripts.
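For readers of the post who want to see what the Words-Per-Minute outlier filter might look like in practice, here is a minimal sketch. The post does not state the thresholds or the outlier rule actually used, so the median-centred band, the `tolerance` default, and the names `Clip`, `words_per_minute`, and `filter_by_wpm` are all assumptions made for illustration.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class Clip:
    """Hypothetical record: one transcript and the duration of its audio."""
    text: str
    duration_s: float


def words_per_minute(clip: Clip) -> float:
    """Speaking rate: whitespace-separated word count per minute of audio."""
    return len(clip.text.split()) / (clip.duration_s / 60.0)


def filter_by_wpm(clips: list[Clip], tolerance: float = 0.3) -> list[Clip]:
    """Keep clips whose WPM lies within +/- tolerance of the median WPM.

    The median band and the 0.3 default are assumptions for illustration,
    not the rule used in the blog post.
    """
    usable = [c for c in clips if c.duration_s > 0]  # avoid division by zero
    if not usable:
        return []
    mid = median(words_per_minute(c) for c in usable)
    low, high = mid * (1 - tolerance), mid * (1 + tolerance)
    return [c for c in usable if low <= words_per_minute(c) <= high]
```

A median-based band is used here because a handful of extremely fast or slow clips would shift a mean-based threshold; a percentile or z-score rule would be an equally plausible choice.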
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
⛔ Files ignored due to path filters (1)
public/assets/blog/ast/thumb.png is excluded by !**/*.png
📒 Files selected for processing (1)
_posts/2025-11-30-ast.md (1 hunks)
🧰 Additional context used
🪛 LanguageTool
_posts/2025-11-30-ast.md
[style] ~17-~17: Consider an alternative for the overused word “exactly”.
Context: ...ommonly supported by big tech? That's exactly the challenge we tackled for **Asturian...
(EXACTLY_PRECISELY)
[style] ~44-~44: Three successive sentences begin with the same word. Consider rewording the sentence or use a thesaurus to find a synonym.
Context: ... beginning and end of each recording. - We filtered out recordings where people sp...
(ENGLISH_WORD_REPEAT_BEGINNING_RULE)
🪛 markdownlint-cli2 (0.18.1)
_posts/2025-11-30-ast.md
33-33: No empty links
(MD042, no-empty-links)
🔇 Additional comments (1)
_posts/2025-11-30-ast.md (1)
58-60: External links verified as accessible. Both Hugging Face URLs (model and dataset) return HTTP 200 responses and are ready for publication.
Actionable comments posted: 0
♻️ Duplicate comments (1)
_posts/2025-11-30-ast.md (1)
33-33: Fix broken empty link reference (duplicate of past review). This issue was flagged in a previous review and remains unresolved. The link to the whitepaper has no URL: [Whitepaper on Hybrid TTS Dataset Synthesis]().
Either add the URL or remove the link formatting:
- Here's a simplified look at the process we followed (for a more detailed, technical explanation, check out our **[Whitepaper on Hybrid TTS Dataset Synthesis]()**):
+ Here's a simplified look at the process we followed (for a more detailed, technical explanation, check out our Whitepaper on Hybrid TTS Dataset Synthesis):
Or if the whitepaper URL is available:
- Here's a simplified look at the process we followed (for a more detailed, technical explanation, check out our **[Whitepaper on Hybrid TTS Dataset Synthesis]()**):
+ Here's a simplified look at the process we followed (for a more detailed, technical explanation, check out our **[Whitepaper on Hybrid TTS Dataset Synthesis](https://example.com/whitepaper)**):
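To catch this kind of empty link before review, a small local check along the following lines could help. It is a rough sketch only: it looks for inline links with an empty destination and does not reimplement markdownlint's MD042 rule; `EMPTY_LINK` and `find_empty_links` are hypothetical names.

```python
import re

# Inline link with an empty destination, e.g. [text]() or [text](   ).
# The lookbehind skips images such as ![alt]().
EMPTY_LINK = re.compile(r"(?<!!)\[([^\]]*)\]\(\s*\)")


def find_empty_links(markdown: str) -> list[tuple[int, str]]:
    """Return (line_number, link_text) for each empty inline link."""
    hits = []
    for number, line in enumerate(markdown.splitlines(), start=1):
        for match in EMPTY_LINK.finditer(line):
            hits.append((number, match.group(1)))
    return hits
```

Running it over the post's Markdown source would report the line number and link text of each empty link, which could then be fixed or unlinked as suggested above.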
🧹 Nitpick comments (1)
_posts/2025-11-30-ast.md (1)
40-44: Reduce consecutive sentence repetition. Multiple consecutive sentences begin with "We" (lines 42, 43, 44). Consider rewording for better flow:
  - We converted all audio to a standard format and ensured the volume was consistent across all recordings (normalization).
- - We trimmed silence from the beginning and end of each recording.
- - We filtered out recordings where people spoke too fast or too slow (outliers based on **Words-Per-Minute**), keeping only the most natural and consistent segments. This focused our dataset on the best quality transcripts.
+ - Silence was trimmed from the beginning and end of each recording.
+ - Recordings where people spoke too fast or too slow (outliers based on **Words-Per-Minute**) were filtered out, keeping only the most natural and consistent segments to focus the dataset on the best quality transcripts.
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
_posts/2025-11-30-ast.md (1 hunks)
🔇 Additional comments (1)
_posts/2025-11-30-ast.md (1)
1-76: Empty link unresolved and audio sample files missing; manual verification required. The empty Whitepaper link on line 33 remains unfixed: [Whitepaper on Hybrid TTS Dataset Synthesis](). Additionally, no audio sample files exist in public/assets/blog/ast/ (only the thumbnail image is present), while other blog posts contain audio samples (.wav files).
Given the PR description states "TODO - do not merge until female voice + audio samples added," please manually verify:
- Whether the merge-blocking condition has been satisfied
- The current PR status and description
- If the whitepaper link should be populated with a URL or removed entirely
TODO - do not merge until female voice + audio samples added