Commit ecfae80

Fix typo in speech synthesis readme
Spectrograms are acoustic features rather than linguistic features.
1 parent 690233c commit ecfae80

1 file changed: +2, -2 lines changed

PyTorch/SpeechSynthesis/README.md

@@ -15,7 +15,7 @@ In this collection, we will cover:
 TTS synthesis is a 2-step process described as follows:

 1. Text to Spectrogram Model:
-This model Transforms the text into time-aligned features such as spectrogram, mel spectrogram, or F0 frequencies and other linguistic features. We use architectures like Tacotron
+This model Transforms the text into time-aligned features such as spectrogram, mel spectrogram, or F0 frequencies and other acoustic features. We use architectures like Tacotron

 2. Spectrogram to Audio Model:
 Converts generated spectrogram time-aligned representation into continuous human-like audio—for example, WaveGlow.
@@ -51,4 +51,4 @@ Here are the examples relevant for image segmentation, directly from [Deep Learn

 2. FastPitch for text to melspectogram generation using PyTorch
 - [Git repository](https://github.com/NVIDIA/DeepLearningExamples/tree/master/PyTorch/SpeechSynthesis/FastPitch)
-- Uses PyTorch 20.03-py3 [NGC container](https://ngc.nvidia.com/registry/nvidia-pytorch)
+- Uses PyTorch 20.03-py3 [NGC container](https://ngc.nvidia.com/registry/nvidia-pytorch)
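
The rationale in the commit message (spectrograms are acoustic, not linguistic, features) can be illustrated with a minimal sketch. A magnitude spectrogram is computed directly from the audio waveform, with no reference to text or phonemes. The function name `magnitude_spectrogram`, the frame and hop sizes, and the 500 Hz test tone are assumptions chosen for illustration; none of them come from the repository. The naive DFT keeps the example dependency-free, whereas real code would normally use an FFT library.

```python
import cmath
import math


def magnitude_spectrogram(waveform, frame_length=256, hop_length=128):
    """Return a list of frames, each holding frame_length // 2 + 1 magnitudes."""
    # Hann window to reduce spectral leakage at the frame edges.
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_length)
              for n in range(frame_length)]
    num_bins = frame_length // 2 + 1
    frames = []
    for start in range(0, len(waveform) - frame_length + 1, hop_length):
        frame = [waveform[start + n] * window[n] for n in range(frame_length)]
        # Naive DFT, O(N^2) per frame (illustrative only).
        frames.append([
            abs(sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_length)
                    for n in range(frame_length)))
            for k in range(num_bins)
        ])
    return frames


# Hypothetical input: a quarter second of a 500 Hz sine sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 500.0 * n / sample_rate) for n in range(2000)]

spectrogram = magnitude_spectrogram(tone)
first_frame = spectrogram[0]
peak_bin = max(range(len(first_frame)), key=lambda k: first_frame[k])
peak_hz = peak_bin * sample_rate / 256  # bin index -> frequency in Hz
```

Every value in `spectrogram` is derived from the signal's frequency content alone, and the strongest bin of the first frame maps back to the tone's frequency. That is the sense in which the corrected README line groups spectrograms with F0 as acoustic features.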
