Skip to content

A project to analyze the effectiveness of various neural networks on the topic of automatic speech recognition.

Notifications You must be signed in to change notification settings

Sam-limyr/automatic-speech-recognition

 
 

Repository files navigation

speech-recognition

  • Currently data is stored at data/audio and data/label. Maybe can follow LibriSpeech directory structure.
  • AudioDataset - read data from directory and convert to spectogram. Variable length.
  • dataloader_audio.collate_fn pad sequence. Yet to check compatiblity with nn.
  • utils.TextProcess to clean text. Currently include to lower case, remove punctuations, numbers(including years) to words.

About

A project to analyze the effectiveness of various neural networks on the topic of automatic speech recognition.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 98.2%
  • Python 1.8%