Skip to content

wjnwjn59/Vietnamese_Automatic_Speech_Recognition

Repository files navigation

Vietnamese Automatic Speech Recognition

Description

This repo is about learning the basic process in speech's field by practicing creating an Automatic Speech Recognition (ASR) system. The process contains data preprocessing, model training and evaluation.

Dataset

The dataset used in training ASR model is the VIVOS Corpus. You can download the dataset from here.

Model

Here is a DeepSpeech2 model trained on VIVOS with the batch size of 8 and epochs of 200, which reached 0.4390 WER on VIVOS test set. You can download the model in link

Instruction

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published