Concat VC

A Personal Experiment in Real-Time Voice Conversion

学習が高速なリアルタイム声質変換を作ってみる個人的な実験

開発のはじめかた

Linux や WSL2 での開発を想定しています。

# Install `asdf` before you start if you haven't already.
# asdf: https://asdf-vm.com/guide/getting-started.html

# Clone this repository.
git clone https://github.com/omasakun/concat-vc.git
cd concat-vc

# Install the required tools and packages.
asdf install
pdm  install -G :all

# Now you are ready to go!
source .venv/bin/activate
python engine/hello.py

やってみたこと

Attempt 01: 単純な音声の切り貼り

変換先の音声が十分にあるなら、変換元の声の発音に近い変換先話者の音声を切り貼りするだけでもそれなりにうまく変換できるかもしれないと思ったので、試してみた。

Attempt 02: 音程によらない表現をつかう

先の Attempt 01 では、音程があっていない音声を無理やりつなげているせいでうまくいかないように見えた。

なので今度は、音程を調節してから切り貼りするようにしたら良くなるか試してみる。

→ 音程によらない表現を作るのがうまくできなかったので、ひとまず後回しにした。

Attempt 03: FragmentVC をつかう

Fragment VC がどれくらいの性能なのか、実際に確認してみる。

公式の実装があるので、それを使ってみる。

参考にしたものなど

Faiss (efficient similarity search)
wav2vec 2.0 (phonetic feature extraction)
CREPE (pitch estimation)
AdaSpeech (conditional layer normalization)
HiFi-GAN (audio waveform generation)
JVS corpus (free multi-speaker voice corpus)
FastSpeech 2, FastPitch (introduced me to the world of voice conversion)
FragmentVC (inspired me to use a similarity search)

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.vscode		.vscode
engine		engine
.editorconfig		.editorconfig
.gitattributes		.gitattributes
.gitignore		.gitignore
.tool-versions		.tool-versions
README.md		README.md
pdm.lock		pdm.lock
pdm.toml		pdm.toml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Concat VC

開発のはじめかた

やってみたこと

Attempt 01: 単純な音声の切り貼り

Attempt 02: 音程によらない表現をつかう

Attempt 03: FragmentVC をつかう

参考にしたものなど

About

Uh oh!

Uh oh!

Languages

omasakun/concat-vc

Folders and files

Latest commit

History

Repository files navigation

Concat VC

開発のはじめかた

やってみたこと

Attempt 01: 単純な音声の切り貼り

Attempt 02: 音程によらない表現をつかう

Attempt 03: FragmentVC をつかう

参考にしたものなど

About

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Languages