This project was started in the context of the Competitive Programming with Deep Learning seminar at HPI. We investigate whether knowledge transfer from Wav2Vec2 (speech recognition from audio) to the task of brain-to-text decoding is possible. An arXiv preprint is available.
- Create the conda env: `conda env create -f environment.yaml`
- Activate the env: `conda activate b2t`
- Set the Python path to the repository root dir: `export PYTHONPATH="[...]/brain2text"`
- Run `python run.py --experiment_type=b2p2t_gru+w2v`. The first run creates a `config.yaml` in the project root and prompts you to specify cache directories, API keys, etc. before running again.
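For convenience, the full first-time setup might look like the following sketch (the `PYTHONPATH` placeholder must be replaced with the path to your checkout):

```bash
# First-time setup sketch; adjust the placeholder path to your checkout.
conda env create -f environment.yaml
conda activate b2t
export PYTHONPATH="[...]/brain2text"            # repository root
python run.py --experiment_type=b2p2t_gru+w2v   # first run writes config.yaml; fill it in, then run again
```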
To update an existing environment after changes to `environment.yaml`:
- `conda activate b2t`
- `conda env update --file environment.yaml --prune`
To create the environment manually (without `environment.yaml`):
- `conda create --name b2t pytorch torchvision torchaudio pytorch-cuda=11.8 -c pytorch -c nvidia`
- `conda activate b2t`
- `conda install scipy pydantic tokenizers transformers matplotlib`
- `conda install -c conda-forge wandb`
To run the experiments for the 45 Brain Feature Extractor Architectures in each of our three experiment setups (we used an NVIDIA A100 GPU and 32 GB of RAM):
- Start a WandB sweep for the corresponding experiment setup: `wandb sweep sweeps/[...experiment setup sweep file].yaml`
- Connect one or more agents to execute the runs: `wandb agent [SWEEP ID]`
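For example (a sketch; the actual sweep file names in `sweeps/` and your WandB entity/project will differ):

```bash
# Hypothetical sweep file name; use the file matching your experiment setup.
wandb sweep sweeps/b2p2t_gru+w2v.yaml
# `wandb sweep` prints a sweep ID of the form <entity>/<project>/<id>.
# Start an agent with that ID; run the same command on more machines/GPUs to parallelize.
wandb agent [SWEEP ID]
```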
The results of our runs can be found in the `analysis` directory.
If you set `--lm_decode_test_predictions=true` when executing `run.py`, you probably need 64 GB of RAM (at least for batch size 64).
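If less memory is available, lowering the batch size reduces the requirement; a sketch using only flags that appear in this README:

```bash
# Sketch: enable LM decoding of test predictions with a smaller batch size to reduce memory usage.
python run.py --experiment_type=b2p2t_gru+w2v --lm_decode_test_predictions=true --batch_size=16
```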
To run the Wav2Vec2Conformer experiment to reproduce our results, execute python run.py --encoder_fc_hidden_sizes=[256] --encoder_gru_hidden_size=512 --encoder_num_gru_layers=3 --use_wandb=true --experiment_type=b2p2t_gru+w2v_conformer --loss_function=ctc --early_stopping_patience=10 --epochs=100 --batch_size=32 --learning_rate=0.0001 --return_best_model=false --encoder_learnable_inital_state=false --unfreeze_strategy=brain_encoder+w2v --weight_decay=8.324385138271928e-05 --encoder_dropout=0.4570249990196249 --gaussian_smooth_width=1.5290517142639226 --w2v_learning_rate=9.506050391898906e-06 --w2v_warmup_steps=7 --w2v_warmup_start_step=7 --whiteNoiseSD=0.01978441712172472 --constantOffsetSD=0.2443028255597108 --lm_decode_test_predictions=true --wav2vec_checkpoint=facebook/wav2vec2-conformer-rope-large-960h-ft --tokenizer_checkpoint=facebook/wav2vec2-conformer-rope-large-960h-ft --experiment_name="gru+w2v conformer large"