Adapted from the LGBM model from Predicting the translation efficiency of messenger RNA in mammalian cells, original code available.
- Create conda environment:
conda env create -f environment.yml --prefix ./TE_classic_ML_env/ - Activate conda environment:
conda activate ./TE_clasic_ML_env
Training data is already provided in ./data/.
If generating training data yourself:
- Place/symlink
appris_human_v2_selected.faandappris_mouse_v2_selected.fain./data/. - Place/symlink
human_all_biochem_feature_no_len.csvin./biochem_and_struct_dataif training with biochem data.
Training examples can be found in experiments.py. Use -e human_corr argument to train human correlation model (save models with -s flag).
See original code repo for examples.