Benchmarking and analysis for DEBIAS-M

Welcome to DEBIAS-M! DEBIAS-M is a computational methods designed to identify and correct for processing bias and batch effects in microbiome data.

Code Walkthrough

Every folder within this repository contains analyses to produce a different component within the results or plots directories. The different analyses contained within each folder are as follows:

File/Folder	Description
CRC-simulations	Code to run and plot the simulutations synthetically varying different underlying parameters in the colorectal cancer prediction benchmark (Fig S5)
pseudocount-evaluation	Code evaluating the impact of pseudocounts and flooring as pre- and post-processing steps for DEBIAS-M (Fig S10a-c)
HIV-using-age	Code evaluating the use of different covariates during batch-correction on the HIV benchmark (Figs 2a, S3c)
regression	The regression benchmark for metabolite prediction (Figs 5c, S8)
regression-all-methods	The regression benchmark for the plot incorporating a wider array of batch-correction methods (Fig 5b)
Simulations	Implementing the simulations using synthetically generated data (Figs 3, S4, S6)
runtime-benchmark	Evaluates DEBIAS-M's runtime under various scenarios (Fig S10h,i)
convex-assumption	Generates visualizations describing the optimization space of DEBIAS-M, illustrating non-convex spaces
single-batch-cv	Testing DEBIAS-M as a single-study processor correcting for biases while only considering a single batch (Fig S9b)
study-weighting	Evaluates the impact of weighting the influence of each study's impact on the cross-batch loss based on each study's size (Fig R3-4)
loss-functions	Evaluates DEBIAS-M on some of the main benchmarks using a wide variety of loss functions (Fig S10f,g)
taxonomy-aggregation	Evaluates the impact of aggregating microbial features to different levels of taxonomy (Fig S10d,e)
new-cervix-carcinoma	Implementation of DEBIAS-M cervical carcinoma analyses, using a newly created combination of cervical datasets (Figs 2d, 6a,b)
within-control-comparison	Evaluates a version of Online DEBIAS-M in which similarity is only enforced within the controls of the training samples (Fig R2-2)
v1-DEBIAS-M-Analysis	Includes code for all remaining analyses (see nested README within this folder)
make-PR-csv.ipynb	Saves the table describing all auPRs in the main benchmarks (Supplementary Table 2)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Benchmarking and analysis for DEBIAS-M

Code Walkthrough

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.ipynb_checkpoints		.ipynb_checkpoints
CRC-simulations		CRC-simulations
HIV-using-age		HIV-using-age
Simulations		Simulations
convex-assumption		convex-assumption
data		data
loss-functions		loss-functions
new-cervix-carcinoma		new-cervix-carcinoma
plots		plots
pseudocount-evaluation		pseudocount-evaluation
regression-all-methods		regression-all-methods
regression		regression
results		results
runtime-benchmark		runtime-benchmark
single-batch-cv		single-batch-cv
study-weighting		study-weighting
taxonomy-aggregation		taxonomy-aggregation
v1-DEBIAS-M-Analysis		v1-DEBIAS-M-Analysis
within-control-comparison		within-control-comparison
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
make-PR-csv.ipynb		make-PR-csv.ipynb

korem-lab/DEBIAS-M-Analysis

Folders and files

Latest commit

History

Repository files navigation

Benchmarking and analysis for DEBIAS-M

Code Walkthrough

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages