Skip to content

Commit 216f146

Browse files
author
Kai Hu
committed
minor
1 parent 231e48d commit 216f146

6 files changed

+96
-0
lines changed

README.md

+70
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,7 @@
99
- [DOWNSTREAM_ARCHR](#downstream_archr)
1010

1111
[Quick Start](#quick-start)
12+
[Example: plots in paper](#Example:-plots-in-paper)
1213
[Documentation](#documentation)
1314
[Credits](#credits)
1415
[Bug report/Support](#bug-reportsupport)
@@ -143,6 +144,75 @@ nextflow run main.nf --help
143144

144145
See documentation [usage](https://github.com/hukai916/scATACpipe/blob/main/docs/usage.md) for all of the available options.
145146

147+
## Example: plots in paper
148+
149+
This section describes how the plots in the manuscript (to be added) were generated using scATACpipe. For comparison, the manuscript conducted 3 separate analyses, each using a different preprocessing strategy (`default`, `10xgenomics`, `chromap`). Since the commands and preprocessed results are quite similar across the three methods, only the `chromap` option will be demonstrated here.
150+
151+
1. The initial execution:
152+
153+
```
154+
nextflow run main.nf -profile singularity,lsf -c conf/test_chromap.config --preprocess chromap --outdir ./chromap_results_haibo --input_fastq /home/hl84w/lucio_castilla/scATAC-seq/docs/10X_human_scatac_fastq_new_style.csv --ref_fasta_ensembl homo_sapiens --species_latin_name 'homo sapiens' --archr_blacklist /home/hl84w/lucio_castilla/scATAC-seq/docs/hg38-blacklist.v2.bed.gz --doublet_removal_algorithm archr --archr_thread 8 -resume e209b4a2-1ada-4893-af13-132d1e3f5a55
155+
```
156+
157+
```
158+
nextflow run main.nf -profile singularity,lsf --preprocess chromap --outdir ./results_chromap --input_fastq ./assets/10X_human_scatac_fastq.csv --ref_fasta_ensembl homo_sapiens --species_latin_name 'homo sapiens' --archr_scrnaseq '/path/scRNA-Hematopoiesis-Granja-2019.rds' --archr_blacklist /home/hl84w/lucio_castilla/scATAC-seq/docs/hg38-blacklist.v2.bed.gz
159+
```
160+
161+
Break down:
162+
163+
* `-profile singularity,lsf`:
164+
165+
This option instructs scATACpipe to use Singularity containers and LSF as the executor. Multiple parameters are separated by commas. Since `profile` is pipeline-level flag, it is prefixed with a single dash (`-`). Module-level flags are prefixed with double dash (`--`).
166+
167+
* `--preprocess chromap`:
168+
169+
This instructs scATACpipe to use Chromap preprocessing strategy.
170+
171+
* `--outdir ./results_chromap`:
172+
173+
Output will be saved into `./results_chromap` folder.
174+
175+
* `--input_fastq ./assets/10X_human_scatac_fastq.csv`:
176+
177+
Please replace the `/path/` in the `10X_human_scatac_fastq.csv` with absolute paths. Details regarding the 6 samples can be found in the supplementary section of the paper. If you detect any outlier samples, you can remove them from the downstream analyses using the `--filter_sample = 'PBMC_10K_C, PBMC_10K_X'` flag.
178+
179+
* `--ref_fasta_ensembl homo_sapiens`:
180+
181+
This specifies that the genome _Homo Sapiens_ from ENSEMBLE will be used as reference. To view all supported genomes, check out `nextflow run main.nf --support_genome`.
182+
183+
* `--species_latin_name 'homo sapiens'`:
184+
185+
Simply the Latin name of the reference genome.
186+
187+
* `--archr_scrnaseq '/path/scRNA-Hematopoiesis-Granja-2019.rds'`
188+
189+
Matching scRNA-seq data. Can ignore if not available. The example file can be downloaded [here](https://jeffgranja.s3.amazonaws.com/ArchR/TestData/scRNA-Hematopoiesis-Granja-2019.rds).
190+
191+
* `--archr_blacklist ./assets/hg38-blacklist.v2.bed.gz`:
192+
193+
Blacklist to exclude for downstream analysis. Click [here](https://github.com/Boyle-Lab/Blacklist/tree/master/lists) for other species.
194+
195+
Instead of passing each flag option via the command line, you can include them all in a configuration file and supply it with the `-c` option. Below is equivalent to above:
196+
```
197+
nextflow run main.nf -profile singularity,lsf -c ./conf/test_chromap_initial.config
198+
```
199+
200+
2. The final execution:
201+
After examining the results from the initial execution, we decided to remove the
202+
203+
204+
--archr_thread 8
205+
206+
207+
208+
209+
210+
211+
212+
213+
To replicate the plots showed in the manuscript (to be added),
214+
215+
146216
## Documentation
147217

148218
The scATACpipe workflow comes with documentation about the pipeline: [usage](https://github.com/hukai916/scATACpipe/blob/main/docs/usage.md) and [output](https://github.com/hukai916/scATACpipe/blob/main/docs/output.md).

assets/10X_human_scatac_fastq.csv

+13
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,13 @@
1+
sample_name,path_fastq_1,path_fastq_2,path_barcode
2+
PBMC_10K_N,/path/atac_pbmc_10k_nextgem_fastqs/atac_pbmc_10k_nextgem_S1_L001_R1_001.fastq.gz,/path/atac_pbmc_10k_nextgem_fastqs/atac_pbmc_10k_nextgem_S1_L001_R3_001.fastq.gz,/path/atac_pbmc_10k_nextgem_fastqs/atac_pbmc_10k_nextgem_S1_L001_R2_001.fastq.gz
3+
PBMC_10K_N,/path/atac_pbmc_10k_nextgem_fastqs/atac_pbmc_10k_nextgem_S1_L002_R1_001.fastq.gz,/path/atac_pbmc_10k_nextgem_fastqs/atac_pbmc_10k_nextgem_S1_L002_R3_001.fastq.gz,/path/atac_pbmc_10k_nextgem_fastqs/atac_pbmc_10k_nextgem_S1_L002_R2_001.fastq.gz
4+
PBMC_1K_N,/path/atac_pbmc_1k_nextgem_fastqs/atac_pbmc_1k_nextgem_S1_L001_R1_001.fastq.gz,/path/atac_pbmc_1k_nextgem_fastqs/atac_pbmc_1k_nextgem_S1_L001_R3_001.fastq.gz,/path/atac_pbmc_1k_nextgem_fastqs/atac_pbmc_1k_nextgem_S1_L001_R2_001.fastq.gz
5+
PBMC_1K_N,/path/atac_pbmc_1k_nextgem_fastqs/atac_pbmc_1k_nextgem_S1_L002_R1_001.fastq.gz,/path/atac_pbmc_1k_nextgem_fastqs/atac_pbmc_1k_nextgem_S1_L002_R3_001.fastq.gz,/path/atac_pbmc_1k_nextgem_fastqs/atac_pbmc_1k_nextgem_S1_L002_R2_001.fastq.gz
6+
PBMC_500_N,/path/atac_pbmc_500_nextgem_fastqs/atac_pbmc_500_nextgem_S1_L001_R1_001.fastq.gz,/path/atac_pbmc_500_nextgem_fastqs/atac_pbmc_500_nextgem_S1_L001_R3_001.fastq.gz,/path/atac_pbmc_500_nextgem_fastqs/atac_pbmc_500_nextgem_S1_L001_R2_001.fastq.gz
7+
PBMC_500_N,/path/atac_pbmc_500_nextgem_fastqs/atac_pbmc_500_nextgem_S1_L002_R1_001.fastq.gz,/path/atac_pbmc_500_nextgem_fastqs/atac_pbmc_500_nextgem_S1_L002_R3_001.fastq.gz,/path/atac_pbmc_500_nextgem_fastqs/atac_pbmc_500_nextgem_S1_L002_R2_001.fastq.gz
8+
PBMC_5K_N,/path/atac_pbmc_5k_nextgem_fastqs/atac_pbmc_5k_nextgem_S1_L001_R1_001.fastq.gz,/path/atac_pbmc_5k_nextgem_fastqs/atac_pbmc_5k_nextgem_S1_L001_R3_001.fastq.gz,/path/atac_pbmc_5k_nextgem_fastqs/atac_pbmc_5k_nextgem_S1_L001_R2_001.fastq.gz
9+
PBMC_5K_N,/path/atac_pbmc_5k_nextgem_fastqs/atac_pbmc_5k_nextgem_S1_L002_R1_001.fastq.gz,/path/atac_pbmc_5k_nextgem_fastqs/atac_pbmc_5k_nextgem_S1_L002_R3_001.fastq.gz,/path/atac_pbmc_5k_nextgem_fastqs/atac_pbmc_5k_nextgem_S1_L002_R2_001.fastq.gz
10+
PBMC_10K_V,/path/atac_v1_pbmc_10k_fastqs/atac_v1_pbmc_10k_S1_L001_R1_001.fastq.gz,/path/atac_v1_pbmc_10k_fastqs/atac_v1_pbmc_10k_S1_L001_R3_001.fastq.gz,/path/atac_v1_pbmc_10k_fastqs/atac_v1_pbmc_10k_S1_L001_R2_001.fastq.gz
11+
PBMC_10K_V,/path/atac_v1_pbmc_10k_fastqs/atac_v1_pbmc_10k_S1_L002_R1_001.fastq.gz,/path/atac_v1_pbmc_10k_fastqs/atac_v1_pbmc_10k_S1_L002_R3_001.fastq.gz,/path/atac_v1_pbmc_10k_fastqs/atac_v1_pbmc_10k_S1_L002_R2_001.fastq.gz
12+
PBMC_5K_V,/path/atac_v1_pbmc_5k_fastqs/atac_v1_pbmc_5k_S1_L001_R1_001.fastq.gz,/path/atac_v1_pbmc_5k_fastqs/atac_v1_pbmc_5k_S1_L001_R3_001.fastq.gz,/path/atac_v1_pbmc_5k_fastqs/atac_v1_pbmc_5k_S1_L001_R2_001.fastq.gz
13+
PBMC_5K_V,/path/atac_v1_pbmc_5k_fastqs/atac_v1_pbmc_5k_S1_L002_R1_001.fastq.gz,/path/atac_v1_pbmc_5k_fastqs/atac_v1_pbmc_5k_S1_L002_R3_001.fastq.gz,/path/atac_v1_pbmc_5k_fastqs/atac_v1_pbmc_5k_S1_L002_R2_001.fastq.gz

assets/hg38-blacklist.v2.bed.gz

5.73 KB
Binary file not shown.

conf/tem.config

+3
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
params {
2+
profile = 'singularity,lsf'
3+
}

conf/test_chromap_initial.config

+10
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
params {
2+
profile = 'singularity,lsf'
3+
preprocess = 'chromap'
4+
outdir = './results_chromap'
5+
input_fastq = './assets/10X_human_scatac_fastq.csv'
6+
ref_fasta_ensembl = 'homo_sapiens'
7+
species_latin_name = 'homo sapiens'
8+
archr_blacklist = '/home/hl84w/lucio_castilla/scATAC-seq/docs/hg38-blacklist.v2.bed.gz'
9+
archr_scrnaseq = '/path/scRNA-Hematopoiesis-Granja-2019.rds'
10+
}

docs/template_of_method.docx

-554 Bytes
Binary file not shown.

0 commit comments

Comments
 (0)