-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add notebook for downloading McFarland 2020 Figure 1 data #2
base: main
Are you sure you want to change the base?
Add notebook for downloading McFarland 2020 Figure 1 data #2
Conversation
This PR adds a Jupyter notebook to download the data from McFarland et al., 2020 used to produce Figure 1 (i.e., response to idasanutlin and control DMSO for different cell lines). This PR also adds a `utils.py` file to the datasets folder containing reusable functions for downloading/preprocessing.
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
This PR adds a notebook to download + preprocess the Norman 2019 dataset starting directly from downloading the raw counts. The notebook currently downloads the data, and fills in various metadata values. I made this PR as the current Norman 2019 notebook depends on downloading another h5ad file first- I personally like being able to see the full workflow (i.e., going from author provided files to final anndata) as part of the notebooks. As mentioned in theislab#2, I'm not sure what QC steps you prefer so this notebook simply produces an anndata with raw counts.
Hey Ethan, great questions! I'll post the answers here for now but ideally there would be some other documentation somewhere other than an obscure
|
* Add Norman 2019 notebook with more details This PR adds a notebook to download + preprocess the Norman 2019 dataset starting directly from downloading the raw counts. The notebook currently downloads the data, and fills in various metadata values. I made this PR as the current Norman 2019 notebook depends on downloading another h5ad file first- I personally like being able to see the full workflow (i.e., going from author provided files to final anndata) as part of the notebooks. As mentioned in #2, I'm not sure what QC steps you prefer so this notebook simply produces an anndata with raw counts. * Add standard metadata fields * standardize naming Authored-by: Ethan Weinberger <[email protected]>
Got it- the distinction between the curation/preprocessing notebooks makes sense to me. Based on that distinction, it seems like it makes sense to have the |
Closing since this is taken care of by `mcfarland_2020_curation.ipynb' |
Reopening per @yugeji's request |
This PR adds a Jupyter notebook to download the data from
McFarland et al., 2020 used to produce Figure 1 (i.e.,
response to idasanutlin and control DMSO for different cell lines).
This PR also adds a
utils.py
file to the datasets foldercontaining reusable functions for downloading/preprocessing.
A couple things that should probably be hashed out before this gets merged:
h5ad
file.anndata
object in my notebook just contains raw counts.