Pseudobulk Analysis with Decoupler and edgeR #5617

dianichj · 2024-12-06T20:00:53Z

🚀 Pseudobulk Analysis with Decoupler and edgeR

Summary

This tutorial introduces pseudobulk analysis using Decoupler and edgeR in Galaxy 🌌. It covers data preparation, generating pseudobulk expression matrices, and performing differential expression analysis, including a Volcano Plot for final visualization of results 🌋📊.

🔗 Zenodo Link

https://zenodo.org/records/13929549

🎯 Objectives

🧬 Understand pseudobulk analysis principles.
🛠️ Generate pseudobulk expression matrices with Decoupler.
📈 Perform differential expression analysis with edgeR.

✨ Key Points

🔄 Pseudobulk bridges single-cell and bulk RNA-seq data.
🧮 Decoupler enables pseudobulk matrix generation.
🛡️ edgeR is robust for differential expression in pseudobulk data.

📋 Pending Items

Finalize tutorial last steps and instructions.
Add final plots and figures 🖼️.
Revise explanations for parameters and key steps 🧐.
Review formatting and style ✍️.

💡 Feel free to review and share your feedback—your input is much appreciated! 🙌

added single-cell tag: "tags": ["single-cell"]

Edited Tags in Workflow file

Added edgeR explanation for performing DE in pseudobulk aggregates.

bgruening · 2025-01-17T13:47:44Z

The failing test is because of:

Liquid Exception: Liquid syntax error (line 325): Tag '{% Remove columns %}' was not properly terminated with regexp: /%}/ in topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

shiltemann

Thanks a lot @dianichj! This looks good, just a couple broken boxes and formatting tweaks (see below)

Just out of curiosity, are you all set up to get a local preview of your tutorial? This can also be done online using CodeSpaces now (see tutotrial)

shiltemann · 2025-01-17T15:01:57Z

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

+
+In this tutorial, we will guide you through a pseudobulk analysis workflow using the **Decoupler** and **edgeR** tools available in Galaxy ({% cite Badia-iMompel2022 %}) ({% cite Liu2015 %}). These tools facilitate functional and differential expression analysis, and their output can be integrated with other Galaxy tools to visualize results, such as creating Volcano Plots, which we will also cover in this tutorial.
+
+> <agenda-title>Pseudobulk Analysis Pipeline Agenda</agenda-title>


the agenda will be automatically generated based on your section titles, pleas replace this with

> <agenda-title></agenda-title> > > In this tutorial, we will cover: > > 1. TOC > {:toc} > {: .agenda}

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

shiltemann · 2025-01-17T15:06:05Z

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

+> 1. What are the output(s) of the edgeR tool?  
+> 2. How can we interpret our output result file?  
+>
+> <solution-title>edgeR Outputs and Interpretation</solution-title>


since these boxes are nested, make sure to add a > in front of this line and every line below until the end of the solution box

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

shiltemann · 2025-01-17T15:10:22Z

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

+# Key Takeaways and Recommendations
+
+## Key Takeaways
+- **Pseudobulk Analysis Advantage:** Pseudobulk analysis bridges single-cell and bulk RNA-seq approaches, combining high resolution with statistical robustness.


anything you add in key_points metadata will be added in a box at the end of the tutorial, so you could consider moving these there.

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

dianichj · 2025-01-17T15:24:08Z

Thanks a lot @dianichj! This looks good, just a couple broken boxes and formatting tweaks (see below)

Just out of curiosity, are you all set up to get a local preview of your tutorial? This can also be done online using CodeSpaces now (see tutotrial)

Thanks a lot for all your comments, Saskia! I will check it again locally after going over all your revisions and will ping you with a comment. 😊

Co-authored-by: Saskia Hiltemann <[email protected]>

dadrasarmin · 2025-01-20T13:52:12Z

Thanks a lot @dianichj! This looks good, just a couple broken boxes and formatting tweaks (see below)

Just out of curiosity, are you all set up to get a local preview of your tutorial? This can also be done online using CodeSpaces now (see tutotrial)

I did not know this Saskia. Thanks for mentioning it! I was following this. I noticed the linked you mentioned here is available here but I think the link in the README is the one that a new user would check.

Thanks.

shiltemann · 2025-01-20T15:22:43Z

@dadrasarmin that is a very good point, I tend to forget about the README file, I will update that!

dianichj · 2025-01-27T10:51:18Z

Hi @MarisaJL , here is the tutorial. Thanks lots for your help <3 !

pavanvidem

@dianichj looks super nice with great explanations!!

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

pavanvidem · 2025-01-28T13:32:08Z

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

+
+Beyond enhancing statistical validity, pseudobulk analysis enables the identification of cell-type-specific gene expression and functional changes across biological conditions. It balances the detailed resolution of single-cell data with the statistical power of bulk RNA-seq, providing insights into the functional transcriptomic landscape relevant to biological questions. Overall, for differential expression analysis in multi-sample single-cell experiments, pseudobulk approaches demonstrate superior performance compared to single-cell-specific DE methods ({% cite Squair2021 %}). 
+
+In this tutorial, we will guide you through a pseudobulk analysis workflow using the **Decoupler** and **edgeR** tools available in Galaxy ({% cite Badia-iMompel2022 %}) ({% cite Liu2015 %}). These tools facilitate functional and differential expression analysis, and their output can be integrated with other Galaxy tools to visualize results, such as creating Volcano Plots, which we will also cover in this tutorial.


Suggested change

In this tutorial, we will guide you through a pseudobulk analysis workflow using the **Decoupler** and **edgeR** tools available in Galaxy ({% cite Badia-iMompel2022 %}) ({% cite Liu2015 %}). These tools facilitate functional and differential expression analysis, and their output can be integrated with other Galaxy tools to visualize results, such as creating Volcano Plots, which we will also cover in this tutorial.

In this tutorial, we will guide you through a pseudobulk analysis workflow using the **Decoupler** ({% cite Badia-iMompel2022 %}) and **edgeR** ({% cite Liu2015 %}) tools available in Galaxy. These tools facilitate functional and differential expression analysis, and their output can be integrated with other Galaxy tools to visualize results, such as creating Volcano Plots, which we will also cover in this tutorial.

why not citing the original edgeR paper from 2009? https://doi.org/10.1093/bioinformatics/btp616

pavanvidem · 2025-01-28T13:41:17Z

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

+key_points:
+- Pseudobulk analysis approach bridges the gap between single-cell and bulk RNA-seq data  
+- Decoupler tool generates a pseudobulk count matrix, enabling downstream differential expression and functional analyses 
+- edgeR is a robust tool for differential expression in pseudobulk datasets  


please add pbmc and combining sc datasets tutorials as requirements here

pavanvidem · 2025-01-28T14:31:52Z

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

+
+> <hands-on-title> Decoupler Pseudobulk </hands-on-title>
+>
+> 1. {% tool [Decoupler pseudo-bulk](toolshed.g2.bx.psu.edu/repos/ebi-gxa/decoupler_pseudobulk/decoupler_pseudobulk/1.4.0+galaxy5) %} tool with the following parameters:


can you instead use the latest version (+galaxy8) of the tool?

Will check if it is working =D

pavanvidem · 2025-01-28T14:37:17Z

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

+
+> <hands-on-title> Use Manipulate AnnData Tools to extract observations </hands-on-title>
+>
+> 1. Use the {% tool [Manipulate AnnData](toolshed.g2.bx.psu.edu/repos/iuc/anndata_manipulate/anndata_manipulate/0.10.9+galaxy0) %} tool with the following parameters:


In the latest version, this function does not exist anymore. We moved this function to Scanpy Filter tool. Please adjust this hands-on step accordingly and also update the workflow.

pavanvidem · 2025-01-28T14:37:48Z

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

+>    - *"Value"*: `T cell` (the name of the cluster of interest for subset analysis)
+{: .hands_on}
+
+After using the {% tool [Manipulate AnnData](toolshed.g2.bx.psu.edu/repos/iuc/anndata_manipulate/anndata_manipulate/0.10.9+galaxy0) %} tool to subset the cell type of interest, go back to the top of this tutorial to the hands-on **Pseudobulk with Decoupler** step, and you may perform once again the same steps in this smaller AnnData object that now should only include your T cells. Results from this analysis will correspond to differential expression between conditions only for T cells. 


Also, replace this with Scanpy Filter

pavanvidem · 2025-01-28T14:40:33Z

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

+{: .hands_on}
+
+After using the {% tool [Manipulate AnnData](toolshed.g2.bx.psu.edu/repos/iuc/anndata_manipulate/anndata_manipulate/0.10.9+galaxy0) %} tool to subset the cell type of interest, go back to the top of this tutorial to the hands-on **Pseudobulk with Decoupler** step, and you may perform once again the same steps in this smaller AnnData object that now should only include your T cells. Results from this analysis will correspond to differential expression between conditions only for T cells. 
+


please include the results (plots and some questions with number of genes etc.) of the T cell subsampled data so that the users can compare results.

pavanvidem · 2025-01-28T14:42:47Z

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

+layout: tutorial_hands_on
+
+title: Pseudobulk Analysis with Decoupler and EdgeR
+zenodo_link: https://zenodo.org/records/13929549


please include answer_histories:

Co-authored-by: Pavankumar Videm <[email protected]>

dianichj and others added 6 commits December 6, 2024 19:55

Add pseudobulk-analysis tutorial and associated files

45bf389

Delete topics/single-cell/tutorials/pseudobulk-analysis/faqs/.Rhistory

93c92c7

Update data-library.yaml

fd8eed3

Update tutorial.bib

6e8d4d4

Update tutorial.bib

d62e5b3

Update tutorial.md

cbf3ef1

github-actions bot added the single-cell label Dec 6, 2024

dianichj and others added 21 commits December 6, 2024 22:17

Update tutorial.md

7667591

Update tutorial.bib

a02bce3

Update tutorial.md

a034ac1

Merge branch 'main' into pseudobulk-analysis

5f81f19

Update tutorial.md

3b977f4

Update tutorial.md

71b28c3

Merge branch 'main' into pseudobulk-analysis

7c0f7d2

Update pseudo-bulk_edgeR.ga

95f00fe

added single-cell tag: "tags": ["single-cell"]

Update pseudo-bulk_edgeR.ga

cc78883

Edited Tags in Workflow file

Update tutorial.md

3c710f5

Update tutorial.md

43dc1fe

Merge branch 'galaxyproject:main' into pseudobulk-analysis

1c48c55

Added Image files for tutorial and their respective links to the md file

1df1e73

Update tutorial.md

7e1b0f5

Added edgeR explanation for performing DE in pseudobulk aggregates.

Update tutorial.md

897e05d

Update tutorial.md

b5d726e

Merge branch 'main' into pseudobulk-analysis

f1dfc9d

Update tutorial.md

2a9e2b4

Update tutorial.bib

b152379

Update tutorial.md

a824bb7

Update tutorial.md

f94f423

dianichj marked this pull request as ready for review January 17, 2025 13:05

dianichj requested a review from a team as a code owner January 17, 2025 13:05

dianichj added 3 commits January 17, 2025 14:05

Merge branch 'galaxyproject:main' into pseudobulk-analysis

457b62c

Update tutorial.md

ee8c80d

Update tutorial.md

4bc0bcd

dianichj added 2 commits January 17, 2025 15:02

Update tutorial.md

1f51fec

Update tutorial.md

5089fc4

shiltemann reviewed Jan 17, 2025

View reviewed changes

topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md Outdated Show resolved Hide resolved

dianichj added 2 commits January 17, 2025 15:54

Update tutorial.md

8a16a72

Update tutorial.md

6371f35

shiltemann reviewed Jan 17, 2025

View reviewed changes

Update tutorial.md

995c392

dianichj and others added 6 commits January 17, 2025 16:28

Update topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

1935996

Co-authored-by: Saskia Hiltemann <[email protected]>

Update topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

940e4a3

Co-authored-by: Saskia Hiltemann <[email protected]>

Update topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

4184c96

Co-authored-by: Saskia Hiltemann <[email protected]>

Update topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

9ca64c5

Co-authored-by: Saskia Hiltemann <[email protected]>

Update topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

9aa7236

Co-authored-by: Saskia Hiltemann <[email protected]>

Update tutorial.md

8cf2690

pavanvidem reviewed Jan 28, 2025

View reviewed changes

dianichj and others added 6 commits January 29, 2025 16:35

Update topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

1ec961c

Co-authored-by: Pavankumar Videm <[email protected]>

Update topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

88da516

Co-authored-by: Pavankumar Videm <[email protected]>

Update topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

0d22bed

Co-authored-by: Pavankumar Videm <[email protected]>

Update topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

b538827

Co-authored-by: Pavankumar Videm <[email protected]>

Update topics/single-cell/tutorials/pseudobulk-analysis/tutorial.md

5abf6a7

Co-authored-by: Pavankumar Videm <[email protected]>

Update tutorial.md

6a67e2f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pseudobulk Analysis with Decoupler and edgeR #5617

Pseudobulk Analysis with Decoupler and edgeR #5617

dianichj commented Dec 6, 2024 •

edited

Loading

bgruening commented Jan 17, 2025

shiltemann left a comment

shiltemann Jan 17, 2025

shiltemann Jan 17, 2025

shiltemann Jan 17, 2025

dianichj commented Jan 17, 2025

dadrasarmin commented Jan 20, 2025

shiltemann commented Jan 20, 2025

dianichj commented Jan 27, 2025

pavanvidem left a comment

pavanvidem Jan 28, 2025

pavanvidem Jan 28, 2025

pavanvidem Jan 28, 2025

pavanvidem Jan 28, 2025

dianichj Jan 29, 2025

pavanvidem Jan 28, 2025

pavanvidem Jan 28, 2025

pavanvidem Jan 28, 2025

pavanvidem Jan 28, 2025


		In this tutorial, we will guide you through a pseudobulk analysis workflow using the Decoupler and edgeR tools available in Galaxy ({% cite Badia-iMompel2022 %}) ({% cite Liu2015 %}). These tools facilitate functional and differential expression analysis, and their output can be integrated with other Galaxy tools to visualize results, such as creating Volcano Plots, which we will also cover in this tutorial.

		> <agenda-title>Pseudobulk Analysis Pipeline Agenda</agenda-title>


		Beyond enhancing statistical validity, pseudobulk analysis enables the identification of cell-type-specific gene expression and functional changes across biological conditions. It balances the detailed resolution of single-cell data with the statistical power of bulk RNA-seq, providing insights into the functional transcriptomic landscape relevant to biological questions. Overall, for differential expression analysis in multi-sample single-cell experiments, pseudobulk approaches demonstrate superior performance compared to single-cell-specific DE methods ({% cite Squair2021 %}).

		In this tutorial, we will guide you through a pseudobulk analysis workflow using the Decoupler and edgeR tools available in Galaxy ({% cite Badia-iMompel2022 %}) ({% cite Liu2015 %}). These tools facilitate functional and differential expression analysis, and their output can be integrated with other Galaxy tools to visualize results, such as creating Volcano Plots, which we will also cover in this tutorial.

		{: .hands_on}

		After using the {% tool [Manipulate AnnData](toolshed.g2.bx.psu.edu/repos/iuc/anndata_manipulate/anndata_manipulate/0.10.9+galaxy0) %} tool to subset the cell type of interest, go back to the top of this tutorial to the hands-on Pseudobulk with Decoupler step, and you may perform once again the same steps in this smaller AnnData object that now should only include your T cells. Results from this analysis will correspond to differential expression between conditions only for T cells.

Pseudobulk Analysis with Decoupler and edgeR #5617

Are you sure you want to change the base?

Pseudobulk Analysis with Decoupler and edgeR #5617

Conversation

dianichj commented Dec 6, 2024 • edited Loading