Regulator and moonlighter T_gcon set value #64
Hi, I saw the scatterplot code that you provide on GitHub and in the tutorial at https://churchman.med.harvard.edu/genewalk. May I ask why you chose the 75th percentile of the distribution as your threshold? Here is your code:

#!/usr/bin/env python
coding: utf-8
### import other packages (numpy, pandas, matplotlib and seaborn)
Install these with pip install before importing for plotting purposes below.
import os
### Load GeneWalk scatterplots file
project_folder = '/home/genewalk/qki/'
filename = 'genewalk_scatterplots.csv'
### Regulator and Moonlighting plot functions
- If necessary customize plot sizes, plot filenames etc in functions below
- The plots are automatically saved as png and pdf in
Replies: 1 comment
Hi @s-janewanthana
Thank you for reaching out. Generally speaking, the higher a gene's connectivity and fraction of relevant GO terms, the more relevant the gene is in the regulator scatterplot. The 75th percentile threshold has no particular statistical justification; it is just guidance for plotting purposes.
Please feel free to adjust it to other values, or omit it altogether, if that makes more sense for your data.
The same holds for the T_frac default value of 0.5.
It might also be worth considering genes with a high fraction of relevant GO terms that are less connected. I hope this helps.
Kind regards,
Robert
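
For readers who want to try other thresholds as suggested in the reply above, here is a minimal sketch of percentile-based selection. It is not the GeneWalk plotting code: the function name `select_regulators`, the column names `gene_con` and `frac_rel_go`, the use of `>=` comparisons, and the toy data are all assumptions made for illustration, so check them against the columns in your own `genewalk_scatterplots.csv`.

```python
import numpy as np
import pandas as pd


def select_regulators(df, con_col="gene_con", frac_col="frac_rel_go",
                      con_percentile=75.0, t_frac=0.5):
    """Return (t_gcon, selected_rows) for the given thresholds.

    Hypothetical helper: column names and the >= comparisons are
    assumptions, not taken from the GeneWalk source.
    """
    # Connectivity threshold = chosen percentile of the connectivity column.
    t_gcon = float(np.percentile(df[con_col], con_percentile))
    # Keep genes that pass both the connectivity and the fraction threshold.
    mask = (df[con_col] >= t_gcon) & (df[frac_col] >= t_frac)
    return t_gcon, df[mask]


# Toy data standing in for the real scatterplot table (made-up values).
toy = pd.DataFrame({
    "gene": ["A", "B", "C", "D", "E"],
    "gene_con": [1, 2, 3, 4, 5],
    "frac_rel_go": [0.9, 0.2, 0.6, 0.7, 0.4],
})

# Defaults mentioned in the thread: 75th percentile and T_frac = 0.5.
t75, hits75 = select_regulators(toy)

# A looser connectivity cut with a stricter fraction cut.
t50, hits50 = select_regulators(toy, con_percentile=50.0, t_frac=0.6)

# Genes with a high fraction of relevant GO terms that are less connected.
high_frac_low_con = toy[(toy["frac_rel_go"] >= 0.5) & (toy["gene_con"] < t75)]
```

Comparing `hits75` with `hits50` shows how sensitive the selected gene set is to the threshold choice, which is one way to judge whether a different value suits your data better.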