Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extracting genes driving the enrichment and their mean bootstrap expression #76

Closed
iscastanho opened this issue Sep 5, 2022 · 3 comments

Comments

@iscastanho
Copy link

iscastanho commented Sep 5, 2022

I am not sure if these are already features of EWCE (although I could not find in manuals/tutorials), so I am posting them as questions.

1. Is there a way of extracting which genes are driving the enrichment for each cell type tested?
At the moment I am using the “marker” genes for each population/subpopulation I have (which I identified with Seurat) but I was wondering if there was a way of extracting this directly from EWCE as I can see that it performs differential expression as part of the pipeline (using LIMMA if I noticed it correctly).

2. Can I get the mean bootstrap expression for each gene from EWCE? If so, how?
In Skene and Grant, 2016 (https://doi.org/10.3389/fnins.2016.00016), in Figure 2 C and D, genes from one of the cell types (microglia) are shown, highlighting their expression against the mean bootstrap expression. I would like to have access to this type of information from my data too. How could I extract this?

Thank you.

@Al-Murphy
Copy link
Collaborator

  1. Is there a way of extracting which genes are driving the enrichment for each cell type tested?
    This is essentially the specificity i.e. how specific is the expression of a gene to a cell type. This is available in the ctd made from your reference scRNA-seq dataset. For example (from the vignette dataset):
ctd <- ewceData::[ctd]
#cell type level 1 specificity for genes
ctd[[1]]$specificity

EWCE as I can see that it performs differential expression as part of the pipeline (using LIMMA if I noticed it correctly)
EWCE uses limma as a filtering step to remove uninformative genes i.e. genes, the function drop_uninformative_genes. This is done to remove genes that don't vary across cell types (using limma) to help reduce noise in subsequent steps (makes a more far comparison between your gene list and the randomly sampled background genes). So in this sense, limma isn't used to identify cell type specific gene lists but as a preprocessing step to filter out genes.

  1. Can I get the mean bootstrap expression for each gene from EWCE? If so, how?
    I believe figure 2 C and D are essentially showing the specificity of genes (perhaps @NathanSkene can confirm?). So you should be able to use the specificity values to get a similar plot.

Thanks

@NathanSkene
Copy link
Owner

NathanSkene commented Sep 6, 2022 via email

@bschilder
Copy link
Collaborator

Here's the function @iscastanho
https://nathanskene.github.io/EWCE/reference/generate_bootstrap_plots.html

See here for some upgrades I'm making to it soon.
#77

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants