Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue with background gene lists and orthogene #75

Closed
Al-Murphy opened this issue Aug 16, 2022 · 2 comments
Closed

Issue with background gene lists and orthogene #75

Al-Murphy opened this issue Aug 16, 2022 · 2 comments
Assignees
Labels

Comments

@Al-Murphy
Copy link
Collaborator

Al-Murphy commented Aug 16, 2022

1. Bug description

Gene lists aren't filtered based on reference scRNA-Seq dataset. This caused an error in a recent run where only one background gene was left after filtering to the SCT dataset.

Console output

Error in apply(temp, 2, sum) : dim(X) must have a positive length
Calls: <Anonymous> ... <Anonymous> -> lapply -> FUN -> cell_list_dist -> apply

Expected behaviour

I would expect that gene lists should be filtered by the SCT dataset early on to avoid issues downstream like this. For example with background gene lists that are generated to match gene length of the hits list, if any of these genes are removed after this step, it will massively affect this gene length control.

@Al-Murphy Al-Murphy added the bug label Aug 16, 2022
@Al-Murphy Al-Murphy self-assigned this Aug 16, 2022
@Al-Murphy Al-Murphy changed the title Gene lists aren't filtered based on SCT dataset Gene lists aren't filtered based on reference scRNA-seq dataset Aug 16, 2022
@Al-Murphy
Copy link
Collaborator Author

@bschilder just so you are aware

I was wrong, the issue actually only occurs when the user pass a background list and the reference dataset species is the same as the gene list species. The issue is with orthogene and is described here: neurogenomics/orthogene#22

In short, orthogene has been replacing the background gene list with one generated from all known genes leading to the issues described above. I have made a fix in version 1.5.7 and have made a note for orthogene to have this issue fixed at source

@Al-Murphy Al-Murphy changed the title Gene lists aren't filtered based on reference scRNA-seq dataset Issue with background gene lists and orthogene Aug 16, 2022
@bschilder
Copy link
Collaborator

Ah, thanks for fixing this! Will implement directly in orthogene soon.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants