Gene lists aren't filtered against the reference scRNA-seq dataset. This caused an error in a recent run, where only one background gene was left after filtering to the SCT dataset.
Console output
Error in apply(temp, 2, sum) : dim(X) must have a positive length
Calls: <Anonymous> ... <Anonymous> -> lapply -> FUN -> cell_list_dist -> apply
Expected behaviour
I would expect gene lists to be filtered by the SCT dataset early on, to avoid downstream issues like this. For example, background gene lists are generated to match the gene length of the hits list. If any of these genes are removed after that step, it will massively affect the gene length control.
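To illustrate the behaviour I have in mind, here is a minimal sketch of filtering both lists against the reference dataset up front and failing early if too few background genes survive. It is written in Python purely for illustration and is not the package's code: the function name `filter_gene_lists`, its arguments, and the `min_background` threshold are all hypothetical.

```python
def filter_gene_lists(hits, background, reference_genes, min_background=2):
    """Keep only genes present in the reference dataset (hypothetical helper).

    Order is preserved. An error is raised if fewer than `min_background`
    background genes remain, so the problem surfaces before any statistics run.
    """
    reference = set(reference_genes)
    kept_hits = [g for g in hits if g in reference]
    kept_background = [g for g in background if g in reference]

    if len(kept_background) < min_background:
        raise ValueError(
            f"Only {len(kept_background)} background gene(s) remain after "
            "filtering to the reference dataset; check the input lists."
        )
    return kept_hits, kept_background


# Toy example with made-up gene symbols
hits, background = filter_gene_lists(
    hits=["A", "B"],
    background=["A", "B", "C", "D"],
    reference_genes=["A", "C", "D"],
)
```

Filtering before any length-matched background is generated would mean the gene length control is computed only on genes that are actually used downstream.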
Al-Murphy
changed the title
Gene lists aren't filtered based on SCT dataset
Gene lists aren't filtered based on reference scRNA-seq dataset
Aug 16, 2022
I was wrong: the issue actually only occurs when the user passes a background list and the reference dataset species is the same as the gene list species. The issue is with orthogene and is described here: neurogenomics/orthogene#22
In short, orthogene has been replacing the background gene list with one generated from all known genes, leading to the issues described above. I have made a fix in version 1.5.7 and have made a note for orthogene to have this issue fixed at source.
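A sanity check along the following lines could catch this kind of silent replacement. This is only a sketch in Python for illustration: it is not the fix made in 1.5.7 and not part of orthogene's API, and the function name `check_background_preserved` and its arguments are hypothetical.

```python
def check_background_preserved(user_background, returned_background):
    """Raise if the returned background has genes the user never supplied.

    Hypothetical helper: `returned_background` stands in for whatever list
    comes back from a gene-mapping step.
    """
    user = set(user_background)
    unexpected = [g for g in returned_background if g not in user]
    if unexpected:
        raise ValueError(
            f"{len(unexpected)} background gene(s) were not in the "
            "user-supplied list; the background may have been replaced."
        )
    return list(returned_background)


# Toy example: the mapping step dropped one gene but added none
checked = check_background_preserved(["A", "B", "C"], ["A", "C"])
```

Genes being dropped by the mapping step is allowed here; only genes appearing from outside the user's list are treated as an error.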
Al-Murphy
changed the title
Gene lists aren't filtered based on reference scRNA-seq dataset
Issue with background gene lists and orthogene
Aug 16, 2022