-
Notifications
You must be signed in to change notification settings - Fork 242
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Duplicate ALT allele in VCF causes exception #1718
Comments
hum , I don't think so. How would you know which ALT allele is associated to a genotype ? |
the INFO fields that are related to the ALT are required to be comma separated in the same order as the ALT's I cannot find any mention that ALT should be unique in VCF spec 4.2 |
@bartcharbon Interesting. I believe the restriction makes sense with non symbolic alleles, even if it's not in the spec specifically. It looks like we should relax it for structural variants though. |
Description of the issue:
When reading a VCF with HTSJDK it causes an exception when there is a duplicate ALT allele.
I believe duplicate ALT alleles are allowed in VCF, and are used in cases when there are multiple symbolic alleles at a certain position.
e.g. CNV:TR,CNV:TR
the exception that occurs:
The provided VCF file is malformed at approximately line number 530: Duplicate allele added to VariantContext: <CNV:TR>, for input source:input.vcf.gz
Both
BcfTools
andEBIvariation/vcf-validator/
read/validate the lines with the duplicate ALT without problems.Your environment:
Expected behaviour
The file should not cause an exception
Actual behaviour
The file should does cause an exception
The text was updated successfully, but these errors were encountered: