Select AutoPVS1 transcript as final variant annotation #145
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Purpose/implementation Section
What feature is being added or bug is being addressed?
Closes #144. This PR modifies autogvp and annotation filtering scripts to retain autopvs1 transcript annotation as final outputted annotation for each variant.
What was your approach?
01-annotate_variants_CAVATICA_input.R
andannotate_variants_custom_input.R
to retain autopvs1Feature
column.04-filter_gene_annotations.R
to filter annotated vcf for onlyvcf_id
-Feature
pairs in autogvp output.What GitHub issue does your pull request address?
#144
Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.
Which areas should receive a particularly close look?
Please run shell script on both pbta and custom test files
Is there anything that you want to discuss further?
There are rare instances (I believe only in the custom test files) in which variants are annotated as intergenic by VEP, but have transcript annotation by AutoPVS1. This results in
NA
annotation columns for these variants in the final output, since the AutoPVS1 transcript is not found in the VEP vcf file. I will plan to run this on larger data sets to determine if this only happens with intergenic variants, in which case we can annotate them as such in the final output.Documentation Checklist