-
Notifications
You must be signed in to change notification settings - Fork 10
dbscSNV example
Brent Pedersen edited this page May 23, 2022
·
1 revision
This example was contributed by Somak Roy. It creates an echtvar zip file from the dbscsnv zip file:
download from: https://usf.app.box.com/s/ffwlywsat3q5ijypvunno3rg6steqfs8
filename should be: dbscSNV1.1.zip
Download and execute the script dbscsnv.py to convert the zip file to a VCF file containing the ADA and RF scores for hg38 (default)
python dbscsnv.py dbscSNV1.1.zip | bgzip -c > dbscSNV-1.1.vcf.gz
The script can also be used to pull the hg19 database as well by specifying the --genome hg19
option. The script also gets rid of variants mapped to the random and alternative assembly contigs.
This file tells echtvar which fields to encode and how to label them. The following contents are a good start:
[
{
"field": "ADA_SCORE",
"alias": "dbscsnv_ADA_SCORE",
"multiplier": 1000000
},
{
"field": "RF_SCORE",
"alias": "dbscsnv_RF_SCORE",
"multiplier": 1000000
}
]
This creates dbscsnv_ev.echtvar.zip
which can be use by echtvar anno
echtvar encode dbscsnv_ev.echtvar.zip dbscsnv.json dbscSNV-1.1.vcf.gz