Skip to content

STARVar:Symptom based Tool for Automatic Ranking of Variants using evidence from literature and genomes

Notifications You must be signed in to change notification settings

bio-ontology-research-group/STARVar

Repository files navigation

STARVar

STARVar:Symptom based Tool for Automatic Ranking of Variants using evidence from literature and genomes

How to use STARVar?

To run STARVar, please follow the steps below:

  1. Download the required input files from https://exrcsdrive.kaust.edu.sa/exrcsdrive/index.php/s/CobjdzXWoWmJbYs . The password for accessing the data is : 12345

  2. An input VCF file containing variants first should be filtered for the candiate variants based on Allele Frequency and/or mode of inheritance. (e.g. synthetic_ALDOB.maf001.AR.vcf) We suggest users to use one of the efficient filtering tools, e.g. slivar for the filtering step. Slivar is available from https://github.com/brentp/slivar

  3. The filtered VCF file must be annotated by using the Variant Effect Predictor (VEP) tool with the required information (see the VEP_AnnotFields file for the full set of annotation fields by VEP) for analyzing data efficiently. Follow the steps described at https://github.com/bio-ontology-research-group/pavs_annotation to install and run the VEP as CWL locally.

  4. run STARVar as follows: format: python STARVar.py input.vcf option symptoms

option: STARVar combines scores from the literature based model and genomics evidence scores from either PolyPhen-2, SIFT or Variant Consequence from VEP for variant ranking. Please choose one of them. Use s for SIFT or use p for PolyPhen-2 or use v for Variant Consequence.

symptoms: should be provided as a semi-colon separated single list of symptoms either in free text format or as HPO IDs. For example, "HP:XXXXXX;brain atrophy". Please dont forget to use quotation (") for symptoms.

Example: python STARVar.py gpcards_ALDOB.vep.vcf v "Hepatomegaly;Liver dysfunction;Developmental delay;abnormal transferrin"

About

STARVar:Symptom based Tool for Automatic Ranking of Variants using evidence from literature and genomes

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages