Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clinvar import #333

Open
wants to merge 13 commits into
base: master
Choose a base branch
from
Open

Clinvar import #333

wants to merge 13 commits into from

Conversation

mkroon1
Copy link
Collaborator

@mkroon1 mkroon1 commented Jan 18, 2018

Script to import variants from Clinvar's (gzipped) files.

  • Default source files point to Clinvar's FTP server.
  • Form let's one select user that serves as the owner of the newly imported variants.
  • Previously existing variant descriptions (from another submitter) are left untouched, the "clinvar variant" is imported as a separate entry.
  • Import consists of 3 phases:
    1. Import genomic variant descriptions
    2. Import descriptions on transcripts
    3. Import variant effects.
  • Script allows one to download list of unparsable variant descriptions after import.
  • Successive imports will not duplicate variants, instead variants are only updated when their description is changed.

* Removed code for debugging purposes.
* Prettify statistics output.
* Added adjustable parameter for user ID under which variants are
  imported.
* Improved code comments.
* After import, unparsable variants from Clinvar are stored in a file in
  the temp directory. The file is overwritten with every run of the
  script.
* The file can be downloaded for users with manager clearance.
* Loop over variant summary file.
* Link `clinical significance` field to LOVD's `var_effect`.
* Update objects that have a different value for effectid than in the
  database.
* Translate Clinvar's "clinical significance" to LOVD's
  "variant_effect", taking the most severe value.
* Store original "clinical significance" value in
  "VariantOnTranscript/Classification".
* Modify Clinvar file class to handle arbitrary compression factors.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant