Disease transcript does not match HGVS description #4424
Replies: 6 comments
-
Right, by the looks of it is the same as usual? 😊 This is really a question for the MIP team, but briefly, the consequence annotation is done using VEP with ENSMBL transcripts primarily. The old mapping between RefSeq and ENSEMBL for hg19 isn't super good and often maps multiple RefSeq transcripts on the same ENSEMBL one. This has largely been resolved by them for hg38 with the MANE annotations. Switching to hg38 will auto-solve most of these, and enable us to complain and correct the mappings for the remainder on the latest version. Luckily it is in a very common variant, and one that has ClinVar annotation to support your decision. But it is great that you are attentive - that will vigilance will be needed at least until hg38 for this particular issue. |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
And on a side-note, this perhaps also illustrates why it is difficult to maintain a local list of disease causing transcript rather than pooling resources with other labs: note that it is time to consider updating the transcript from NM_012188.4 to NM_012188.5, unless the new version does not conform to the experimental evidence? 😊 |
Beta Was this translation helpful? Give feedback.
-
Thanks for the answer! I was not aware of the issue of potential mismatching between refseq and ensambl. It makes more sense now. here is an example from the same case -> https://scout.scilifelab.se/cust003/21278/08e5e659b2cfde6675aae922801e51a2 I do not think highlighting them in the summary will help though. As you say it will probably be more confusing. Another solution could be to add the information from the disease associated transcript from the transcript table in the bottom of the variant page of to its table, see below. What do you think? |
Beta Was this translation helpful? Give feedback.
-
I do like that in general, but unfortunately it would not work in this case. Maybe we could (also?) add a warning to it, noting that we don't have annotations for one or more of the "disease associated transctipts". In a better world all would perhaps instead use the MANE Select version; NM_000352.6. It seems to have been out for quite some time, so perhaps it is simply not mapping well to hg19 - the release comment says the new version was created "from transcript and genomic sequence data to make the sequence consistent with the reference genome assembly" (https://www.ncbi.nlm.nih.gov/nuccore/NM_000352). But as it stands, if we would show data from the transcript tab on Disease associated transcripts, we would not be able to pick it exactly here, since that one is (manually or historically?) given as NM_000352.4. It is not quite simple. Most of the time, it will be fine, but at some level one perhaps has to get to know ones genes. This appears to be a tricky one with several active isoforms, and a couple of them of exactly the same size though slightly different AA sequence. I'll trust you if you say NM_000352.5:p.Leu1430Phe is published, but one has to be very careful to see that the publication coordinates was not for NM_000352.4, NM_000352.6 or NM_001287174.2. My best current advice would actually be that we focus attention on switching to hg38, and revisit this and similar discussions afterward, since at least some of the interlocking issues will be ironed out by that. Sound ok? |
Beta Was this translation helpful? Give feedback.
-
Sounds good, just one last question. Could it be easier to do as suggested if the version for the transcript is removed from the disease transcripts for our panels? (e.g. NM_000352 instead of NM_000352.5)? We always should go through the publications anywasy to ensure we are not tricked by older transcript versions when we find these variants :) Thanks for the answers and clarification! longing for hg38 here |
Beta Was this translation helpful? Give feedback.
-
Just looked at a variant in scout and discovered a potential bug. Below are the disease associated trancript for FOXI1 and Refseq transcript. According to it the variant is a deep intronic substitution in the disease associated transcript.
However, in the list of transcripts below it is a missense in the same transcript, and looking in UCSC and other dbs this is the correct version, not the above, which is a bit worrying.
The variant is on https://scout.scilifelab.se/cust003/21285/d0b7fc56c3626d80d1bbdafe1a178c2c
Beta Was this translation helpful? Give feedback.
All reactions