You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
chrom start end ref alt variant_id gene aaf gts.REAL_SAMPLE_ID
Traceback (most recent call last):
File "/bcbio/bin/gemini", line 7, in <module>
gemini_main.main()
File "/bcbio/anaconda/envs/python2/lib/python2.7/site-packages/gemini/gemini_main.py", line 1249, in main
args.func(parser, args)
File "/bcbio/anaconda/envs/python2/lib/python2.7/site-packages/gemini/gemini_main.py", line 439, in query_fn
gemini_query.query(parser, args)
File "/bcbio/anaconda/envs/python2/lib/python2.7/site-packages/gemini/gemini_query.py", line 169, in query
run_query(args)
File "/bcbio/anaconda/envs/python2/lib/python2.7/site-packages/gemini/gemini_query.py", line 141, in run_query
for row in gq:
File "/bcbio/anaconda/envs/python2/lib/python2.7/site-packages/gemini/GeminiQuery.py", line 771, in next
val = row[source][idx]
File "/bcbio/anaconda/envs/python2/lib/python2.7/site-packages/gemini/GeminiQuery.py", line 449, in __getitem__
self.cache[key] = self.unpack(self.row[key])
File "/bcbio/anaconda/envs/python2/lib/python2.7/site-packages/gemini/compression.py", line 94, in snappy_unpack_blob
dt = lookup[blob[0]]
KeyError: 'U'
I tried to install a fresh vcf2db.py in python3 environment with:
git clone [email protected]:quinlan-lab/vcf2db.git
# with python3.10 I had a conflict with cyvcf2 with a downgrade list, so I used python3.9
conda create -n vcf2db_python3 python=3.9
conda install -n vcf2db_python3 -c conda-forge gcc snappy
conda install -n vcf2db_python3 -c conda-forge python-snappy
conda install -n vcf2db_python3 -c bioconda cyvcf2 peddy
conda activate vcf2db_python3
which pip
cd vcf2db
pip install -r requirements.txt
This received htslib1.13, python-snappy=0.6.0,snappy=1.1.8
When I create a database with python vcf2db.py from vcf2db_python3 env and trying to read with gemini I am getting the same error.
I tried also to make python-snappy versions the same in python2 gemini environment and in vc2db_python3 by
Hi @brentp !
Thanks so much for supporting half of the ecosystem just by yourself!
In an old project I suddenly can't get a pair of vcf2db.py - gemini working with gts fields compression.
I create a db as usually with:
The cohort is large ~ 1000 samples.
then I query it with:
gemini \ query \ --header \ -q "select chrom, start, end, ref, alt, variant_id, gene, aaf, gts.REAL_SAMPLE_ID from variants limit 100" cohort.db
and I'm getting:
Both vcf2db.py and gemini were installed via bioconda in bcbio, python2 environment (worked just well untill recently).
https://github.com/chapmanb/cloudbiolinux/blob/master/contrib/flavor/ngs_pipeline_minimal/packages-conda.yaml#L350
I tried to install a fresh vcf2db.py in python3 environment with:
This received htslib1.13, python-snappy=0.6.0,snappy=1.1.8
When I create a database with
python vcf2db.py
from vcf2db_python3 env and trying to read with gemini I am getting the same error.I tried also to make python-snappy versions the same in python2 gemini environment and in vc2db_python3 by
still the same snappy unpack blob error.
Let me know if you have any ideas - I am ready to put in efforts to resolve this.
Sergey
The text was updated successfully, but these errors were encountered: