-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Problem creating seq files when running setup_clade_ap.py. #48
Comments
I'm having this same issue on Python 3.9.13. Have there been any updates? |
Solution: I figured it out, you have to make a file with the NCBI ids that you want to include if you're subsetting taxa, or it won't populate with any sequences (this is described in the 'Runs' doc). Don't know why I decided that wasn't relevant last time I looked at this... There is a helper script if you already have a file with all the names, but I just used a quick BioPython script to pull them and it's running now:
Just set your search terms to the subset you want, set retmax to at least the number of taxa, and put in a random email (not sure if this is required). |
Hi, I have the same problems! And I have provided the taxalist, still does work! Does anyone can help? Thanks! yang@bdchxy-PowerEdge-M630-VRTX:~$ python application/PyPHLAWD-master/src/setup_clade_ap.py -t Fagales -b /storage/phlawd_db_maker-master/DB/pln.db -s /storage/phlawd_db_maker-master/DB -o application/PyPHLAWD-master/examples/clustered/ -l application/PyPHLAWD-master/examples/clustered/ -f ncbi_sp_ids_938.txt STARTING PYPHLAWD (⌯꒪͒ ꌂ̇ ꒪͒) |
Hi and a happy new year, I'm experiencing the same issue, any help would be highly appreciated?! It would also be nice if the website (https://fephyfofum.github.io/PyPHLAWD/) could be updated as there is no more Cheers Bastian |
Hi bheimubu! Happy new year! Yingyya |
you make my day, it's working with the old version (downloaded as source code from here). Cheers Bastian |
Hi. I would just like to add that I was having the same trouble. If there is anything you figure out, please keep me updated. I also couldn't understand how to have the genus & sequence for this. If that is possible, please let me know. python3 setup_clade_ap.py -t Laurales -b /Users/administrator_ge/Desktop/pln.db -s /Users/administrator_ge/Desktop/seq -o /Users/administrator_ge/Desktop/output -l /Users/administrator_ge/Desktop/logfile.md.gz -f /Users/administrator_ge/Desktop/taxalist.txt STARTING PYPHLAWD ٩(⚙ȏ⚙)۶ |
Question Where is the -s parameter (SEQGZFOLDER) for setup_clade_ap.py meant to point?
Issue: I seem to be having a problem populating the gzip directory with sequences. The .table file is all populated from the ncbi db, but it's not finding the sequences. I'm not sure where the -s parameter is supposed to be pointing maybe? ~/ is where all the compressed ncbi files are from phlawd_db_maker.
Steps taken: Followed the steps on the Install page. Built phlawd_db_maker and all dependencies without errors. Built the database with phlawd_db_maker with no errors. Followed directions on the Runs page for a clustering analysis. Python version is 3.8.10
I know Python pretty well, so if I find a fix I'll make a pull request.
The text was updated successfully, but these errors were encountered: