Python3 wrapper for parallelized gene prediction using Prodigal

sjaenick/pprodigal

PProdigal: Parallelized gene prediction based on Prodigal.

PProdigal is a small wrapper around the prodigal gene prediction program.
It splits the input into chunks and processes them in parallel, since
prodigal itself does not support multithreading. The wrapper accepts all
command line parameters supported by prodigal, plus two additional
parameters that control the parallelization:

  -T TASKS, --tasks TASKS
                        number of prodigal processes to start in parallel (default: 20)
  -C CHUNKSIZE, --chunksize CHUNKSIZE
                        number of input sequences to process within a chunk (default: 2000)
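
As a rough illustration of how the two extra options combine with ordinary
prodigal arguments, the following Python sketch assembles a pprodigal command
line. The helper name build_pprodigal_cmd and the file names are hypothetical;
-i, -a and -p are regular prodigal options, and the sketch assumes that the
pprodigal executable is available on the PATH.

```python
import shlex


def build_pprodigal_cmd(infile, proteins, tasks=20, chunksize=2000, extra=()):
    """Return an argument list for a pprodigal run (hypothetical helper).

    tasks and chunksize map to the wrapper's -T and -C options; the
    defaults mirror the ones documented above.
    """
    cmd = [
        "pprodigal",
        "-i", infile,          # prodigal: input FASTA file
        "-a", proteins,        # prodigal: protein translations output
        "-T", str(tasks),      # pprodigal: number of parallel processes
        "-C", str(chunksize),  # pprodigal: sequences per chunk
    ]
    cmd.extend(extra)          # any further prodigal options, passed through
    return cmd


if __name__ == "__main__":
    # File names are placeholders for illustration only.
    cmd = build_pprodigal_cmd("contigs.fna", "proteins.faa",
                              tasks=8, chunksize=5000, extra=["-p", "meta"])
    print(shlex.join(cmd))
```

To actually start the run, the resulting list could be passed to
subprocess.run(cmd, check=True).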

Because prodigal trains itself on the sequences it is given, each chunk
should be large enough to provide adequate training data; chunks that are
too small can lead to suboptimal gene predictions.
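
The general idea of splitting the input into chunks and processing them
concurrently can be sketched in Python as follows. This is a minimal
illustration, not necessarily how pprodigal is implemented: the function names
(read_fasta, chunked, process_in_parallel) are hypothetical, and the worker is
a stand-in that only summarizes a chunk instead of launching prodigal. A
thread pool is used here on the assumption that a real worker would mostly
wait on an external prodigal process.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import islice


def read_fasta(lines):
    """Yield (header, sequence) records from an iterable of FASTA lines."""
    header, seq = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(seq)
            header, seq = line[1:], []
        elif header is not None:
            seq.append(line)
    if header is not None:
        yield header, "".join(seq)


def chunked(records, chunksize):
    """Group records into lists of at most `chunksize` items."""
    it = iter(records)
    while True:
        chunk = list(islice(it, chunksize))
        if not chunk:
            return
        yield chunk


def process_in_parallel(records, worker, tasks=20, chunksize=2000):
    """Apply `worker` to every chunk with up to `tasks` concurrent workers.

    Results are returned in input order, so per-chunk output can be
    concatenated afterwards.
    """
    with ThreadPoolExecutor(max_workers=tasks) as pool:
        return list(pool.map(worker, chunked(records, chunksize)))


if __name__ == "__main__":
    fasta = [">s1", "ATGC", ">s2", "GGCC", "AATT", ">s3", "TTAA"]

    def summarize(chunk):
        # Stand-in worker: (number of sequences, total bases) per chunk.
        return len(chunk), sum(len(seq) for _, seq in chunk)

    print(process_in_parallel(read_fasta(fasta), summarize, tasks=2, chunksize=2))
    # prints [(2, 12), (1, 4)]
```

With a chunk size of 2, the three example sequences end up in two chunks; the
last chunk is smaller than the others, which is one reason to keep the chunk
size well above the minimum prodigal needs for training.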

Availability:
 - PyPI: https://pypi.org/project/pprodigal/
   $ pip install pprodigal
 - Bioconda: https://github.com/bioconda/bioconda-recipes/tree/master/recipes/pprodigal
   $ conda install -c conda-forge -c bioconda -c defaults pprodigal

License: MIT