Skip to content
@hydra-genetics

hydra-genetics

Hydra-Genetics

We are an organization/community with the goal of making snakemake pipeline development easier and faster, and a bit more structured.

We do this by providing snakemake modules that can be combined to create a complete analysis or included in already existing pipelines. All modules are subjected to extensive testing to make sure that new releases doesn't unexpectedly break existing pipeline or deviate from guidelines and best practices on how to write code.

Current tests:

  • pycodestyle: (--max-line-length 130)
  • snakefmt: (line-length 130)
  • snakemake --lint: (line-length 130)
  • snakemake dry-run
  • execution test with small dataset
  • execution: integration test witt complete dataset: planned to be implemented

The small execution tests will make sure that the pipeline actually can be executed, i.e run and generate data. The integration test with complete data set will also evaluate the generated result to make sure that results does not change, if it's not deliberately.

We also have a development document describing how a hydra-genetic pipeline/module should be structured, named and what a rule should contain.

ReadTheDocs

Please visit our ReadTheDocs for more information about using hydra-genetics.

Pipelines built using Hydra-Genetics

A list of pipelines built with Hydra-Genetics can be found at https://github.com/hydra-genetics/hydra-genetics-pipelines.

Repositories

The current repositories can be divided into the following sections.

Tools and examples

Command line interface to create new modules/pipelines or adding a new rule to a existing project. Provides libraries used to make it easier for people not used to pandas to extract information from samples and units dataframes, these dataframes are generated from units.tsv (schema definition) and samples.tsv (schema definition) files and used as input for hydra-genetics.

Collection of docker files with bioinformatic tools used to execute snakemake with singularity. All dockers are automatically uploaded to dockerhub (after being merged into master)

Example of configuration profiles for executing Snakemake in various computing environments

Environment

Repositories used to setup an environment for users using windows or OSX, or any other system that doesn't support singularity.

Build script for a vagrant machine that can be used by Window/OSX/Linux user to run snakemake in combination with singularity, will not work for ARM cpu systems.

Analysis modules

Snakemake module containing processing steps that are be performed during sequence alignment.

Collection of rules to annotate vcf files.

Collection of rules used to calculate biomarkers like MSI, TMB and HRD.

Collection of rules used to call structural variants

Rules used to compress files

Collection of variant filters

Collection of fusion callers

Module containing rules that are general and most likely will be used by multiple modules.

Snakemake module for performing Mitochondrial short variant discovery

Snakemake module containing an array of steps provided by the parabricks tookit

Snakemake module containing processing steps that are be performed before sequence alignment.

Collection of rules performing QC and generating reports.

Collection of rules used to create references, panel of normals (PoN), and background filters.

Sentieon tools

Collection of rules used to call snv and small indels

Popular repositories Loading

  1. hydra-genetics hydra-genetics Public

    Command line interface to create new modules/pipelines or adding a new rule to a existing project. Helper libraries for handling samples and units.

    Python 3 1

  2. alignment alignment Public

    Snakemake module containing processing steps that should be performed during sequence alignment.

    Python 3 2

  3. cnv_sv cnv_sv Public

    Collection of rules used to call structural variants

    Python 3 3

  4. mitochondrial mitochondrial Public

    Snakemake module for performing Mitochondrial short variant discovery

    Python 3

  5. docker docker Public

    Collection of dockerfiles with bioinformatic tools used to execute snakemake with singularity.

    Dockerfile 2 1

  6. references references Public

    Collection of rules used to create references, panel of normals (PoN), and background filters.

    Python 2 1

Repositories

Showing 10 of 24 repositories
  • filtering Public
    hydra-genetics/filtering’s past year of commit activity
    Python 0 GPL-3.0 0 6 3 Updated Jan 9, 2025
  • cnv_sv Public

    Collection of rules used to call structural variants

    hydra-genetics/cnv_sv’s past year of commit activity
    Python 3 GPL-3.0 3 11 (6 issues need help) 1 Updated Jan 9, 2025
  • biomarker Public

    Collection of rules used to calculate biomarkers like MSI, TMB and HRD.

    hydra-genetics/biomarker’s past year of commit activity
    Python 1 GPL-3.0 0 6 (1 issue needs help) 3 Updated Jan 9, 2025
  • prealignment Public

    Snakemake module containing processing steps that should be performed before sequence alignment.

    hydra-genetics/prealignment’s past year of commit activity
    Python 0 GPL-3.0 1 1 2 Updated Jan 9, 2025
  • snv_indels Public

    Collection of rules used to call snv and small indels.

    hydra-genetics/snv_indels’s past year of commit activity
    Python 0 GPL-3.0 2 9 2 Updated Jan 9, 2025
  • reports Public

    Snakemake module for generating reports related to Hydra-Genetics results

    hydra-genetics/reports’s past year of commit activity
    JavaScript 1 GPL-3.0 0 12 3 Updated Jan 9, 2025
  • misc Public
    hydra-genetics/misc’s past year of commit activity
    Python 0 GPL-3.0 1 0 3 Updated Jan 9, 2025
  • fusions Public

    fusions

    hydra-genetics/fusions’s past year of commit activity
    Python 0 GPL-3.0 0 6 3 Updated Jan 9, 2025
  • references Public

    Collection of rules used to create references, panel of normals (PoN), and background filters.

    hydra-genetics/references’s past year of commit activity
    Python 2 GPL-3.0 1 12 1 Updated Jan 9, 2025
  • parabricks Public

    Snakemake module containing different analyses provided by parabricks.

    hydra-genetics/parabricks’s past year of commit activity
    Python 0 GPL-3.0 1 0 1 Updated Jan 9, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…