Skip to content

SMD-Bioinformatics-Lund/Nisse

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Home for CMD Tomte post processing pipeline.

Running Nisse

Takes the CSV-samplesheet used to run Tomte and and path to the Tomte results folder as input.

Example run script which can be executed using sbatch:

#!/bin/bash
#SBATCH --job-name=nisse
#SBATCH --output=slurm_logs/%j.log
#SBATCH --ntasks=4
#SBATCH --mem=4gb
#SBATCH --time=7-00:00:00
#SBATCH --partition="grace-lowest"

module load Java/13.0.2
module load nextflow/24.04.3
module load singularity/3.2.0

export NXF_HOME="/local/nextflowconf/jakob/"
export NXF_PLUGINS_DIR="/local/nextflowconf/jakob/plugins/"

timestamp=$(date +"%y%m%d_%H%M")
outdir="/nisse/results/${timestamp}"
mkdir -p ${outdir}

csv="/path/to/samplesheet.csv"
tomte_results="/path/to/tomte_results"
work="/path/to/workdir"
main_nf="/base/nisse/main.nf"

nextflow run "${main_nf}" \
    -work-dir "${work}" \
    --tomte_results "${tomte_results}" \
    --input "${csv}" \
    --outdir ${outdir} | tee "${outdir}/run.log"

Map HGNC symbol to ID

A two-column file mapping HGNC symbol to ID is currently needed, as this isn't performed by Tomte but required by Scout (params.hgnc_map).

Steps to prepare:

  1. Download the latest tab hgnc_complete_set file from https://www.genenames.org/download/archive/#!/#tocAnchor-1-3
  2. Extract the symbol and ID columns, without header
tail -n +2 hgnc_complete_set.txt | head | cut -f1,2 | sed "s/HGNC://" > hgnc_id_symbol_map.tsv

About

Post-processing pipeline for Tomte

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published