Skip to content

chenzhr23/psiFinder_snakemake

Repository files navigation

psiFinder_snakemake

snakemake pipline for Ψ-sites Identification/Annotation/Target

psiFinder_snakemake is a snakemake pipline (back-end code) correspond to psiFinder quick start QT widget (front-end user interface).

quick_start

Contents

Pre-installation

psiFinder_snakemake requires snakemake/seqtk/cutadapt/STAR/bedtools/gtfToGenePred and several perl/python/R packages pre-installation and predominantly used in unix-based operating systems. Therefore, for the usability of psiFinder_snakemake, we recommend running all the tools and scripts in WSL2 (WSL2 installation guide: https://pureinfotech.com/install-windows-subsystem-linux-2-windows-10/) or unix-based system with perl, python and R.

Required perl modules:

cpan Getopt::Long;
cpan Bio::Perl;
cpan Bio::SeqIO;

Required python modules:

pip install pandas
pip install matplotlib
pip install numpy
pip install pickle
pip install scikit-learn
pip install argparse

Required R packages:

#use pacman to install packages in batch
install.packages("pacman")
library(pacman)

#load and install required R packages 
p_load("optparse","devtools","caTools","neuralnet","NeuralNetTools","dplyr","stringr","gridExtra","cowplot","pROC","mccr","ggplot2","ggpol","ggpubr","RColorBrewer","openxlsx","reshape2","factoextra","bedr","scales","e1071","tidyr")

Input data and required files

Test data and big files for the snakemake:

Usage

Set configuration

psiFinder_snakemake require a quickstart_config.yml file to get user option and start the running:

quickstart_config

Run psiFinder snakemake

bash run_psiFinder_snakemake.sh

Or use psiFinder quick start QT widget to interactively run psiFinder_snakemake:

quick_start

Notice: If Ψ-sites Target of psiFinder quick start QT widget is selected, psiFinder_snakemake will use human_hg38_snoRNABase_snoDB_rmRepeat.collapse.fa (ACAscan required in ./snakemake/script/target_prediction, predict H/ACA snoRNA-Ψ-target interaction, known H/ACA snoRNA sequence) and overall_multinomialnb_model.pkl (PUSscan required in ./snakemake/script/target_prediction, predict PUS-Ψ-target interaction, pre-built model specialized for TRUB1/PUS3/PUS1 target prediction [can be generated by psiMNB workflow: https://github.com/chenzhr23/psiMNB]).

About

snakemake pipline for psiFinder quick start QT widget

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published