Lab notebook the following research:
- A pan-genome-based machine learning approach for predicting antimicrobial resistance activities of the Escherichia coli strains. HL Her, YW Wu. Bioinformatics 34 (13), i89-i95(2018)
- PangenomeNet: A pan-genome-based functional network provide mechanistic view on Meropenem Resistome for Escherichia coli strains. Hsuan-Lin Her, Po-Ting Lin, and Yu-Wei Wu. (In preperation)
follow the number preceding the scripts to reproduce pangenomeNet work. You will need to install many tools. We are working on refactoring the pipeline into snakemake.
- in Drug folder, there are codes for TMACC, CHEBI, all kinds of drug database and chemical descriptor
- in Genome folder, codes are about pan-genome construction,
annotate_parser
contain functions to parse annotations from CARD, BLAST, COG, HMMer, pFAM and REsfamcontext
folder contains functions and scripts to construct co-inheritance networkgenome
folder contains scripts to download genome from PATRIC, and calculate genome statisticsgoldstandard_pair
folder contains scripts to validate networks, generate gold-standard gene pairs.pangenome_annotation
folder contains scripts to annotation gene clusters from the pan-genomepangenome_build
contains scripts used to call CD-hitpangenome_intrinsic_info
contains scripts to calculate pan-genome statisticsstring
contains parsing EcoliNet, Mentha and STRING network
- in MIC folder, codes are mainly related to CLSI re-annotation for resistance phenotypes
- in Model folder, codes are all about training svm and nb models
compare_scoary
contains code that compares [1] to Scoary.- the Visulazation folder contains all sorts of jupyter notebook, mainly used to plot
cleanData.py
selects data that fits our need (with MIC instead of disk diffusion, the right species ...etc)- in network_analysis contain codes for powerlaw regression