Skip to content

Depository contains script for the plant DNA barcoding using ML

Notifications You must be signed in to change notification settings

BioInf2305/ML-Barcoding

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

ML-Barcoding Depository contains scripts used for sequence identification using Machine learnign approach as implemented in WEKA. In brief, these python scripts generate the input files for WEKA machine learning tool.

1). Mismatch kernal usage:python3 kmerMismatchV4.py <input_file> <value of L> <value of K> <value of M>. This will generate two files: (i) Similarity matrix text file (ii) .csv file with leaf kmers. The .csv file can be used directly in WEKA machine learning software to test data against various available classifiers.

2). Gappy kernel usage:python3 kmerGappyV4.py <input_file> <value of L> <value of K> <value of M>. This will generate .csv file of sparse matrix, which contain kmer frequencies. This .csv file can also be used with WEKA GUI interface.

For any issues contact: [email protected] or [email protected].

About

Depository contains script for the plant DNA barcoding using ML

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages