ML-Text-Classification

Machine Learning Algorithm Toolbox

Proposed Directory Structure

data
|--- classif_data.csv/tsv/txt
src
|--- DataLoader.py
|--- Models.py
|--- Trainer.py
|--- Inference.py
|--- FeatureExtractor.py
main.py

Central to any ML system are three key things:

A simple pipeline of any ML project can be defined as:

Prepare your data - split them into train and test sets. We'll do this using DataLoader.py
Represent your data - extract features or embed your data, can also be considered the pre-processing step. We'll do this usinf Extractor.py
Train the model. Will be done in Trainer.py
Predict using the model. Will be done using Inference.py

main.py will be a high-level wrapper to call different classes at a single place.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
data		data
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py