Skip to content

slickFix/PySpark_ml

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PySpark_ml

This repository includes implementation of following:-

  • Spark DataFrames

    • Spark DataFrame Basics
    • Spark DataFrame Operations
    • Groupby and Aggregate Functions
    • Missing Data
    • Dates and Timestamps
  • Linear Regression

    • Linear Regression with PySpark Example (Car Data)
  • Logistic Regression

    • Logistic Regression Example
  • Tree Methods

    • Decision Tree and Random Forest Example
  • Clustering

    • Clustering Example - Iris Dataset
  • Recommender System

    • Recommender Systems and Collaborative Filtering
  • Natural Language Processing

    • Introduction to NLP and Naive Bayes Model with examples
    • NLP pipelines
  • Spark Streaming

    • Introduction to Spark Streaming

About

Basic implementation of ml algos using pySpark

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published