Skip to content

Uses a variety of machine learning and NLP techniques for extracting, normalizing, and attempting to classify fake news articles.

Notifications You must be signed in to change notification settings

mvbrock/fakenews

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

FakeNews command-line app and Python library

FakeNews is a Python command-line application and library for building an ensembled set of classifiers to detect fake news articles. This is a work in progress, based on basic understandings of classification problems in relation to NLP. A first approach will attempt to create a feature space using n-grams from extracted text, select a feature subset via TF-IDF, and build a classifier using random forest.

Installing fakenews

  • Install apt packages: sudo apt-get install python3-dev openjdk-7-jre libxml2-dev libxslt1-dev libssl-dev libffi-dev
  • Create virtualenv: virtualenv --python=python3.4 .venv
  • Activate virtualenv: . .venv/bin/activate
  • Install pip requirements: pip install -r requirements.txt
  • Install python-boilerpipe: https://github.com/misja/python-boilerpipe

Misc Research

Detecting fake news

The following is a list of research papers on fake news detection:

Web scraping

Text extraction

The following is a list of text extraction library evaluations:

About

Uses a variety of machine learning and NLP techniques for extracting, normalizing, and attempting to classify fake news articles.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages