Skip to content

Latest commit

 

History

History
executable file
·
11 lines (6 loc) · 893 Bytes

README.md

File metadata and controls

executable file
·
11 lines (6 loc) · 893 Bytes

Memology

Memes - why so popular?

My workshop talk at DataFest4 about parsing websites (including KnowYourMeme) can be found here.

The project

Memology.ipynb contains a short exploration of the dataset, with some graphs, statistics, etc., and, of course, text analysis and modelling. Based on the average views of the meme per day I have created 5 groups of "popularity" varying from "very unpopular" to "viral". To deal with the description texts I used TF-IDF transformation, which then passed to Logit regression and Random Forest. Overall, the quality of the models was quite satisfactory, achieving accuracy of 0.43 (with the naive constant baseline of 0.2)