Skip to content

Latest commit

 

History

History
36 lines (23 loc) · 2.25 KB

README.md

File metadata and controls

36 lines (23 loc) · 2.25 KB

Are search engines doing enough to curb piracy as they claim?

Sample Visualization

Motion picture piracy is a problem that has plagued the movie industry for years. A common form of technology is taking on a new role in helping the issue to persist right underneath movie studios’ noses. In this project we aim to understand how Google and Bing perform while curbing piracy related suggestion in their search enginee algorithm,

We focued on the top 10 most pirated movies data collected by TorrentFreak. The website contains an RSS feed with data that we used for our experiement. To help this story become even more newsworthy, we aim to link data showing how much movies are pirated with data that shows how much revenue is lost by creators due to piracy. In each respective search, we want to see which search terms relate to illegal piracy and which search terms relate to reviews.

It is possible that some of the algorithmic changes have affected the results seen in autosuggestions as well. In 2014, Google received millions of copyright complaints from content creators to remove certain search results. But we also believe that sense of urgency from a user’s perspective is also a determining factor which we hope to be able to experiment with during the execution of our project.

Movie_Auto_Suggestion_Project.py.ipynb contains the full code for the analysis.

To reproduce the result follow the below steps

  1. edit the file final/Torrentfreak_april_16.csv file
  2. update the file name in the code at cell 4,
  3. Update the output folder in the code at cell 2.
  4. Run all cells
  5. output can be viewed as plots in ipython.

Repeat the process for mutiple days or weeks and compare the graphs. Currently there is limit of 50+ graph per day for my account in plot.ly

Save the graph for comparison.

Software Required

Download and install Anaconda Bundle and install Jupyter Notebook to run the .ipynb file.

License


Copyright [2016] [Ramesh Balasekaran && Jessie Karangu]

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0