The goal of this project is to accomplish the following:
1. A PHP script will be run by cron on the server and will fetch the following website:
http://www.videoeta.com/
It will scrape the movie titles that are under ‘New DVD Releases for’
(there are about 10 to 20 titles)
It will store these titles in a text file (moviedb.txt), which will be used in step 2.
2. The script will search for each title individually on IMDB.com and each time it will go to the first result.
- On this page, it will scrape the IMDB rating of the movie, as well as the number of votes. It will also scrape the value shown after "Metascore:" and its number of reviews.
- It will also scrape the URL to the movie poster image on the left.
This info will be placed into moviedb.txt as well.
3. The script will go to http://www.rottentomatoes.com/ and search for each title individually. Each time, it will go to the first result and scrape the Average Rating: as well as Reviews Counted:
This info will be placed in moviedb.txt
4. The info in moviedb.txt will be used to generate a simple HTML page (goodnewmovies.html) that looks like this:
http://i56.tinypic.com/2q07lhw.jpg
Note that this is just an image. The HTML only needs to be a simple table that looks like it.
On the left is the movie poster, loaded from the poster URL stored in moviedb.txt.
On the right are the Rotten Tomatoes Average Rating (multiplied by 10) and its number of reviews, the IMDB rating (multiplied by 10) and its number of votes, and the Metacritic score and its number of reviews.
At the bottom is the Total, which is the sum of all 3 scores.
5. There is a file called config.php and in it we have some parameters we can set.
Number Of Movies To Display: X
^^ Sets how many recent movies to display on goodnewmovies.html. If we set it to 10, for example, it will show the last 10 movies in moviedb.txt.
Minimum Total: X
^^ Only display the movie in goodnewmovies.html if the Total is X or greater.
If No Metacritic: X
^^ If there is no Metacritic score, or it is 0, then only display the movie in goodnewmovies.html if the Total is X or higher.
Minimum Rotten Tomatoes Reviews: X
^^ Only display the movie in goodnewmovies.html if it has at least X Rotten Tomatoes reviews.
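As a rough illustration of how the config.php parameters above could drive the filtering, here is a minimal sketch. The array keys, the movie record fields (rt_rating, imdb_rating, metascore, rt_reviews), the function names, and the threshold values are all hypothetical placeholders; this brief does not define them. The sketch also assumes that the "If No Metacritic" value replaces "Minimum Total" when the Metacritic score is missing or 0, and that the last N movies are picked before the thresholds are applied.

```php
<?php
// Hypothetical config.php contents. Key names and values are placeholders.
$config = [
    'number_of_movies'   => 10,   // Number Of Movies To Display
    'minimum_total'      => 200,  // Minimum Total
    'if_no_metacritic'   => 140,  // If No Metacritic
    'minimum_rt_reviews' => 20,   // Minimum Rotten Tomatoes Reviews
];

// Total = (RT average rating x 10) + (IMDB rating x 10) + Metacritic score.
// A missing Metacritic score counts as 0.
function movieTotal(array $movie): float
{
    return $movie['rt_rating'] * 10
        + $movie['imdb_rating'] * 10
        + ($movie['metascore'] ?? 0);
}

// Decide whether a single movie passes the configured thresholds.
function shouldDisplay(array $movie, array $config): bool
{
    if ($movie['rt_reviews'] < $config['minimum_rt_reviews']) {
        return false;
    }
    // Assumption: with no Metacritic score (missing or 0), the
    // "If No Metacritic" threshold is used instead of "Minimum Total".
    $threshold = empty($movie['metascore'])
        ? $config['if_no_metacritic']
        : $config['minimum_total'];

    return movieTotal($movie) >= $threshold;
}

// Assumption: take the last N entries first, then filter them.
function selectMovies(array $movies, array $config): array
{
    if ($config['number_of_movies'] <= 0) {
        return [];
    }
    $recent = array_slice($movies, -$config['number_of_movies']);

    return array_values(
        array_filter($recent, fn(array $m): bool => shouldDisplay($m, $config))
    );
}
```

If the intent is instead to show N movies that pass the thresholds, the slice would move after the filter. That choice is worth confirming before implementation.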
The PHP script will be designed to be executed once per day to check the latest movies on VideoETA.
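For step 1, one possible way to pull the titles out of the fetched page is PHP's DOM extension. This is only a sketch: the XPath expression and the sample markup below are made up for illustration, because the real VideoETA markup would need to be inspected first. The function names are hypothetical, and writing one title per line to moviedb.txt is an assumption, since this brief does not fix the file format.

```php
<?php
// Return the trimmed, non-empty, de-duplicated text of all nodes
// matching the given XPath query.
function extractTitles(string $html, string $xpathQuery): array
{
    if (trim($html) === '') {
        return [];
    }

    $doc = new DOMDocument();
    // Real-world HTML is rarely well-formed, so collect parser
    // warnings internally instead of printing them.
    $previous = libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($doc);
    $nodes = $xpath->query($xpathQuery);
    if ($nodes === false) {
        return []; // the XPath expression was invalid
    }

    $titles = [];
    foreach ($nodes as $node) {
        $title = trim($node->textContent);
        if ($title !== '' && !in_array($title, $titles, true)) {
            $titles[] = $title;
        }
    }

    return $titles;
}

// Append titles to the given file, one per line (assumed format).
function appendTitles(string $file, array $titles): void
{
    if ($titles === []) {
        return;
    }
    file_put_contents($file, implode("\n", $titles) . "\n", FILE_APPEND | LOCK_EX);
}

// Made-up markup standing in for the real page.
$sampleHtml = '<div id="new-releases">'
    . '<a class="title">Example Movie One</a>'
    . '<a class="title"> Example Movie Two </a>'
    . '<a class="title">Example Movie One</a>'
    . '</div>';

// Hypothetical XPath; the real one depends on the site's markup.
$titles = extractTitles($sampleHtml, '//div[@id="new-releases"]//a[@class="title"]');
```

Keeping the XPath expression as a parameter means a change in the site's layout only requires updating one string, not the parsing code.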