With R Shiny, a web application framework, users can explore the movies dataset by language, genre, minimum number of reviews on Rotten Tomatoes, release year, and Oscar wins.
The movies dataset includes the following information:
X-axis variable: Year; Y-axis variable: Dollars at box office
In the top right of the plot, in 2008, we find The Dark Knight, which has the most impressive record: a box office of $533,300,000 and a high numeric rating of 8.5. However, plotting rating against box office shows that the two variables do not always move in the same direction. The King’s Speech also has a numeric rating of 8.5 but a box office of only $138,800,000. Life of Pi, the blockbuster of 2012, has a numeric rating of 8 but took only $125,000,000 at the box office, compared with The Dark Knight's $533,300,000.
If we use the filters to select the 12 movies with a box office over 400 million dollars, we find that most of them are action and adventure films. It is also striking that sequels often matched or beat the record of their predecessors. Examples include Iron Man 3, Shrek 2, The Dark Knight 1, The Dark Knight 2, and Toy Story 3.
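The same kind of filter can be sketched outside the app in a few lines of base R. This is only an illustration: the data frame below holds just the box office figures quoted in this post, not the full dataset, and the names `quoted_movies`, `box_office`, and `blockbusters` are made up for the example.

```r
# Box office figures quoted in this post (US dollars); not the full dataset
quoted_movies <- data.frame(
  title = c("The Dark Knight", "The King's Speech", "Life of Pi",
            "Shrek 2", "Toy Story 3", "Black Swan", "The Lion King"),
  box_office = c(533300000, 138800000, 125000000,
                 436500000, 415000000, 107000000, 94200000),
  stringsAsFactors = FALSE
)

# Keep only the movies above the 400 million dollar threshold
blockbusters <- quoted_movies[quoted_movies$box_office > 4e8, ]
print(blockbusters$title)
```

Of the seven titles quoted here, only The Dark Knight, Shrek 2, and Toy Story 3 pass the threshold; the app applies the same idea to the whole dataset through its box office slider.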
If we set the X variable to year and the Y variable to dollars at the box office, we can see that the film market grew rapidly after 2000. The Lion King (1994) was the first animated film with a box office close to $94,200,000, and it opened up the animation and adventure film market.
After battling a fire-breathing dragon and the evil Lord Farquaad to win the hand of Princess Fiona, Shrek now faces his greatest challenge: the in-laws. Shrek and Princess Fiona return from their honeymoon to find an invitation to visit Fiona’s parents. Shrek 2 ranked first among 2004 movies at the worldwide box office, with a box office of $436,500,000, about 4.6 times that of The Lion King, and $937,008,132 in revenue. Frozen and Toy Story 3 (2010, $415,000,000) were also well received by the film market as animation and adventure films. (Black Swan, a 2010 drama and mystery, had a box office of $107,000,000.)
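The comparison between Shrek 2 and The Lion King can be checked directly from the two box office figures quoted in this post. The variable names are only illustrative.

```r
# Box office figures as quoted in this post (US dollars)
shrek2_box_office <- 436500000
lion_king_box_office <- 94200000

# How many times larger is the Shrek 2 figure?
ratio <- shrek2_box_office / lion_king_box_office
print(round(ratio, 1))  # 4.6
```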
It seems that fairy tales are not only for children: adults also enjoy funny, laid-back animation, and many grown-ups are fond of animated movies. Perhaps watching one brings back nostalgia and memories of their own childhood.
Conclusion: Data visualization is not an end goal; it is a process. Visualization helps users absorb the data and see new paths. It lets them identify patterns and trends, such as the relationship between numeric rating and dollars at the box office. Even large amounts of complicated data start to make sense when presented graphically, and they reveal insights into the film market. Identifying those relationships helps organizations focus on the areas most likely to influence their most important goals.
How to visualize your data: R programming
In this project, I used an R Shiny application to clean the data and explore the movies dataset. Shiny applications have two components: a user-interface definition and a server script. Both components are described below.
The user interface is defined in a source file named ui.R. I used some HTML/CSS to make the interface more attractive, for example setting font-family: ‘Lobster’ and changing the color to light green (color: #48ca3b).
Next, I created two select inputs (Language and Genre) and five slider inputs (genre, reviews, year, Oscars, box office), and used the ggvisOutput function to show the plot.
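A minimal sketch of what such a ui.R could look like is given below. It is not the original source: the input IDs, labels, choices, slider ranges, the page title, and the plot ID "plot1" are all assumptions made for illustration, and only some of the sliders are shown. It also assumes the Lobster font is loaded elsewhere, since the CSS rule only names it.

```r
library(shiny)
library(ggvis)

ui <- fluidPage(
  # Custom heading style: Lobster font in light green (#48ca3b).
  # Assumes the Lobster font itself is made available separately.
  tags$head(tags$style(HTML(
    "h1 { font-family: 'Lobster', cursive; color: #48ca3b; }"
  ))),
  h1("Movie explorer"),  # hypothetical title
  sidebarLayout(
    sidebarPanel(
      # Two select inputs (choices are placeholders)
      selectInput("language", "Language", choices = c("All", "English")),
      selectInput("genre", "Genre", choices = c("All", "Animation", "Drama")),
      # Slider inputs (ranges and defaults are placeholders)
      sliderInput("reviews", "Minimum number of reviews on Rotten Tomatoes",
                  min = 10, max = 300, value = 80, step = 10),
      sliderInput("year", "Year released",
                  min = 1940, max = 2014, value = c(1970, 2014), sep = ""),
      sliderInput("oscars", "Minimum number of Oscar wins",
                  min = 0, max = 4, value = 0, step = 1),
      sliderInput("boxoffice", "Dollars at box office (millions)",
                  min = 0, max = 800, value = c(0, 800), step = 1)
    ),
    # Placeholder that the server fills with the ggvis plot
    mainPanel(ggvisOutput("plot1"))
  )
)
```

A range slider (a `value` of length two) returns both ends of the selection, which is convenient for year and box office; the single-value sliders act as minimum thresholds.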
The server side of the application is described below. It filters the movies according to the selected inputs, generates the tooltip text, and draws the ggvis plot from a reactive expression.
First, join the tables and do some data cleaning: filter out movies with fewer than 10 reviews and select the specified columns.
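The join-and-clean step could look roughly like the sketch below, written in base R so it runs without extra packages. The table names (`omdb`, `tomatoes`), the column names, and the rows are all made-up placeholders; the real tables and columns may differ.

```r
# Hypothetical toy tables with made-up rows, for illustration only
omdb <- data.frame(
  ID = c(1, 2, 3),
  Title = c("Movie A", "Movie B", "Movie C"),
  Year = c(1994, 2004, 2010),
  Writer = c("Writer A", "Writer B", "Writer C"),
  stringsAsFactors = FALSE
)
tomatoes <- data.frame(
  ID = c(1, 2, 3),
  Reviews = c(4, 120, 250),
  Rating = c(6.1, 7.5, 8.4)
)

# Join the two tables on their shared ID column
joined <- merge(omdb, tomatoes, by = "ID")

# Drop movies with fewer than 10 reviews and keep only the needed columns
all_movies <- joined[joined$Reviews >= 10,
                     c("ID", "Title", "Year", "Reviews", "Rating")]
print(all_movies)
```

Here "Movie A" is dropped because it has only 4 reviews, and the `Writer` column is left out of the result.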
# Filter the movies, returning a data frame
# Function for generating tooltip text
# A reactive expression with the ggvis plot
# Variables that can be put on the x and y axes
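The pieces named in the comments above can be sketched as plain R helpers. This is a simplified illustration under stated assumptions, not the original server code: the function names `filter_movies` and `movie_tooltip`, the column names, and the `axis_vars` entries are hypothetical, only two of the filters are shown, and the sample rows reuse figures quoted earlier in this post.

```r
# Variables that can be put on the x and y axes (label = column name)
axis_vars <- c("Year" = "Year", "Dollars at box office" = "BoxOffice")

# Filter the movies, returning a data frame.
# Only the year range and a box office minimum are handled here;
# other inputs (reviews, Oscars, genre) would be added the same way.
filter_movies <- function(movies, year_range = c(1940, 2014),
                          min_boxoffice = 0) {
  keep <- movies$Year >= year_range[1] &
    movies$Year <= year_range[2] &
    movies$BoxOffice >= min_boxoffice
  movies[keep, , drop = FALSE]
}

# Function for generating tooltip text.
# `x` stands for the hovered point and is assumed to carry the movie ID.
movie_tooltip <- function(movies, x) {
  if (is.null(x) || is.null(x$ID)) return(NULL)
  movie <- movies[movies$ID == x$ID, ]
  paste0("<b>", movie$Title, "</b><br>", movie$Year, "<br>$",
         format(movie$BoxOffice, big.mark = ",", scientific = FALSE))
}

# Small sample built from figures quoted in this post
sample_movies <- data.frame(
  ID = 1:4,
  Title = c("The Lion King", "Shrek 2", "The Dark Knight", "Toy Story 3"),
  Year = c(1994, 2004, 2008, 2010),
  BoxOffice = c(94200000, 436500000, 533300000, 415000000),
  stringsAsFactors = FALSE
)

recent_hits <- filter_movies(sample_movies, year_range = c(2000, 2014),
                             min_boxoffice = 4e8)
print(recent_hits$Title)
print(movie_tooltip(sample_movies, list(ID = 3)))
```

In the app itself, helpers like these would typically be called from a reactive expression, with the tooltip attached through ggvis's add_tooltip() and the plot connected to the UI with bind_shiny().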
Finally, use RStudio to run your app and explore the beauty of data visualization.