Accessing the YouTube API v3 with R

This is a quick tutorial on how to access the YouTube API v3 from within R.

A possible scenario is that you would like to perform an analysis of how channels are related to each others or query YouTube to find out the channels that are related to a certain keyword.

Step 1

In order to access the YouTube API v3 you first need to be authorised. There are two ways to proceed:

  1. Use OAuth
  2. Get an API Key

For the sake of simplicity we will work with method 2, even though 1 is also possible in R (see for example the httr package).

You should now go on and obtain your API Key at I have masked my own API Key in the following code for security reasons and also because each key corresponds to a quota (maximum daily use of the YouTube API’s) that should not be exceeded.

Step 2

You are basically all set. Here is a sample code on how to perform a query to YouTube using the API’s. Replace the API_key with the one obtained from Google in Step 1.

The fromJSON function performs the query and parses the results, which are originally in JSON format, into a nested list which can be further manipulated in R.

Other available YouTube API services use pretty much the same format. See for example the search service within YT_Service[1]. For more information on the what you can do with the YouTube API’s, consult the online documentation starting here.

One final element you need to pay attention to. Results from the API query are paginated. You can set the number of results per page via the MaxResults parameter, which defaults to 5 and has a maximum value of 50 in the current API v3. To get the next page results (assuming MaxResults is exceeded), you will need to repeat the query adding the pageToken parameter and setting it to be equal to nextPageToken as returned by the previous query call.

Happy YouTubing with R!



Continue Reading

Bye bye Shareaholic

I just removed the WordPress Shareaholic plugin after I realised it injects unwanted content and external links into my site. See the screenshot below where it adds an unwanted link to KUpdates. In full honesty, would it be a link somehow related to what I write about, but this is about someone expecting twins! Unbelievable.

Screen Shot 2014-10-06 at 22.26.44

Anyway, according to some other bloggers out there, the same plugin also injects unwanted tracking code and provides information on your visitors to other sites. I recommend everyone using Shareaholic to remove it immediately

This is the story of an originally nice plugin gone totally bad. Hopefully will also take a stand and remove it from their plugin directory. For sure it is gone forever from my sites.


Continue Reading

Pivot Tables in R with dplyr

The prolific Hadley Wickham did it again! This time he came up, together with Romain Francois, with an amazing library for data manipulation that turns the task of making Pivot Tables in R a real breeze. Enter dplyr. Along the lines of ggplot2, also from the same main author, dplyr implements a grammar of data manipulation and also introduces a new syntax using “pipe” operators.

Continue Reading

R for Excel Users #5 – Manipulating Data Frames

I just uploaded to YouTube a new episode of the series “R for Excel Users”. Today we will be looking at how to manipulate Data Frames in a proper and effective way. If you like this series and would like to be informed when new episodes are posted, you can subscribe to this blog through the subscription button on the right. Enjoy!

Continue Reading