Category: Big data

Heart-shaped wordcloud, celebrating Colombia peace treaty

This is a lightening quick post just providing the script to draw a heart-shaped wordcloud, using the awesome wordcloud2 package. See the resulting image here:   Apparently, the original code allows you to fit a wordcloud to any shape, even custom shapes, but I didn’t find that functionality pushed out into R yet (a peace sign would have been awesome… Read more →

R Tagosphere!

This post explores the inter-relationships of StackOverflow Tags for R-related questions. So I grabbed all the questions tagged with “r”, took the other tags in each question and made some network charts that show how often each tag is seen with the other tags. The point is to see the empirical relationships that develop as people organically describe their problems with R. Full… Read more →

Clickable list of the best animations since 1900, gathered the geek way.

In the midst of our random data exploration, Laure and I started playing around with Hadley’s movies dataset and noticed that there were a lot of old cartoon animations… I mean REALLY old. So we got excited and wondered if we could find Youtube links for all these old animations. Indeed we could! Here’s how we did it. As always, the full analysis… Read more →

Locations for 75000 dams

The last task I performed for AQUASTAT was to try to find the best way to estimate the anthropogenic evaporation from dams. There will be a paper coming out soon, but in the meantime, I provide one of the fun outputs, a map of 75000 dams! ok okok… it’s not REALLY 75000 dams, this is an duplicate-containing amalgam of: AQUASTAT dams… Read more →

Transboundary surface water flow

A visualization generated for AQUASTAT of FAO. Two types of charts were prepared: Sankey and Network. The Sankey plots allow for an ‘automatic sorting’ of countries based on who is the water tower and who is the water source. This visualization is useful to demonstrate where a country falls on this continuum. The network plot shows the automatic arrangement of countries with their ‘water-neighbours’ without being… Read more →

Chord progressions of 5 000 songs!

Update: Full analysis and everything you need at my github The database contains analyses of over 5000 songs*. These analyses are uploaded by users and allow for all these songs to be analyzed in bulk, as well as individually. One of these ‘all song’ analyses enables users to gather chord progressions on ALL songs (see the analysis file… Read more →

%d bloggers like this: