1 Executive summary

This document presents a data analysis of Rebrickable data set, which is basically a catalog of LEGO Sets, Parts, Colors and various relationships. The goal of the analysis was to identify interesting patterns and relationships in the data.

Key findings from the analysis include:

  1. Skeleton is the most common minifigure in LEGO sets.
  2. The most common color of LEGO parts is black.
  3. Quite surprising is that the most common part category is “Technic Pins” in spare parts and normal.
  4. In 2016 LEGO released the most Star Wars sets and
  5. In set Collectible Minifigures number of parts is almost always between 6 and 11.

2 Most common minifigures

This chart presents which minifigures occur most often in LEGO sets, hover on each bar to see the image of particular minifigure

3 Distribution of number of parts in sets

This chart visualizes distribution of number of parts in each set of some theme. We took only subset of points not to clutter the plot. Hover on each point to see an image of particular set. We guaranteed that top 10 biggest sets in each theme are always present because they usually look cool.

4 Number of LEGO sets released under each theme

This graph presents distribution of some most popular themes in terms of how many sets were released each year. Feel free to toggle themes that interest you the most.

5 Tree of themes

Themes might be sub-themes, tree-map below shows some of the most popular themes and their subthemes.

Tree map with all present themes, showing the difference between number of sets produced per theme and the number of sub-themes per theme.

6 Part’s colors

7 Grouped bar plots

The most frequent parts in inventory from the most interesting within 5 the most frequent part categories, hover on each bar to see the image of particular part.