Start Free Trial

Weekly Challenges

Solve the challenge, share your solution and summit the ranks of our Community!

Also available in | Français | Português | Español | 日本語
IDEAS WANTED

Want to get involved? We're always looking for ideas and content for Weekly Challenges.

SUBMIT YOUR IDEA

Challenge #365: LEGO® My Data!

Pilsner
13 - Pulsar

My solution:

Spoiler
365.png

RM_MOHANTY
7 - Meteor

LEGO Dataset Explanations

To identify a LEGO model based on an inventory ID, we are using four related datasets. Each one plays a unique role in helping us connect the inventory to the final LEGO set information, such as name, year, number of parts, and the theme it belongs to.


Dataset 1 – Inventory Sets

Column Name Description
idUnique inventory ID for a specific LEGO box or set of bricks
versionVersion of the inventory (not used in our joins)
set_numLEGO set number that this inventory ID belongs to
 

Purpose:
This dataset tells us which LEGO set each inventory ID refers to.

Example: Inventory ID 11148 belongs to set_num = 75911-1.


Dataset 2 – Sets

Column Name Description
set_numLEGO set number (primary key)
nameName of the LEGO set
yearYear the set was released
theme_idReference to the theme or collection the set belongs to
 

Purpose:
This dataset provides the actual details of each LEGO set, including its name, release year, and the theme it belongs to (through theme_id).


Dataset 3 – Set Parts Info

Column Name Description
model_numSame as set_num — identifies the LEGO set
num_partsTotal number of parts in the set
 

Purpose:
This dataset tells us how many bricks or parts are in each LEGO set.
We join it using set_num = model_num.


Dataset 4 – Themes

Column Name Description
idUnique identifier for the LEGO theme
nameName of the theme (e.g., "Star Wars", "Technic", "City")
parent_idUsed for nested themes (not used here)
 

Purpose:
This dataset gives us the theme name for each theme_id.
We use it to convert theme IDs into readable theme labels.


Summary of Data Flow

  1. Start with Dataset 1 (inventory ID)

  2. Join with Dataset 2 to get set details (name, year, theme ID)

  3. Join with Dataset 3 to get number of parts

  4. Join with Dataset 4 to get the theme name

Ajay_Singha
5 - Atom

Loved it!! 

mirzaglamocak_alteryx
8 - Asteroid
Spoiler
mirzaglamocak_alteryx_0-1757694098080.png