Tasks
20 tasks across 5 tiers, 4 per tier. Each task pairs a prompt with a
deterministic pass rubric and a curator-built reference solve. Every
task.yaml carries a canary GUID, which is our contamination-detection
signal.
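As a rough illustration of how a canary GUID can serve as a contamination signal, the sketch below scans a piece of text (for example, model output or a scraped corpus shard) for known canary values. The function name `leaked_canaries`, the GUID format, and the example value are assumptions made for illustration; the actual task.yaml schema and detection tooling are not described here.

```python
import re

# Assumption: canaries are standard 8-4-4-4-12 hexadecimal GUIDs.
GUID_RE = re.compile(
    r"[0-9a-fA-F]{8}-[0-9a-fA-F]{4}-[0-9a-fA-F]{4}-"
    r"[0-9a-fA-F]{4}-[0-9a-fA-F]{12}"
)


def leaked_canaries(text, known_canaries):
    """Return the known canary GUIDs that appear verbatim in text.

    Matching is case-insensitive; results are lowercase and sorted.
    """
    found = {m.group(0).lower() for m in GUID_RE.finditer(text)}
    known = {c.lower() for c in known_canaries}
    return sorted(found & known)


if __name__ == "__main__":
    # Hypothetical canary value, not taken from any real task.yaml.
    canary = "123e4567-e89b-12d3-a456-426614174000"
    sample = "some scraped text ... " + canary + " ... more text"
    print(leaked_canaries(sample, [canary]))
```

A non-empty result suggests the scanned text has seen task files verbatim. An empty result does not prove the absence of contamination, since a paraphrased or stripped copy would not carry the GUID.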
trivial (4)
flat-plate-5x7
5×7 has no single-part solution in the Rebrickable catalog (the standard plate
flat-plate-5x5
The headline failure mode from the spike: models propose a single layer of
staggered-column-2x2
Forces the model to reason about brick stagger in the Z-axis — the same
hollow-frame-8x8
Hollow frame at 8×8 (1-stud walls) forces multi-part composition — four side
easy (4)
small-table
Tests whether the model can compose a multi-element object: tabletop on top
step-pyramid
Tests footprint arithmetic across 3 stacked tiers AND bonding between layers
simple-gate
Tests spanning structure and consistent-height arithmetic — both pillars
picture-frame
Tests vertical-plane reasoning (a frame is built standing up, not lying flat)
medium (4)
minifig-chair
Tests three distinct sub-attachments simultaneously: legs-to-seat, seat-as-
bookshelf-3-shelf
Bonding-heavy: each shelf is a horizontal plate that bonds the two vertical
simple-bridge
Two challenges in one — the deck must be a bonded multi-plate spanning
square-lantern
Tests enclosure topology: 4 walls must close at corners (the L-bond pattern)
hard (4)
farm-tractor
Hardest single-object task: combines articulated wheel attachment, multi-zone
rolling-cart
Pure articulation test: the wheels MUST be attached such that they can
drawbridge
Articulation via a real hinge. The bridge MUST be attached with a hinge
slatted-bench
Heavy bonding test: the slatted seat surface (12 long thin plates side by
stretch (4)
small-house
Stretch-tier seed: combines wall enclosure (corner bonding), articulated
small-lighthouse
Multi-tier bonding: the round tower must bond to the square base (the 2×2
garden-shed
A second stretch-tier multi-feature task that varies the roof from 0017's
fountain-with-pool
Bonding-heavy stretch-tier: the 10×10 pool floor is the largest flat-tile