models /
gpt-5.5 scaffold-v1
Provider: openai ·
Run at 2026-04-28 ·
Submission 6d5dcb9b
CaSS 45.0%
CBS 0.0% v1: 0 by design
Coverage 0%
Cost $4.54
Tokens (in/out) 1.41M / 101.0k
Cost circuit-breaker tripped.
$4.00 cap exceeded at $4.54 estimated — tripped at task
0017 (small-house);
3 task(s) marked
did_not_run and excluded from the CaSS denominator.
Diagnostic row, not ranked. legacy task-set provenance; 20 task(s) missing canary/hash; 3 did_not_run
Tier breakdown
The 3-dimension diagnostic matrix for this baseline. CBS can read 0 while the conservative v1 validator has low definitive coverage — that's expected for v1.
| Tier | BOM pass | Scope pass | Buildability | CaSS | CBS | Coverage | n |
|---|---|---|---|---|---|---|---|
| trivial | 100.0% | 100.0% | — | 100.0% | 0.0% | 0% | 4 |
| easy | 100.0% | 50.0% | — | 50.0% | 0.0% | 0% | 4 |
| medium | 100.0% | 50.0% | — | 50.0% | 0.0% | 0% | 4 |
| hard | 100.0% | 25.0% | — | 25.0% | 0.0% | 0% | 4 |
| stretch | 100.0% | 0.0% | — | 0.0% | 0.0% | 0% | 1 |
Per-task results
All 20 tasks attempted by this baseline. Click a task to see the same row from every model's perspective.
| # | Slug | Tier | CaSS | BOM | Scope | Build | Ref sim | Iter | Tokens | Duration | Status | 3D |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0001 | flat-plate-5x7 | trivial | 1 | 1 | 1 | incon. | 0.000 | 11 | 105.6k | 5m39s | pass | — |
| 0002 | flat-plate-5x5 | trivial | 1 | 1 | 1 | incon. | 1.000 | 8 | 28.7k | 1m57s | pass | — |
| 0003 | staggered-column-2x2 | trivial | 1 | 1 | 1 | incon. | 0.000 | 3 | 12.5k | 16.6s | pass | — |
| 0004 | hollow-frame-8x8 | trivial | 1 | 1 | 1 | incon. | 0.000 | 5 | 18.8k | 2m01s | pass | — |
| 0005 | small-table | easy | 1 | 1 | 1 | incon. | 0.000 | 4 | 20.1k | 55.2s | pass | — |
| 0006 | step-pyramid | easy | 1 | 1 | 1 | incon. | 0.400 | 4 | 10.7k | 36.2s | pass | — |
| 0007 | simple-gate | easy | 0 | 1 | 0 | incon. | 0.000 | 3 | 22.0k | 27.2s | — | |
| 0008 | picture-frame | easy | 0 | 1 | 0 | incon. | 0.100 | 4 | 33.6k | 3m20s | — | |
| 0009 | minifig-chair | medium | 1 | 1 | 1 | incon. | 0.000 | 4 | 40.7k | 1m18s | pass | — |
| 0010 | bookshelf-3-shelf | medium | 0 | 1 | 0 | incon. | 0.000 | 4 | 28.0k | 2m45s | — | |
| 0011 | simple-bridge | medium | 0 | 1 | 0 | incon. | 0.250 | 6 | 49.8k | 2m07s | — | |
| 0012 | square-lantern | medium | 1 | 1 | 1 | incon. | 0.375 | 3 | 38.2k | 2m04s | pass | — |
| 0013 | farm-tractor | hard | 0 | 1 | 0 | incon. | 0.083 | 13 | 193.1k | 3m57s | — | |
| 0014 | rolling-cart | hard | 0 | 1 | 0 | incon. | 0.000 | 16 | 318.1k | 4m54s | — | |
| 0015 | drawbridge | hard | 1 | 1 | 1 | incon. | 0.000 | 8 | 55.8k | 2m50s | pass | — |
| 0016 | slatted-bench | hard | 0 | 1 | 0 | incon. | 0.000 | 7 | 269.0k | 6m07s | — | |
| 0017 | small-house | stretch | 0 | 1 | 0 | incon. | 0.301 | 12 | 267.1k | 5m08s | — | |
| 0018 | small-lighthouse | stretch | — | — | — | DNR | — | 0 | 0 | 0ms | DNR | — |
| 0019 | garden-shed | stretch | — | — | — | DNR | — | 0 | 0 | 0ms | DNR | — |
| 0020 | fountain-with-pool | stretch | — | — | — | DNR | — | 0 | 0 | 0ms | DNR | — |
Submitted BOMs
The final BOM the model emitted on each task it attempted.
#0001 flat-plate-5x7 trivial CaSS 1
| part_num | color_id | qty |
|---|---|---|
3032 | 0 | 2 |
3666 | 0 | 2 |
3710 | 0 | 2 |
3024 | 0 | 2 |
| total pieces | 8 | |
#0002 flat-plate-5x5 trivial CaSS 1
| part_num | color_id | qty |
|---|---|---|
78329 | 4 | 10 |
| total pieces | 10 | |
#0003 staggered-column-2x2 trivial CaSS 1
| part_num | color_id | qty |
|---|---|---|
3004 | 4 | 26 |
| total pieces | 26 | |
#0004 hollow-frame-8x8 trivial CaSS 1
| part_num | color_id | qty |
|---|---|---|
3460 | 0 | 4 |
3666 | 0 | 2 |
| total pieces | 6 | |
#0005 small-table easy CaSS 1
| part_num | color_id | qty |
|---|---|---|
3958 | 70 | 1 |
3005 | 70 | 12 |
| total pieces | 13 | |
#0006 step-pyramid easy CaSS 1
| part_num | color_id | qty |
|---|---|---|
3958 | 19 | 1 |
3031 | 19 | 1 |
3022 | 19 | 1 |
| total pieces | 3 | |
#0007 simple-gate easy CaSS 0
| part_num | color_id | qty |
|---|---|---|
3005 | 0 | 10 |
3009 | 0 | 1 |
| total pieces | 11 | |
#0008 picture-frame easy CaSS 0
| part_num | color_id | qty |
|---|---|---|
3036 | 70 | 1 |
3032 | 15 | 1 |
3666 | 70 | 4 |
3032 | 70 | 1 |
11211 | 70 | 2 |
3001 | 70 | 1 |
| total pieces | 10 | |
#0009 minifig-chair medium CaSS 1
| part_num | color_id | qty |
|---|---|---|
3021 | 70 | 1 |
3005 | 70 | 8 |
3004 | 70 | 1 |
| total pieces | 10 | |
#0010 bookshelf-3-shelf medium CaSS 0
| part_num | color_id | qty |
|---|---|---|
3032 | 6 | 3 |
3010 | 6 | 18 |
| total pieces | 21 | |
#0011 simple-bridge medium CaSS 0
| part_num | color_id | qty |
|---|---|---|
3003 | 71 | 8 |
3029 | 70 | 1 |
3633 | 70 | 6 |
| total pieces | 15 | |
#0012 square-lantern medium CaSS 1
| part_num | color_id | qty |
|---|---|---|
3031 | 72 | 2 |
3010 | 72 | 6 |
3004 | 72 | 6 |
3005 | 72 | 16 |
| total pieces | 30 | |
#0013 farm-tractor hard CaSS 0
| part_num | color_id | qty |
|---|---|---|
3035 | 2 | 1 |
4488 | 0 | 2 |
6014b | 14 | 2 |
6015 | 0 | 2 |
30027b | 14 | 2 |
50951 | 0 | 2 |
3020 | 2 | 1 |
3003 | 2 | 1 |
3039 | 2 | 1 |
4079 | 0 | 1 |
3829c01 | 0 | 1 |
2412b | 71 | 1 |
3062b | 71 | 1 |
| total pieces | 18 | |
#0014 rolling-cart hard CaSS 0
| part_num | color_id | qty |
|---|---|---|
3032 | 4 | 1 |
3009 | 4 | 2 |
3004 | 4 | 2 |
32828 | 4 | 1 |
21445 | 0 | 4 |
30027a | 15 | 4 |
30028 | 0 | 4 |
| total pieces | 18 | |
#0015 drawbridge hard CaSS 1
| part_num | color_id | qty |
|---|---|---|
3003 | 71 | 10 |
3035 | 71 | 1 |
2429 | 70 | 1 |
2430 | 70 | 1 |
3032 | 70 | 1 |
2431pr0075 | 19 | 6 |
| total pieces | 20 | |
#0016 slatted-bench hard CaSS 0
| part_num | color_id | qty |
|---|---|---|
60479 | 70 | 4 |
60479 | 308 | 4 |
3029 | 72 | 1 |
3010 | 72 | 8 |
3020 | 72 | 2 |
3039 | 72 | 4 |
| total pieces | 23 | |
#0017 small-house stretch CaSS 0
| part_num | color_id | qty |
|---|---|---|
3867 | 2 | 1 |
30179 | 15 | 1 |
40241 | 6 | 1 |
3853 | 15 | 1 |
3855 | 47 | 1 |
3004 | 15 | 40 |
3010 | 15 | 20 |
3008 | 15 | 12 |
6111 | 15 | 8 |
3037 | 4 | 8 |
3039 | 4 | 4 |
3008 | 4 | 1 |
| total pieces | 98 | |
Run metadata
| Submission ID | 6d5dcb9b-0bea-4993-81cf-af49587dc0e6 |
|---|---|
| Protocol | scaffold-v1 |
| Protocol hash | sha256:83fca868afffd414061a997e14f4688a0c96a4e750c2c91de23e7c5be11a2b7d |
| Run at | 2026-04-28T15:21:38.453Z |
| Total duration | 46m22s |
| Adapter health | ok |