Workroom PlayTime Longplayer: Interesting Testing

Feb 2, 2026 (Feb 16, 2026)

January's Interesting Testing series was fun. I'll rerun the exercises and put them in context in a longer (90-minute) session sometime in February – probably a Wednesday or Thursday evening in the last couple of weeks of the month. I'm calling it a Longplayer.

I'll run it if I have 6 people, run it more than once if I have too many, and if several of you need a particular time or day, I'll run it for you and your friends.

Want to come? Email me – and if you're not already subscribed, subscribe.

What we're doing

I'll take the last three Workroom PlayTimes and run them all together. They've all, as it happens, got a machine reasoning / AI / LLM component to them – and they all present challenges or provocations to us, as testers.

You'll get hands-on with these different parts:

We'll do two things with code. First, we'll build whatwords, from A Library with No Code, as a one-shot, just from examples.
Next, we'll use that library in a Ralph Loop: we'll set the loop up so that we're building tiny checkable deliverables, using a fresh LLM context for each one. Each tiny task will be built test (check) first, with us simply checking and steering, not designing or coding.
We'll also be looking at the ARC-AGI website and its tests: tests that are built to be easy for humans, hard for machines, and designed to assess and maybe guide the growth of machine reasoning.
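For a feel of the Ralph Loop part before the session, here is a minimal sketch of the shape of such a loop. It is not the setup we'll use on the day. Every name in it (Task, ralph_loop, make_stub_worker) is made up for illustration, and the "worker" is a canned stand-in rather than a real LLM call. What it tries to show is the order of things: the check exists before any work is attempted, each attempt gets a fresh worker with no memory of earlier ones, and anything that never passes its check is left for a human to steer.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical type: a worker stands in for one fresh LLM session.
# It takes a task prompt and returns a candidate deliverable.
Worker = Callable[[str], str]


@dataclass
class Task:
    name: str
    prompt: str
    check: Callable[[str], bool]  # written first, before any work is attempted


def ralph_loop(
    tasks: List[Task],
    make_worker: Callable[[], Worker],
    max_attempts: int = 3,
) -> Dict[str, bool]:
    """Run each tiny task against its check, using a fresh worker per attempt."""
    results: Dict[str, bool] = {}
    for task in tasks:
        results[task.name] = False
        for _ in range(max_attempts):
            worker = make_worker()  # fresh context: nothing carried over
            candidate = worker(task.prompt)
            if task.check(candidate):
                results[task.name] = True
                break
    return results


def make_stub_worker() -> Worker:
    """Assumption: a canned stand-in for starting a new LLM session."""
    canned = {
        "Reverse the string 'abc'": "cba",
        "Add 2 and 2": "5",  # deliberately wrong, so the check catches it
    }
    return lambda prompt: canned.get(prompt, "")


demo_tasks = [
    Task("reverse", "Reverse the string 'abc'", lambda out: out == "cba"),
    Task("add", "Add 2 and 2", lambda out: out == "4"),
]

if __name__ == "__main__":
    # Tasks mapped to False are the ones where a human needs to steer.
    print(ralph_loop(demo_tasks, make_stub_worker))
```

In this toy run the first task passes on its first attempt and the second never passes, which is the point at which we, as testers, would step in: change the prompt, tighten the check, or split the task into something smaller.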