Photo by mohammed idris djoudi / Unsplash

Workroom PlayTime Longplayer: Interesting Testing

Feb 2, 2026

January's Interesting Testing series was fun. I'll rerun the exercises and will put them in context in a longer (90-minute) session sometime in February – probably a Weds / Thursday evening in the last couple of weeks of the month. I'm calling it a Longplayer.

I'll run if I have 6 people, run more than once if I have too many, and if there are several of you who need a particular time or day, I'll run it for you and friends.

Want to come? Email me – and if you're not already subscribed, subscribe.

What we're doing

I'll take the last three Workroom PlayTimes and run them all together. They've all, as it happens, got a machine reasoning / AI / LLM component to them – and they all present challenges or provocations to us, as testers.

You'll get hands-on with these different parts:

  • We'll do two things with code, building that code from examples. We'll build whatwords, from A Library with No Code, as a one-shot, just from examples.
  • Next, we'll use that library in a Ralph Loop: we'll set the loop up so that we're building tiny checkable deliverables using a fresh LLM each time. Each tiny task will be built test (check) first, each with a fresh context, with us simply checking and steering, not designing or coding.
  • We'll also be looking at the ARC-AGI website and its tests: tests that are built to be easy for humans, hard for machines, and designed to assess and maybe guide the growth of machine reasoning.

Tags

James Lyndsay

Getting better at software testing. Singing in Bulgarian. Staying in. Going out. Listening. Talking. Writing. Making.