Workroom PlayTime Longplayer: Interesting Testing
January's Interesting Testing series was fun. I'll rerun the exercises and will put them in context in a longer (90-minute) session sometime in February – probably a Weds / Thursday evening in the last couple of weeks of the month. I'm calling it a Longplayer.
I'll run if I have 6 people, run more than once if I have too many, and if there are several of you who need a particular time or day, I'll run it for you and friends.
Want to come? Email me – and if you're not already subscribed, subscribe.
What we're doing
I'll take the last three Workroom PlayTimes and run them all together. They've all, as it happens, got a machine reasoning / AI / LLM component to them – and they all present challenges or provocations to us, as testers.
You'll get hands-on with these different parts:
- We'll do two things with code, building that code from examples. We'll build
whatwords, from A Library with No Code, as a one-shot, just from examples. - Next, we'll use that library in a Ralph Loop: we'll set the loop up so that we're building tiny checkable deliverables using a fresh LLM each time. Each tiny task will be built test (check) first, each with a fresh context, with us simply checking and steering, not designing or coding.
- We'll also be looking at the ARC-AGI website and its tests: tests that are built to be easy for humans, hard for machines, and designed to assess and maybe guide the growth of machine reasoning.