The last mile nobody demos: shipping agentic AI in regulated pharma

I’ve watched the same meeting play out more than once. An agentic system reads a batch of messy clinical trial data, reasons about what looks wrong, and drafts the questions a human reviewer would ...

Jun 24, 2026 AI & ML

Claude as a Meta-Tool, Through the Loop

Claude isn’t just a tool you operate. It’s a meta-tool: give it an outcome and it builds its own tools to reach it — a script, a skill, a small pipeline — for a single task or a stack of them at an...

Jun 22, 2026 AI & ML

Rootless LLM Fine-Tuning on DGX Spark: GPU Containers Without Root

In my previous DGX Spark posts I built and profiled an LLM fine-tuning pipeline that ran in a Docker container. This time I went rootless: same Grace Blackwell GB10, but no sudo, no writes to /etc,...

Jun 20, 2026 DevOps & Computing

GPU Profiling for LLM Fine-Tuning on DGX Spark: What the Traces Reveal

I fine-tuned a 1.5B parameter model on a DGX Spark and the training finished in 26 seconds. Good enough? I had no idea. The terminal showed 2.18 it/s and a loss curve that went down. But whether th...

Jun 10, 2026 AI & ML

LLM Fine-Tuning on DGX Spark: Building a Reusable Training Pipeline

A colleague’s team recently got access to NVIDIA DGX Spark for LLM work, and I joined to help set up the infrastructure. Before anyone could start experimenting, we needed a reusable pipeline — som...

Jun 10, 2026 DevOps & Computing

One Person, One AI, One Morning: The New Shape of Knowledge Work

I needed to launch a training pilot — an eight-course AI development curriculum — across multiple teams in a large global organization. If you’ve worked in a global organization, you know what come...

Jun 6, 2026 AI & ML

Your AI Agent Has Amnesia

In a previous post, I wrote about an AI tutoring system that generated identical verb conjugation exercises across every session — grammatically perfect, pedagogically useless. The student scored 1...

May 17, 2026 AI & ML

Figure: Two Gaussian distributions side by side. Without a harness, the AI points to the peak (most probable); with a harness, the arrow redirects to a less probable but more useful bar. Tagline: Plausible ≠ Useful.

What Two Bugs Taught Me About AI Agentic Harnesses

I’d been running my AI-powered exercise generation system for weeks. It produced worksheets, rendered them to PDF, evaluated photographed handwritten answers, tracked scores, and adapted difficulty...

May 10, 2026 AI & ML

The Planning Phase Is Where AI Agents Earn Their Keep

I asked Claude Code to design a Neovim maintenance system — update scripts, upstream sync, cross-machine portability, the works. It explored my setup in parallel with three agents, drafted a compre...

Mar 29, 2026 AI & ML

AI Agent Skills: Turning One-Off Fixes into Reusable Knowledge

My VPN stopped connecting. Not with a helpful error message — the Alacritty terminal window spawned by my opencon command opened for a fraction of a second and closed. No output, no log, nothing to...

Mar 19, 2026 AI & ML