Blog

Visit and subscribe to the blog from here: Semantics & Systems

AI-Driven Development with OpenSpec: A Step-by-Step Walkthrough by Khaled Ahmed, PhD

Building a budget tracker from proposal to archive, one artifact at a time

Read on Substack

Spec Driven Development: Fixing the AI Coding Pipeline with OpenSpec and Claude Code by Khaled Ahmed, PhD

Stop letting AI guess your architecture. Start orchestrating it with Spec-Driven Development

Read on Substack

Atomic claims as an evaluation primitive by Khaled Ahmed, PhD

Turning free text into checkable units for LLM evaluation

Read on Substack

Why Holistic LLM Judging Fails by Khaled Ahmed, PhD

Single-pass “LLM-as-a-judge” tends to sample the claim space, overloads attention in long contexts, and can produce plausible false critique.

Read on Substack

Where To Trust LLMs in the Program Analysis Pipeline by Khaled Ahmed, PhD

Reflections from my thesis defense on keeping correctness with analysis and using models for interpretation.

Read on Substack

Older posts: