Chains of Thought

Musings on AI, machine learning, and life

The Crystallization of Transformer Architectures (2017-2025)

A dataset-driven analysis of transformer architecture choices and their convergence over eight years.

31 min read · December 05, 2025

2025 · llm
Building a Fast BPE Tokenizer from Scratch

Incrementally optimizing a BPE tokenizer with complexity analysis and benchmarks.

44 min read · November 20, 2025

2025 · llm
Systematic Pessimism

A new paradigm for scaling quality engineering with AI — automated discovery of edge cases or potential failure modes at every commit.

18 min read · February 10, 2025

2025 · ai coding-assistants human-centered-ai
Beyond Automation — The Case for AI Augmentation

The really transformative interfaces won't be the ones that make us more productive; they'll be the ones that make us more thoughtful, more creative, more aware of our own cognitive patterns. Like mirrors for our minds, showing us our blind spots and suggesting perspectives we habitually miss.

12 min read · January 06, 2025

2025 · ai human-centered-ai llm
Rethinking Generation & Reasoning Evaluation in Dialogue AI Systems

As we rely further on (and reap the benefits of) LLMs’ reasoning abilities in AI systems and products, how can we still grasp a sense of how LLMs “think”? Where steerability is concerned — users or developers may desire to add in custom handling logic and instructions — how can ensure that these models continue to follow and reason from these instructions towards a desirable output?

13 min read · November 08, 2023

2023 · llm reasoning ai-evaluation machine-learning