Junaid Butt

Blog

Welcome to my blog! Here you’ll find my latest thoughts, tutorials, and insights on software development, machine learning, and technology.

Latest Posts

This section automatically displays my latest blog posts. Each post covers technical topics, project showcases, and learning experiences.

Getting Started with Knowledge Distillation

This post is a brief introduction to the major types of knowledge distillation used when training Language Models. It should provide a conceptual overview of how this is…
Jan 24, 2026
Junaid Butt

Introduction to Key-Value Cache

This post presumes familiarity with the transformer architecture and how tokens are computed and selected. During inference, we autoregressively generate new tokens given…
Nov 25, 2025
Junaid Butt

Introduction to Speculative Decoding

The standard model of generating text from a language model (LM) is autoregressive sampling, which involves generating \(K\) tokens using \(K\) sequential runs of the LM.…
Sep 22, 2025
Junaid Butt

Tokenising Text Data for Pretraining

After reading Chapter 3 of Building Large Language Models from Scratch, I was curious about how text data turns into tokens and gets batched for pretraining. This post…
Sep 20, 2025
Junaid Butt