Starter AI
Posts
Deep Dive: Building GPT from scratch - part 7

Deep Dive: Building GPT from scratch - part 7

learning from Andrej Karpathy

Miko
March 22, 2024

Sponsored by

Hello and welcome back to the series on Starter AI. It’s Miko again, from vaguely spring-like London.

Today, we’re back from the side quests, and we’re finally finishing up makemore. We’ll complicate the architecture and then have an epiphany that the resulting code looks very much like WaveNet from the 2016 DeepMind paper. The lecture is shorter than usual, at just under an hour. Let’s do it!

Subscribe to keep reading

This content is free, but you must be subscribed to Starter AI to continue reading.

Already a subscriber?Sign in.Not now