Deep Dive: Building GPT from scratch - part 7

learning from Andrej Karpathy

Hello and welcome back to the series on Starter AI. It’s Miko again, from vaguely spring-like London.

Today, we’re back from the side quests, and we’re finally finishing up makemore. We’ll complicate the architecture and then have an epiphany that the resulting code looks very much like WaveNet from the 2016 DeepMind paper. The lecture is shorter than usual, at just under an hour. Let’s do it!
