an open lab for language models

Learn how LLMs actually work, by playing with them.

Six guided modules with browser-native simulators. Tokenize text, build vocabularies, train a tiny model, retrieve documents and wire agents together. No GPU. No setup.

Start with Tokens Train a tiny model

modules

100%

in-browser

API keys

∞

experiments

/ curriculum

The full stack of an LLM

Follow them in order or jump to what you're curious about.

Tokens

See how text becomes numbers with a live BPE-style tokenizer.

open module

Vocabulary

Build a vocab from a corpus and watch coverage grow.

open module

Pre-training

Train a tiny next-token predictor in your browser.

open module

Fine-tuning

Watch loss curves move as you teach a model new behavior.

open module

Retrieval (RAG)

Index documents, embed queries, and rank chunks live.

open module

Orchestration

Compose agents, tools, and routers into a flow.

open module

/ key ideas

A model in one paragraph

A language model takes text, breaks it into tokens from a fixed vocabulary, and predicts the next token. It learns this skill during pre-training on huge corpora, then is steered with fine-tuning. To answer about things it never saw, we attach retrieval. To make it act, we wrap it in orchestration, with tools, routers, and memory.