Skip to main content
PastWebinarOnline

Webinar "Interpretability of large language models"

Programme

A survey of modern interpretability methods for large language models: mechanistic interpretability, activation analysis and feature attribution.

Speakers

Materials

Subscriber access

The recording, slides and discussion transcript are available in full to subscribers.

Full material available to AI Forum Review subscribers

Subscribe