PastWebinarOnline
Webinar "Interpretability of large language models"
Programme
A survey of modern interpretability methods for large language models: mechanistic interpretability, activation analysis and feature attribution.
Speakers
Materials
Subscriber access
The recording, slides and discussion transcript are available in full to subscribers.