Gemma Scope: helping the safety community shed light on the inner workings of language models

Announcing a comprehensive, open suite of sparse autoencoders for language model interpretability.

Jan 3, 2025 - 02:47