OpenMOSS
OpenMOSS shares a collection of our research in large language models. The team is affiliated to the FudanNLP lab.
Popular repositories Loading
-
Language-Model-SAEs
Language-Model-SAEs PublicFor OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.
Repositories
Showing 9 of 9 repositories
- Language-Model-SAEs Public
For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.
OpenMOSS/Language-Model-SAEs’s past year of commit activity - TransformerLens Public Forked from TransformerLensOrg/TransformerLens
A library for mechanistic interpretability of GPT-style language models
OpenMOSS/TransformerLens’s past year of commit activity