Release v1.3.0 · TransformerLensOrg/TransformerLens

What's Changed

fix outdated link in Exploratory Analysis Demo by @daspartho in #259
Finish patching docs by @ckkissane in #261
Fix from_pretrained with redwood_attn_2l by @ArthurConmy in #268
Added list of demos to tutorial section. by @JayBaileyCS in #263
Improving head detector by @MatthewBaggins in #255
Optimize imports in HookedTransformer by @rusheb in #260
Baidicoot main - Implemented functionality for loading mingpt-style models off HF (e.g. othello-gpt) by @jbloomAus in #272
Upgrade to typeguard 3 by @dkamm in #269
Install autoformatting tools and add formatting checks to CI by @rusheb in #270
Add TransformerLens logo to docs and GitHub by @koayon in #273
Wrap docstrings and comments in HookedTransformer by @luciaquirke in #274
Format array in test_transformer_lens.py by @rusheb in #275
Introducing HookedEncoder by @rusheb in #276
Add tests for tokenization methods by @Aprillion in #280
Fix broken link in issue template by @rusheb in #278
Various memory solutions. Ultimately used gc to "hide" memory issue which should be solved soon. by @jbloomAus in #296
FactoredMatrix getitem (#224) by @glerzing in #295
Add tiny stories by @Felhof in #292
from_pretrained custom parameters (#288) by @glerzing in #298
Add better __name__ annotation to full_hooks by @ArthurConmy in #302
Multiple minor corrections by @glerzing in #301
Add get_basic_config util function by @adamyedidia in #294
Fix bug: HookedEncoder not being moved to GPU by @rusheb in #307
Fix tokenization tests on GPU by @rusheb in #308
Add prepend option to model.add_hook by @ArthurConmy in #303
Fix tiny stories model names by @Felhof in #305
Add hook_mlp_in by @ArthurConmy in #313
Ignore some functions in the documentation (#310) by @glerzing in #312
Add assertion to refactor_factored_attn_matrices by @ArthurConmy in #320
Update evals.py to not directly call cuda, instead have default cuda … by @dennis-akar in #324
Add SVD interpretability feature to TransformerLens by @JayBaileyCS in #311
Fix svd tests on GPU by @slavachalnev in #330
Reduce memory use when loading model by @slavachalnev in #327
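
A note on the `prepend` option added in #303: the idea can be pictured with a toy hook registry. The sketch below is not TransformerLens code. `ToyHookPoint`, `add_hook`, and `run` are hypothetical names used only for illustration, and it assumes that `prepend=True` means the new hook runs before hooks that were registered earlier.

```python
from typing import Callable, List

Hook = Callable[[float], float]


class ToyHookPoint:
    """Toy stand-in for a hook point (hypothetical, not the library class)."""

    def __init__(self) -> None:
        self.hooks: List[Hook] = []

    def add_hook(self, hook: Hook, prepend: bool = False) -> None:
        # Assumption: prepend=True places the hook at the front of the list,
        # so it runs before every hook registered so far.
        if prepend:
            self.hooks.insert(0, hook)
        else:
            self.hooks.append(hook)

    def run(self, value: float) -> float:
        # Apply hooks in list order, feeding each output into the next hook.
        for hook in self.hooks:
            value = hook(value)
        return value


appended = ToyHookPoint()
appended.add_hook(lambda x: x * 2)
appended.add_hook(lambda x: x + 1)  # runs second: (3 * 2) + 1
appended_result = appended.run(3.0)

prepended = ToyHookPoint()
prepended.add_hook(lambda x: x * 2)
prepended.add_hook(lambda x: x + 1, prepend=True)  # runs first: (3 + 1) * 2
prepended_result = prepended.run(3.0)
```

The two results differ only because of execution order, which is the point of a prepend flag when an intervention has to act before hooks that are already attached.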

New Contributors

@MatthewBaggins made their first contribution in #255
@koayon made their first contribution in #273
@luciaquirke made their first contribution in #274
@Aprillion made their first contribution in #280
@glerzing made their first contribution in #295
@Felhof made their first contribution in #292
@dennis-akar made their first contribution in #324

Full Changelog: v1.2.2...v1.3.0
