v1.3.0
What's Changed
- fix outdated link in Exploratory Analysis Demo by @daspartho in #259
- Finish patching docs by @ckkissane in #261
- Fix from_pretrained with redwood_attn_2l by @ArthurConmy in #268
- Added list of demos to tutorial section. by @JayBaileyCS in #263
- Improving head detector by @MatthewBaggins in #255
- Optimize imports in HookedTransformer by @rusheb in #260
- Baidicoot main - Implemented functionality for loading mingpt-style models off HF (e.g. othello-gpt) by @jbloomAus in #272
- Upgrade to typeguard 3 by @dkamm in #269
- Install autoformatting tools and add formatting checks to CI by @rusheb in #270
- Add TransformerLens logo to docs and GitHub by @koayon in #273
- Wrap docstrings and comments in HookedTransformer by @luciaquirke in #274
- Format array in test_transformer_lens.py by @rusheb in #275
- Introducing HookedEncoder by @rusheb in #276
- Add tests for tokenization methods by @Aprillion in #280
- Fix broken link in issue template by @rusheb in #278
- Various memory solutions. Ultimately used gc to "hide" memory issue which should be solved soon. by @jbloomAus in #296
- FactoredMatrix getitem (#224) by @glerzing in #295
- Add tiny stories by @Felhof in #292
- from_pretrained custom parameters (#288) by @glerzing in #298
- Add better __name__ annotation to full_hooks by @ArthurConmy in #302
- Multiple minor corrections by @glerzing in #301
- Add get_basic_config util function by @adamyedidia in #294
- Fix bug: HookedEncoder not being moved to GPU by @rusheb in #307
- Fix tokenization tests on GPU by @rusheb in #308
- Add prepend option to model.add_hook by @ArthurConmy in #303
- Fix tiny stories model names by @Felhof in #305
- Add hook_mlp_in by @ArthurConmy in #313
- Ignore some functions in the documentation (#310) by @glerzing in #312
- Add assertion to refactor_factored_attn_matrices by @ArthurConmy in #320
- Update evals.py to not directly call cuda, instead have default cuda … by @dennis-akar in #324
- Add SVD interpretability feature to TransformerLens by @JayBaileyCS in #311
- Fix svd tests on GPU by @slavachalnev in #330
- Reduce memory use when loading model by @slavachalnev in #327
New Contributors
- @MatthewBaggins made their first contribution in #255
- @koayon made their first contribution in #273
- @luciaquirke made their first contribution in #274
- @Aprillion made their first contribution in #280
- @glerzing made their first contribution in #295
- @Felhof made their first contribution in #292
- @dennis-akar made their first contribution in #324
Full Changelog: v1.2.2...v1.3.0