v1.2.2

jbloomAus released this 24 Apr 07:31

25a9c07

What's Changed

There are too many commit messages to list individually, so here is a summary.

General Features

Pipeline Parallelism
The cache no longer moves tensors across devices unless explicitly told to

New Models:

Redwood 2L
New Pythia Models
LLaMA

Analysis Features:

Add apply_ln to stack_head_results and stack_neuron_results
Context Manager for Hooks
Attention Head Detectors
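
To illustrate the idea behind the hooks context manager listed above, here is a minimal sketch in plain Python. It is a toy and not the library's implementation: `ToyModel`, its `hooks` method, and `zero_ablate` are hypothetical names chosen for this example. The point is only that hooks attached inside a `with` block are removed again when the block exits, even if an error occurs.

```python
from contextlib import contextmanager


class ToyModel:
    """Hypothetical stand-in for a hooked model (assumption, not the real API)."""

    def __init__(self):
        self.fwd_hooks = []  # callables applied to the activation, in order

    def forward(self, x):
        activation = x * 2  # pretend this is a layer output
        for hook in self.fwd_hooks:
            activation = hook(activation)
        return activation

    @contextmanager
    def hooks(self, fwd_hooks):
        """Attach hooks for the duration of a with-block, then remove them."""
        self.fwd_hooks.extend(fwd_hooks)
        try:
            yield self
        finally:
            # Remove only the hooks added here, even if the block raised.
            for hook in fwd_hooks:
                self.fwd_hooks.remove(hook)


def zero_ablate(activation):
    """Hypothetical hook: replace the activation with zero."""
    return 0


model = ToyModel()
with model.hooks(fwd_hooks=[zero_ablate]) as hooked:
    inside = hooked.forward(3)   # hook is active here
outside = model.forward(3)       # hook has been removed again
```

In this sketch the temporary hook affects only the call made inside the `with` block, so no manual cleanup step is needed afterwards.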

Thanks to all the Contributors!

Many thanks to: @rusheb, @ckkissane, @slavachalnev, @JayBaileyCS, @zshn-gvg, @jbloomAus, @adzcai, @adamyedidia, @ArthurConmy, @bryce13950, @daspartho, @haileyschoelkopf, @0amp

Full Changelog: v1.2.1...v1.2.2
