v1.2.2
What's Changed
Too many commit messages so let's summarise them.
General Features
- Pipeline Parallelism
- Cache now doesn't move tensors across devices unless told to
New Models:
- Redwood 2L
- New Pythia Models
- LLaMA
Analysis Features:
- Add apply_ln to stack_head_results and stack_neuron_results
- Context Manager for Hooks
- Attention Head Detectors
Thanks to all the Contributors!
Many thanks to: @rusheb, @ckkissane, @slavachalnev, @JayBaileyCS, @zshn-gvg, @jbloomAus, @adzcai, @adamyedidia, @ArthurConmy, @bryce13950, @daspartho, @haileyschoelkopf, @0amp
Full Changelog: v1.2.1...v1.2.2