v1.2.2

@jbloomAus released this 24 Apr 07:31
· 326 commits to main since this release
25a9c07

What's Changed

There are too many commit messages to list individually, so here is a summary.

General Features:

  • Pipeline Parallelism
  • The cache no longer moves tensors across devices unless explicitly told to

New Models:

  • Redwood 2L
  • New Pythia Models
  • LLaMA

Analysis Features:

  • Add an apply_ln option to stack_head_results and stack_neuron_results
  • Context Manager for Hooks
  • Attention Head Detectors
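
To illustrate the general idea behind a context manager for hooks, here is a minimal sketch in plain Python. It is not the library's implementation or API: `TinyModel` and `temporary_hooks` are hypothetical names invented for this example, and the "model" just doubles its input. The point it shows is that hooks attached inside a `with` block are removed again on exit, even if an error occurs.

```python
from contextlib import contextmanager


class TinyModel:
    """Hypothetical stand-in for a hooked model (not the library's class)."""

    def __init__(self):
        # Callables applied, in order, to the activation during forward().
        self.hooks = []

    def forward(self, x):
        activation = x * 2
        for hook in self.hooks:
            activation = hook(activation)
        return activation


@contextmanager
def temporary_hooks(model, hooks):
    """Attach hooks for the duration of a with-block, then restore the old ones."""
    saved = list(model.hooks)
    model.hooks.extend(hooks)
    try:
        yield model
    finally:
        # Runs on normal exit and on exceptions, so hooks never leak.
        model.hooks[:] = saved


model = TinyModel()
with temporary_hooks(model, [lambda a: a + 1]):
    hooked = model.forward(3)  # hook is active here
clean = model.forward(3)  # hook has been removed again
```

The `try`/`finally` is the design choice that matters: without it, an exception raised inside the block would leave the temporary hooks attached and silently affect later forward passes.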

Thanks to all the Contributors!

Many thanks to: @rusheb, @ckkissane, @slavachalnev, @JayBaileyCS, @zshn-gvg, @jbloomAus, @adzcai, @adamyedidia, @ArthurConmy, @bryce13950, @daspartho, @haileyschoelkopf, @0amp

Full Changelog: v1.2.1...v1.2.2