You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the paper there is a nice visualization of prediction depth. Prediction depth is defined in the paper is the first layer where the most likely token is equal to the token output.
These should be included as part of the PredictionTrajectory class so that we can easily produce them in the future. Note that the code for this should be modular like TrajectoryStatistic since we may want to reuse these visualizations for attention in the future.
The text was updated successfully, but these errors were encountered:
In the paper there is a nice visualization of prediction depth. Prediction depth is defined in the paper is the first layer where the most likely token is equal to the token output.
These should be included as part of the
PredictionTrajectory
class so that we can easily produce them in the future. Note that the code for this should be modular likeTrajectoryStatistic
since we may want to reuse these visualizations for attention in the future.The text was updated successfully, but these errors were encountered: