Visualize Video DiT Attention Maps #649

Open

1 of 2 tasks

Passenger12138 opened this issue Jan 10, 2025 · 0 comments

Passenger12138 commented Jan 10, 2025

System Info

Visualize attention maps for video generation models based on the Diffusers Transformer.

Information

The official example scripts
My own modified scripts

Reproduction

1. Reference paper: https://arxiv.org/abs/2412.18597
2. Code: https://github.com/Passenger12138/attention-map-diffusers-vdm.git

Expected behavior

Text-to-text attention map

Text-to-video attention map for the word "jacket"
image
video https://github.com/user-attachments/assets/8ae0f67e-abdb-4aa4-bb65-95b31feae222
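For readers unfamiliar with how such maps are obtained, here is a minimal NumPy sketch of the general idea. It is not the code from the linked repository. It assumes a joint-attention layout in which text tokens are concatenated before the video tokens, and every name in it (`joint_attention`, `text_to_text_map`, `text_to_video_map`, the toy shapes) is hypothetical and chosen for illustration only.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def joint_attention(q, k):
    """Attention probabilities for one head; q and k have shape (seq_len, head_dim)."""
    head_dim = q.shape[-1]
    return softmax(q @ k.T / np.sqrt(head_dim), axis=-1)

def text_to_text_map(attn, num_text):
    """Block where text queries attend to text keys (assumes text tokens come first)."""
    return attn[:num_text, :num_text]

def text_to_video_map(attn, num_text, token_index, frames, height, width):
    """How strongly each video position attends to one text token, as (frames, height, width)."""
    per_position = attn[num_text:, token_index]
    heat = per_position.reshape(frames, height, width)
    # Rescale to [0, 1] so the map can be overlaid on video frames.
    low, high = heat.min(), heat.max()
    return (heat - low) / (high - low + 1e-8)

# Toy example with random queries and keys (stand-ins for values captured from a real layer).
rng = np.random.default_rng(0)
num_text, frames, height, width, head_dim = 6, 2, 4, 5, 16
seq_len = num_text + frames * height * width
q = rng.standard_normal((seq_len, head_dim))
k = rng.standard_normal((seq_len, head_dim))

attn = joint_attention(q, k)
t2t = text_to_text_map(attn, num_text)
heat = text_to_video_map(attn, num_text, token_index=3, frames=frames, height=height, width=width)
print(t2t.shape, heat.shape)
```

In a real model the queries and keys would be captured from an attention layer (for example by wrapping its attention processor), and `token_index` would be the position of the word of interest, such as "jacket", in the tokenized prompt. Maps are often averaged over heads and layers before display.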

zRzRzRzRzRzRzR self-assigned this

zRzRzRzRzRzRzR added the good first issue label
