Visualize Video DiT Attention Maps #649

Open
1 of 2 tasks
Passenger12138 opened this issue Jan 10, 2025 · 0 comments

System Info

Visualize attention maps for video generation models based on the Diffusers Transformer.

Information

  • The official example scripts
  • My own modified scripts and tasks

Reproduction

1. Reference paper: https://arxiv.org/abs/2412.18597
2. Code: https://github.com/Passenger12138/attention-map-diffusers-vdm.git

Expected behavior

Text-to-text attention map:
[image: attention-t2t]

Text-to-video attention map for the word "jacket":
[image: image 1]
Video: https://github.com/user-attachments/assets/8ae0f67e-abdb-4aa4-bb65-95b31feae222
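
For readers unfamiliar with the two maps named above, here is a minimal NumPy sketch of how they could be sliced out of a joint attention matrix. It is not taken from the linked repository. It assumes that text tokens come first in the sequence and are followed by the video tokens, that the attention is plain softmax over scaled dot products, and that heads are averaged. All function names, shapes, and the token index are hypothetical.

```python
import numpy as np

def joint_attention_weights(q, k):
    """Softmax attention weights for q, k of shape (heads, seq, dim)."""
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(q.shape[-1])
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    return weights / weights.sum(axis=-1, keepdims=True)

def text_to_text_map(weights, num_text):
    """Text queries attending to text keys, averaged over heads."""
    return weights[:, :num_text, :num_text].mean(axis=0)

def text_to_video_map(weights, num_text, token_index, frames, height, width):
    """Attention from every video token to one text token, as a (F, H, W) volume."""
    # Rows are video queries, the column is the chosen text token (assumed layout).
    per_head = weights[:, num_text:, token_index]
    return per_head.mean(axis=0).reshape(frames, height, width)

# Toy sizes (hypothetical): 5 text tokens, 2 latent frames of 3x4 patches.
rng = np.random.default_rng(0)
num_text, frames, height, width, heads, dim = 5, 2, 3, 4, 2, 8
seq = num_text + frames * height * width
q = rng.standard_normal((heads, seq, dim))
k = rng.standard_normal((heads, seq, dim))

weights = joint_attention_weights(q, k)
t2t = text_to_text_map(weights, num_text)
t2v = text_to_video_map(weights, num_text, token_index=2,
                        frames=frames, height=height, width=width)
print(t2t.shape, t2v.shape)
```

In a real pipeline, `q` and `k` would have to be captured inside the model's attention layers, and each frame of the resulting volume could be upsampled and overlaid on the decoded video, for example for the token corresponding to "jacket".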

@zRzRzRzRzRzRzR self-assigned this Jan 11, 2025
@zRzRzRzRzRzRzR added the "good first issue" (Good for newcomers) label Jan 11, 2025