A Collection of Papers and Codes for ECCV2024 AIGC
A curated collection of this year's ECCV papers and code related to AIGC, listed below.
Please feel free to star, fork, or open a PR if you find it helpful~
- Awesome-CVPR2024-AIGC
- Awesome-AIGC-Research-Groups
- Awesome-ECCV2024-ECCV2020-Low-Level-Vision
- Awesome-CVPR2024-CVPR2021-CVPR2020-Low-Level-Vision
ECCV 2024 official website: https://eccv.ecva.net/
ECCV accepted paper list:
ECCV full paper archive: https://www.ecva.net/papers.php
Conference dates: September 29 to October 4, 2024
Paper acceptance announcement: 2024
【Contents】
- 1. Image Generation / Image Synthesis
- 2. Image Editing
- 3. Video Generation / Video Synthesis
- 4. Video Editing
- 5. 3D Generation / 3D Synthesis
- 6. 3D Editing
- 7. Multi-Modal Large Language Model
- 8. Others
- Paper: https://arxiv.org/abs/2407.14709
- Code:
AID-AppEAL: Automatic Image Dataset and Algorithm for Content Appeal Enhancement and Assessment Labeling
AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild
- Paper: https://arxiv.org/abs/2402.11849
- Code:
Co-synthesis of Histopathology Nuclei Image-Label Pairs using a Context-Conditioned Joint Diffusion Model
- Paper: https://arxiv.org/abs/2407.15111
- Code:
- Paper:
- Code: https://github.com/IVRL/AugSal
DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control
- Paper:
- Code: https://github.com/murphytju/DiffFAS
- Paper: https://arxiv.org/abs/2405.05967
- Code:
- Paper: https://arxiv.org/abs/2407.11966
- Code:
- Paper: https://www.ecva.net/papers/eccv_2024/papers_ECCV/html/2529_ECCV_2024_paper.php
- Code: https://github.com/aim-uofa/FreeCompose
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance
- Paper: https://arxiv.org/abs/2406.04551
- Code: https://github.com/facebookresearch/Contextualized-Vendi-Score-Guidance
Iterative Ensemble Training with Anti-Gradient Control for Mitigating Memorization in Diffusion Models
- Paper: https://arxiv.org/abs/2401.12244
- Code: https://github.com/pinterest/atg-research/tree/main/joint-rl-diffusion
Lego: Learning to Disentangle and Invert Personalized Concepts Beyond Object Appearance in Text-to-Image Diffusion Models
- Paper: https://arxiv.org/abs/2407.13752
- Code:
- Paper:
- Code: https://github.com/RossoneriZhao/iced_coke
MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
NeuroPictor: Refining fMRI-to-Image Reconstruction via Multi-individual Pretraining and Multi-level Modulation
- Paper: https://arxiv.org/abs/2404.00995
- Code:
Post-training Quantization for Text-to-Image Diffusion Models with Progressive Calibration and Activation Relaxing
- Paper: https://arxiv.org/abs/2404.00995
- Code:
- Paper: https://arxiv.org/abs/2408.02226
- Code: https://github.com/Agentic-Learning-AI-Lab/procreate-diffusion-public
- Paper: https://arxiv.org/abs/2311.17717
- Code:
- Paper: https://arxiv.org/abs/2403.13589
- Code:
- Paper: https://arxiv.org/abs/2403.17377
- Code: https://github.com/sunovivid/Perturbed-Attention-Guidance
- Paper: https://arxiv.org/abs/2408.14176
- Code:
- Paper:
- Code: https://github.com/Robin-WZQ/T2IShield
The Gaussian Discriminant Variational Autoencoder (GdVAE): A Self-Explainable Model with Counterfactual Explanations
- Paper: https://arxiv.org/abs/2408.12352
- Code:
- Paper: https://arxiv.org/abs/2407.03917
- Code:
- Paper: https://arxiv.org/abs/2407.13609
- Code:
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models
- Paper: https://arxiv.org/abs/2407.12642
- Code:
- Paper: https://arxiv.org/abs/2408.13922
- Code:
- Paper:
- Code: https://github.com/john09282922/CQS
- Paper: https://www.ecva.net/papers/eccv_2024/papers_ECCV/html/1909_ECCV_2024_paper.php
- Code: https://github.com/JS-Lee525/PIC
- Paper: https://arxiv.org/abs/2406.04413
- Code: https://github.com/VIROBO-15/Efficient-3D-Aware-Facial-Image-Editing
Enhanced Controllability of Diffusion Models via Feature Disentanglement and Realism-Enhanced Sampling Methods
- Paper: https://arxiv.org/abs/2302.14368
- Code:
- Paper: https://www.ecva.net/papers/eccv_2024/papers_ECCV/html/2157_ECCV_2024_paper.php
- Code: https://github.com/furiosa-ai/eta-inversion
Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization
- Paper: https://www.ecva.net/papers/eccv_2024/papers_ECCV/html/759_ECCV_2024_paper.php
- Code: https://github.com/Thermal-Dynamics/FreeDiff
- Paper: https://arxiv.org/abs/2403.05018
- Code:
- Paper: https://arxiv.org/abs/2409.00674
- Code:
- Paper: https://arxiv.org/abs/2404.04833
- Code:
- Paper: https://arxiv.org/abs/2403.04437
- Code:
- Paper: https://arxiv.org/abs/2403.12658
- Code:
- Paper: https://arxiv.org/abs/2408.08332
- Code:
- Paper: https://arxiv.org/abs/2403.09069
- Code: https://github.com/Boese0601/Dyadic-Interaction-Modeling
MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model
- Paper: https://arxiv.org/abs/2311.11325
- Code:
- Paper: https://www.ecva.net/papers/eccv_2024/papers_ECCV/html/2790_ECCV_2024_paper.php
- Code: https://github.com/hechang25/MVSD
Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
- Paper:
- Code: https://github.com/Vchitect/VEnhancer
- Paper: https://arxiv.org/abs/2409.13037
- Code:
- Paper: https://arxiv.org/abs/2403.12002
- Code:
- Paper: https://arxiv.org/abs/2407.07554
- Code:
Beyond the Contact: Discovering Comprehensive Affordance for 3D Objects from Pre-trained 2D Diffusion Models
- Paper: https://arxiv.org/abs/2408.14860
- Code:
- Paper: https://www.ecva.net/papers/eccv_2024/papers_ECCV/html/2100_ECCV_2024_paper.php
- Code: https://github.com/HyoKong/DreamDrone
- Paper: https://www.ecva.net/papers/eccv_2024/papers_ECCV/html/168_ECCV_2024_paper.php
- Code: https://github.com/Frank-ZY-Dou/EMDM
- Paper: https://arxiv.org/abs/2408.00296
- Code:
- Paper: https://arxiv.org/abs/2408.00297
- Code:
- Paper: https://arxiv.org/abs/2312.07231
- Code:
- Paper: https://arxiv.org/abs/2407.11174
- Code:
- Paper: https://arxiv.org/abs/2407.11174
- Code:
JointDreamer: Ensuring Geometry Consistency and Text Congruence in Text-to-3D Generation via Joint Score Distillation
- Paper: https://arxiv.org/abs/2407.12291
- Code:
- Paper:
- Code: https://github.com/ffxzh/KMTalk
- Paper: https://arxiv.org/abs/2407.11532
- Code:
- Paper: https://www.ecva.net/papers/eccv_2024/papers_ECCV/html/501_ECCV_2024_paper.php
- Code: https://github.com/NIRVANALAN/LN3Diff
- Paper: https://arxiv.org/abs/2407.10528
- Code:
Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM
MVDiffHD: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction
- Paper: https://www.ecva.net/papers/eccv_2024/papers_ECCV/html/2446_ECCV_2024_paper.php
- Code: https://github.com/Tangshitao/MVDiffusion_plusplus
NeuSDFusion: A Spatial-Aware Generative Model for 3D Shape Completion, Reconstruction, and Generation
- Paper: https://arxiv.org/abs/2403.18241
- Code:
- Paper: https://arxiv.org/abs/2311.12085
- Code: https://github.com/yuhengliu02/pyramid-discrete-diffusion
Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting
- Paper: https://arxiv.org/abs/2407.20727
- Code:
- Paper: https://arxiv.org/abs/2408.01291
- Code:
- Paper: https://www.ecva.net/papers/eccv_2024/papers_ECCV/html/698_ECCV_2024_paper.php
- Code: https://github.com/YG256Li/UniDream
- Paper: https://arxiv.org/abs/2407.04461
- Code:
- Paper: https://www.ecva.net/papers/eccv_2024/papers_ECCV/html/1890_ECCV_2024_paper.php
- Code: https://github.com/sfanxiang/videoshop
Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models
- Paper:
- Code: https://github.com/SupstarZh/VividDreamer
- Paper: https://arxiv.org/abs/2407.10102
- Code:
- Paper: https://arxiv.org/abs/2312.13663
- Code: https://github.com/nazmul-karim170/FreeEditor-Text-to-3D-Scene-Editing
- Paper: https://www.ecva.net/papers/eccv_2024/papers_ECCV/html/8662_ECCV_2024_paper.php
- Code: https://github.com/qwang666/RoomTex-
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model
- Paper:
- Code: https://github.com/ChaduCheng/TypoDeceptions
AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting
An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
- Paper:
- Code: https://github.com/yu-rp/apiprompting
- Paper: https://arxiv.org/abs/2408.06662
- Code:
- Paper: https://arxiv.org/abs/2408.05926
- Code:
- Paper: https://arxiv.org/abs/2311.16445
- Code:
- Paper: https://arxiv.org/abs/2407.07433
- Code:
- Paper:
- Code: https://github.com/zhaohengyuan1/Genixer
EventBind: Learning a Unified Representation to Bind Them All for Event-based Open-world Understanding
- Paper: https://arxiv.org/abs/2408.02788
- Code:
Introducing Routing Functions to Vision-Language Parameter-Efficient Fine-Tuning with Low-Rank Bottlenecks
- Paper: https://arxiv.org/abs/2408.05019
- Code:
- Paper:
- Code: https://github.com/YBZh/LAPT
- Paper: https://arxiv.org/abs/2408.14805
- Code: https://github.com/AlibabaResearch/AdvancedLiterateMachinery/tree/main/OCR/Platypus
- Paper:
- Code: https://github.com/agneet42/revision
- Paper: https://arxiv.org/abs/2409.10542
- Code: https://github.com/AI-Application-and-Integration-Lab/SAM4MLLM
- Paper:
- Code: https://github.com/wuyongjianCODE/SDPT
- Paper: https://arxiv.org/abs/2403.11299
- Code:
Unveiling Typographic Deceptions: Insights of the Typographic Vulnerability in Large Vision-Language Model
- Paper: https://arxiv.org/abs/2407.13851
- Code:
Continuously updated~