Skip to content

autodriving-heart/ECCV-2024-Papers-Autonomous-Driving

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 

Repository files navigation

ECCV-2024-Papers-Autonomous-Driving

We will promptly include more related works in this repository. Please stay tuned!!!

We also kindly invite you to our platform, Auto Driving Heart, for paper interpretation and sharing. If you would like to promote your work, please feel free to contact me.

1) End to End | 端到端自动驾驶

GenAD: Generative End-to-End Autonomous Driving

CarFormer: Self-Driving with Learned Object-Centric Representations

2)LLM Agent | 大语言模型智能体

DriveLM: Driving with Graph Visual Question Answering

ELM: Embodied Understanding of Driving Scenarios

Controllable Navigation Instruction Generation with Chain of Thought Prompting

Asynchronous Large Language Model Enhanced Planner for Autonomous Driving

  • paper: coming soon
  • code: coming soon

TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes

Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection

Dolphins: Multimodal Language Model for Driving

3)SSC: Semantic Scene Completion | 语义场景补全

Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion

Pyramid Diffusion for Fine 3D Large Scene Generation

4)OCC: Occupancy Prediction | 占用感知

Fully Sparse 3D Occupancy Prediction

GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction

Occupancy as Set of Points

Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation

ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers

nuCraft: Crafting High Resolution 3D Semantic Occupancy for Unified 3D Scene Understanding

5) World Model | 世界模型

OccWorld: 3D World Model for Autonomous Driving

Modelling Competitive Behaviors in Autonomous Driving Under Generative World Model

  • paper: coming soon
  • code: coming soon

DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving

6)HD-Mapping

MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping

ADMap: Anti-disturbance framework for reconstructing online vectorized HD map

Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention

Leveraging Enhanced Queries of Point Sets for Vectorized Map Construction

7)Foundation Model

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

  • paper: coming soon
  • code: coming soon

8)Robust Perception

Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather

R2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations

9)3D Object Detection | 三维目标检测

Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance

GraphBEV: Towards Robust BEV Feature Alignment for Multi-Modal 3D Object Detection

RecurrentBEV: A Long-term Temporal Fusion Framework for Multi-view 3D Detection

Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object Detection

MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection

DualBEV: CNN is All You Need in View Transformation

OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression

  • paper: coming soon
  • code: coming soon

SEED: A Simple and Effective 3D DETR in Point Clouds

  • paper: coming soon
  • code: coming soon

Towards Stable 3D Object Detection

FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection

HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras

RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception

Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene

SimPB: A Single Model for 2D and 3D Object Detection from Multiple Cameras

Interactive 3D Object Detection with Prompts

CSOT: Cross-Scan Object Transfer for Semi-Supervised LiDAR Object Detection

Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene

10)Domain Adaptation & Test-Time Adaptation

Enhancing Source-Free Domain Adaptive Object Detection with Low-Confidence Pseudo-Label Distillation

Fully Test-Time Adaptation for Monocular 3D Object Detection

Progressive Classifier and Feature Extractor Adaptation for Unsupervised Domain Adaptation on Point Clouds

CMD: A Cross Mechanism Domain Adaptation Dataset for 3D Object Detection

11)Cooperative Perception | 协同感知

Plug and Play: A Representation Enhanced Domain Adapter for Collaborative Perception

Align before Collaborate: Mitigating Feature Misalignment for Robust Multi-Agent Perception

12)World Model

Neural Volumetric World Models for Autonomous Driving

13)Scene Flow Estimation | 场景流估计

4D Contrastive Superflows are Dense 3D Representation Learners

SeFlow: A Self-Supervised Scene Flow Method in Autonomous Driving

I Can't Believe It's Not Scene Flow!

14)Point Cloud Semantic Segmentation| 点云

T-CorresNet: Template Guided 3D Point Cloud Completion with Correspondence Pooling Query Generation Strategy

Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation

SFPNet: Sparse Focal Point Network for Semantic Segmentation on General LiDAR Point Clouds

RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation

ItTakesTwo: Leveraging Peer Representations for Semi-supervised LiDAR Semantic Segmentation

Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather

MUSES: The Multi-Sensor Semantic Perception Dataset for Driving under Uncertainty

Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation

T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning

15) Generative Model

RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios

SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic

Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering

16) Optical Flow

SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow

17)Radar | 毫米波雷达

Sparse Beats Dense: Rethinking Supervision in Radar-Camera Depth Completion

18)Nerf Gaussian Splatting

Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting

MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images

GScream: Learning 3D Geometry and Feature Consistent Gaussian Splatting for Object Removal

BeNeRF: Neural Radiance Fields from a Single Blurry Image and Event Stream

PreSight: Enhancing Autonomous Vehicle Perception with City-Scale NeRF Priors

GaussianImage: 1000 FPS Image Representation and Compression by 2D Gaussian Splatting

SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization

Disentangled Generation and Aggregation for Robust Radiance Fields

RPBG: Towards Robust Neural Point-based Graphics in the Wild

19)Object Tracking | 目标跟踪

Beyond MOT: Semantic Multi-Object Tracking

3D Single-object Tracking in Point Clouds with High Temporal Variation

OneTrack: Demystifying the Conflict Between Detection and Tracking in End-to-End 3D Trackers

Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Object Appearance Graphs

Boosting 3D Single Object Tracking with 2D Matching Distillation and 3D Pre-training

20)Lane Detection | 车道线检测

OMR: Occlusion-Aware Memory-Based Refinement for Video Lane Detection

RoadPainter: Points Are Ideal Navigators for Topology transformER

21) Motion Prediction | 运动预测

22) Trajectory Prediction | 轨迹预测

VisionTrap: Vision-Augmented Trajectory Prediction Guided by Textual Descriptions

Risk-Aware Self-Consistent Imitation Learning for Trajectory Planning in Autonomous Driving

23) Depth Estimation | 深度估计

Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation

ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion

Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions

Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation

M2Depth: Self-supervised Two-Frame Multi-camera Metric Depth Estimation

24) Event Camera | 事件相机

25) Odometry

DVLO: Deep Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional Structure Alignment

26)Normalized Object Coordinates

OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects

Postscript

This list of papers is primarily curated by Rujia Wang.

If you have any questions about the paper list, please do not hesitate to email me and [Auto Driving Heart Team] or open an issue on GitHub.

About

ECCV 2024 Paper List about Autonomous Driving

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published