Enhancing Diffusion-Based End-to-End Autonomous Driving with VLM

Overview

I improved a diffusion-based end-to-end (E2E) autonomous driving pipeline by introducing VLM-guided dynamic anchor sampling for trajectory diversity and a VLM-based Trajectory Scorer for robust final trajectory selection. :contentReference[oaicite:2]{index=2}

What I Did

VLM-guided dynamic anchor sampling
- Increased trajectory diversity and overall performance by dynamically sampling anchor trajectories with VLM guidance. :contentReference[oaicite:3]{index=3}
- Switched to a lightweight YOLO-based sampler for real-time operation while maintaining comparable results. :contentReference[oaicite:4]{index=4}
VLM-based Trajectory Scorer (re-ranking)
- Built a trajectory re-ranking module that evaluates candidates using:
  - Centerline alignment
  - Collision-avoidance metrics
- Improved final path selection by scoring and selecting the best candidate trajectory. :contentReference[oaicite:5]{index=5}

My Role

Owner of the core algorithmic improvements:

Designed the sampling + scoring architecture
Implemented real-time-friendly sampling (YOLO-based)
Integrated the re-ranking scorer into the E2E planning stack :contentReference[oaicite:6]{index=6}

Tech Stack

Deep Learning / E2E Driving: Diffusion-based planning pipeline
Vision & Sampling: YOLO-based lightweight sampler
Reasoning & Scoring: VLM-based candidate evaluation module :contentReference[oaicite:7]{index=7}

Media

Below are placeholders. Replace them with your own screenshots (qualitative results, failure cases, ablations, etc.).

Example visuals: sampling diversity, collision-avoidance improvements, and selected trajectory comparisons.

Notes

This page is written to be readable to both research and engineering audiences.
If you want, I can also generate a concise “portfolio card” summary (2–3 lines) for the main projects page.