Enhancing Diffusion-Based End-to-End Autonomous Driving with VLM

VLM-guided dynamic anchor sampling + trajectory re-ranking for safer and more diverse planning

Overview

I improved a diffusion-based end-to-end (E2E) autonomous driving pipeline by introducing VLM-guided dynamic anchor sampling for trajectory diversity and a VLM-based Trajectory Scorer for robust final trajectory selection. :contentReference[oaicite:2]{index=2}

What I Did

  • VLM-guided dynamic anchor sampling
    • Increased trajectory diversity and overall performance by dynamically sampling anchor trajectories with VLM guidance. :contentReference[oaicite:3]{index=3}
    • Switched to a lightweight YOLO-based sampler for real-time operation while maintaining comparable results. :contentReference[oaicite:4]{index=4}
  • VLM-based Trajectory Scorer (re-ranking)
    • Built a trajectory re-ranking module that evaluates candidates using:
      • Centerline alignment
      • Collision-avoidance metrics
    • Improved final path selection by scoring and selecting the best candidate trajectory. :contentReference[oaicite:5]{index=5}

My Role

Owner of the core algorithmic improvements:

  • Designed the sampling + scoring architecture
  • Implemented real-time-friendly sampling (YOLO-based)
  • Integrated the re-ranking scorer into the E2E planning stack :contentReference[oaicite:6]{index=6}

Tech Stack

  • Deep Learning / E2E Driving: Diffusion-based planning pipeline
  • Vision & Sampling: YOLO-based lightweight sampler
  • Reasoning & Scoring: VLM-based candidate evaluation module :contentReference[oaicite:7]{index=7}

Media

Below are placeholders. Replace them with your own screenshots (qualitative results, failure cases, ablations, etc.).

Example visuals: sampling diversity, collision-avoidance improvements, and selected trajectory comparisons.

Notes

  • This page is written to be readable to both research and engineering audiences.
  • If you want, I can also generate a concise “portfolio card” summary (2–3 lines) for the main projects page.