On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving Paper • 2311.05332 • Published Nov 9, 2023
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds Paper • 2306.06023 • Published Jun 9, 2023
OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving Paper • 2402.03830 • Published Feb 6, 2024
DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes Paper • 2409.04003 • Published Sep 6, 2024
O$^2$-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering Paper • 2505.16582 • Published May 22
KG-TRACES: Enhancing Large Language Models with Knowledge Graph-constrained Trajectory Reasoning and Attribution Supervision Paper • 2506.00783 • Published Jun 1
IWR-Bench: Can LVLMs reconstruct interactive webpage from a user interaction video? Paper • 2509.24709 • Published 17 days ago
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks Paper • 2510.08002 • Published 7 days ago
RE-Searcher: Robust Agentic Search with Goal-oriented Planning and Self-reflection Paper • 2509.26048 • Published 16 days ago