yxm's picture

2

yxm

jokester-yxm

·

AI & ML interests

AI agents

Recent Activity

upvoted a paper 2 days ago

InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models

authored a paper 6 days ago

On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving

authored a paper 6 days ago

DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds

View all activity

Organizations

authored 9 papers 6 days ago

On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving

Paper • 2311.05332 • Published Nov 9, 2023 • 13

DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds

Paper • 2306.06023 • Published Jun 9, 2023

OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving

Paper • 2402.03830 • Published Feb 6, 2024 • 2

DreamForge: Motion-Aware Autoregressive Video Generation for Multi-View Driving Scenes

Paper • 2409.04003 • Published Sep 6, 2024 • 1

O$^2$-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering

Paper • 2505.16582 • Published May 22

KG-TRACES: Enhancing Large Language Models with Knowledge Graph-constrained Trajectory Reasoning and Attribution Supervision

Paper • 2506.00783 • Published Jun 1 • 1

IWR-Bench: Can LVLMs reconstruct interactive webpage from a user interaction video?

Paper • 2509.24709 • Published 17 days ago • 4

Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks

Paper • 2510.08002 • Published 7 days ago • 22

RE-Searcher: Robust Agentic Search with Goal-oriented Planning and Self-reflection

Paper • 2509.26048 • Published 16 days ago • 7