SiweiWu

https://wusiwei0410.github.io/

AI & ML interests

NLP, MultiModal model, AIGC

Recent Activity

authored a paper 12 days ago

DocMMIR: A Framework for Document Multi-modal Information Retrieval

authored a paper 12 days ago

Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments

upvoted a paper 11 days ago

On Data Engineering for Scaling LLM Terminal Capabilities

Paper • 2602.21193 • Published 12 days ago • 91

upvoted a paper 13 days ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

upvoted a paper 21 days ago

Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model

Paper • 2602.07422 • Published 29 days ago • 22

upvoted a collection 25 days ago

TerminalTraj

Including TerminalTraj's data, models, and paper • 4 items • Updated 25 days ago • 4

upvoted a paper 25 days ago

Large-Scale Terminal Agentic Trajectory Generation from Dockerized Environments

Paper • 2602.01244 • Published Feb 1 • 16

upvoted a paper 2 months ago

Scaling Laws for Code: Every Programming Language Matters

Paper • 2512.13472 • Published Dec 15, 2025 • 15

upvoted a collection 2 months ago

IQuest-Coder

14 items • Updated 5 days ago • 106

upvoted 2 papers 3 months ago

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Paper • 2512.12730 • Published Dec 14, 2025 • 48

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published Dec 4, 2025 • 80

upvoted a paper 6 months ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published Aug 24, 2025 • 80

upvoted 2 collections 7 months ago

LLMs

468 items • Updated Feb 2 • 43

VisionLM

1884 items • Updated Jan 12 • 144

upvoted a paper 7 months ago

LIME: Less Is More for MLLM Evaluation

Paper • 2409.06851 • Published Sep 10, 2024 • 2

upvoted 2 papers 8 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1, 2025 • 79

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2, 2025 • 131

upvoted 3 papers 9 months ago

OAgents: An Empirical Study of Building Effective Agents

Paper • 2506.15741 • Published Jun 17, 2025 • 35

CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following

Paper • 2506.12285 • Published Jun 14, 2025 • 54

Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15, 2025 • 63

upvoted 2 papers 10 months ago

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

Paper • 2505.02735 • Published May 5, 2025 • 33

Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts

Paper • 2504.21117 • Published Apr 29, 2025 • 26