15 12 5

yueliu1999

https://yueliu1999.github.io/

yueliu1999

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

upvoted a paper 5 days ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

commented on a paper 5 days ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

View all activity

Organizations

None yet

yueliu1999's activity

authored a paper 5 days ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published 5 days ago • 43

upvoted a paper 5 days ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published 5 days ago • 43

commented a paper 5 days ago

FlowReasoner: Reinforcing Query-Level Meta-Agents

Paper • 2504.15257 • Published 5 days ago • 43 •

upvoted a paper 9 days ago

NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation

Paper • 2504.13055 • Published 10 days ago • 18

New activity in yueliu1999/GuardReasoner-1B 12 days ago

Output is broken

#4 opened 13 days ago by

AmenRa

New activity in yueliu1999/GuardReasoner-3B 12 days ago

Output is broken

#4 opened 13 days ago by

AmenRa

New activity in yueliu1999/GuardReasoner-8B 12 days ago

Output is broken

#3 opened 13 days ago by

AmenRa

upvoted a paper 19 days ago

Thinking Longer, Not Larger: Enhancing Software Engineering Agents via Scaling Test-Time Compute

Paper • 2503.23803 • Published 27 days ago • 8

upvoted a paper 25 days ago

JudgeLRM: Large Reasoning Models as a Judge

Paper • 2504.00050 • Published 27 days ago • 60

authored a paper 26 days ago

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published 29 days ago • 46

upvoted a paper 26 days ago

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published 29 days ago • 46

commented a paper 26 days ago

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published 29 days ago • 46 •

upvoted a paper about 1 month ago

Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation

Paper • 2503.19622 • Published Mar 25 • 30

upvoted a collection 3 months ago

GuardReasoner

Collection

As LLMs increasingly impact safety-critical applications, ensuring their safety using guardrails remains a key challenge. This paper proposes GuardRea • 5 items • Updated Feb 8 • 1

updated a collection 3 months ago

GuardReasoner

Collection

As LLMs increasingly impact safety-critical applications, ensuring their safety using guardrails remains a key challenge. This paper proposes GuardRea • 5 items • Updated Feb 8 • 1

authored a paper 3 months ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published Jan 30 • 87