Kangyang Luo

lKangyang

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 8 days ago

BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning

Paper • 2603.04918 • Published 9 days ago • 54

upvoted a paper 10 days ago

Utonia: Toward One Encoder for All Point Clouds

Paper • 2603.03283 • Published 11 days ago • 165

upvoted a paper 19 days ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published Feb 4 • 22

upvoted 3 papers about 1 month ago

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Paper • 2602.08794 • Published Feb 9 • 156

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

Paper • 2601.21459 • Published Jan 29 • 10

TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents

Paper • 2602.02196 • Published Feb 2 • 35

upvoted a paper 3 months ago

FaithLens: Detecting and Explaining Faithfulness Hallucination

Paper • 2512.20182 • Published Dec 23, 2025 • 9

upvoted a paper 4 months ago

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28, 2025 • 72

upvoted 4 papers 5 months ago

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27, 2025 • 97

Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1

Paper • 2510.19600 • Published Oct 22, 2025 • 69

QueST: Incentivizing LLMs to Generate Difficult Problems

Paper • 2510.17715 • Published Oct 20, 2025 • 35

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Paper • 2509.26490 • Published Sep 30, 2025 • 20

upvoted a paper 10 months ago

Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning

Paper • 2505.16483 • Published May 22, 2025 • 10

authored a paper over 1 year ago

Selecting Influential Samples for Long Context Alignment via Homologous Models' Guidance and Contextual Awareness Measurement

Paper • 2410.15633 • Published Oct 21, 2024 • 7