KABI

dongguanting

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

commented on a paper 2 days ago

Agentic Reinforced Policy Optimization

upvoted a paper about 8 hours ago

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published about 24 hours ago • 63

upvoted a paper 2 days ago

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published 3 days ago • 90

upvoted a paper 3 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 9 days ago • 257

upvoted a paper 4 days ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published 6 days ago • 113

upvoted a collection 8 days ago

ARPO

Collection • The official datasets and model checkpoints of ARPO • 9 items • Updated 3 days ago • 3

upvoted a paper 21 days ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published 22 days ago • 151

upvoted 2 papers 23 days ago

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published 30 days ago • 126

MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published 28 days ago • 142

upvoted a paper 28 days ago

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published 29 days ago • 101

upvoted a paper 29 days ago

Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search

Paper • 2507.02652 • Published 29 days ago • 23

upvoted a paper about 1 month ago

MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings

Paper • 2506.23115 • Published Jun 29 • 36

upvoted 3 collections about 1 month ago

upvoted 4 papers about 2 months ago

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published May 7 • 66

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 260

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Paper • 2506.11763 • Published Jun 13 • 69

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10 • 98

upvoted a paper 2 months ago

Reward Reasoning Model

Paper • 2505.14674 • Published May 20 • 36

upvoted a changelog 2 months ago

Filter by MCP compatibility available in HF Spaces

Changelog • May 21 • 77
