10 21 3

Bo Liu

Benjamin-eecs

https://benjamin-eecs.github.io/

AI & ML interests

Reinforcement Learning, Reasoning, Machine Learning Systems

Recent Activity

liked a dataset 27 days ago

facebook/principia-collection

authored a paper 30 days ago

Scaling Agent Learning via Experience Synthesis

upvoted a paper 30 days ago

Scaling Agent Learning via Experience Synthesis

View all activity

Organizations

authored a paper 30 days ago

Scaling Agent Learning via Experience Synthesis

Paper • 2511.03773 • Published Nov 5 • 80

authored a paper about 1 month ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published Oct 28 • 15

authored 2 papers about 2 months ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published Oct 9 • 35

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 266

authored 3 papers 2 months ago

authored a paper 3 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31 • 84

authored a paper 5 months ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30 • 50

authored a paper 8 months ago

TextArena

Paper • 2504.11442 • Published Apr 15 • 29

authored a paper 10 months ago

EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine

Paper • 2206.10558 • Published Jun 21, 2022 • 1

authored a paper about 1 year ago

Natural Language Reinforcement Learning

Paper • 2411.14251 • Published Nov 21, 2024 • 31

authored 4 papers over 1 year ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 60

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7, 2024 • 24

DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data

Paper • 2405.14333 • Published May 23, 2024 • 43

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Paper • 2403.05525 • Published Mar 8, 2024 • 48

authored a paper almost 2 years ago

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Paper • 2401.02954 • Published Jan 5, 2024 • 50

Bo Liu

AI & ML interests

Recent Activity

Organizations

Benjamin-eecs's activity