OpenLemur

non-profit

https://xlang.ai

XLangNLP

OpenLemur

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

tianbaoxiexxx authored a paper 7 days ago

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

tianbaoxiexxx authored a paper 7 days ago

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

tianbaoxiexxx authored a paper 7 days ago

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

View all activity

tianbaoxiexxx

authored 5 papers 7 days ago

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant

Paper • 2410.18603 • Published Oct 24, 2024 • 33

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Paper • 2505.19897 • Published May 26 • 103

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

Paper • 2506.13651 • Published Jun 16 • 9

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents

Paper • 2507.19478 • Published 26 days ago • 29

OpenCUA: Open Foundations for Computer-Use Agents

Paper • 2508.09123 • Published 8 days ago • 25

SivilTaram

authored 2 papers about 1 month ago

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16 • 41

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9 • 23

SivilTaram

authored a paper about 2 months ago

ZeCO: Zero Communication Overhead Sequence Parallelism for Linear Attention

Paper • 2507.01004 • Published Jul 1 • 10

koalazf99

authored a paper about 2 months ago

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25 • 46

koalazf99

authored a paper 2 months ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

SivilTaram

authored a paper 3 months ago

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published May 20 • 23

ranpox

authored a paper 3 months ago

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published May 19 • 46

tianbaoxiexxx

authored a paper 3 months ago

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Paper • 2505.13227 • Published May 19 • 46

huybery

authored 4 papers 3 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 276

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 276

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published May 15 • 83

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published May 15 • 34

OliverZhao

authored a paper 4 months ago

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning

Paper • 2504.18904 • Published Apr 26 • 9

koalazf99

authored a paper 4 months ago

MegaMath: Pushing the Limits of Open Math Corpora

Paper • 2504.02807 • Published Apr 3 • 34

SivilTaram

authored a paper 5 months ago

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Paper • 2411.07763 • Published Nov 12, 2024 • 2