BytedTsinghua-SIA

university

https://air.tsinghua.edu.cn/en/About_Us/About_AIR.htm

BytedTsinghua-SIA's activity

jiangjiechen

updated a collection 14 days ago

DAPO

4 items • Updated 14 days ago

jiangjiechen

updated a collection 17 days ago

Enigmata

Resources for the Enigmata Project: https://seed-enigmata.github.io. • 4 items • Updated 17 days ago • 2

jiangjiechen

updated a dataset 17 days ago

BytedTsinghua-SIA/Enigmata-Eval

Viewer • Updated 17 days ago • 4.76k • 686 • 1

jiangjiechen

published a dataset 17 days ago

BytedTsinghua-SIA/Enigmata-Eval

Viewer • Updated 17 days ago • 4.76k • 686 • 1

jiangjiechen

authored 5 papers 17 days ago

TimeArena: Shaping Efficient Multitasking Language Agents in a Time-Aware Simulation

Paper • 2402.05733 • Published Feb 8, 2024

SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals

Paper • 2406.04784 • Published Jun 7, 2024 • 2

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 128

Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

Paper • 2504.13914 • Published Apr 10 • 1

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Paper • 2505.19914 • Published 18 days ago • 42

jiangjiechen

updated a model 17 days ago

BytedTsinghua-SIA/Enigmata-Qwen2.5-32B

Updated 17 days ago • 349 • 1

jiangjiechen

published a model 18 days ago

BytedTsinghua-SIA/Enigmata-Qwen2.5-32B

Updated 17 days ago • 349 • 1

dyyyson

updated a collection 29 days ago

DAPO

4 items • Updated 14 days ago

dyyyson

updated a model about 1 month ago

BytedTsinghua-SIA/DAPO-Qwen-32B

Text Generation • Updated May 10 • 8.7k • 3

dyyyson

in BytedTsinghua-SIA/DAPO-Qwen-32B about 1 month ago

Improve language tag

#1 opened about 2 months ago by

qiying

updated a dataset about 2 months ago

BytedTsinghua-SIA/DAPO-Math-17k

Viewer • Updated Apr 18 • 1.79M • 2.94k • 76

dyyyson

published a model 2 months ago

BytedTsinghua-SIA/DAPO-Qwen-32B

Text Generation • Updated May 10 • 8.7k • 3

tongyx361

authored a paper 3 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 128