Knowledge Works Lab at Fudan University

university

Verified

http://kw.fudan.edu.cn

Recent Activity

xjhuang

authored a paper 10 days ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published 11 days ago • 36

LibraTree

authored 2 papers about 1 month ago

Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better

Paper • 2506.09040 • Published Jun 10 • 36

GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization

Paper • 2506.07160 • Published Jun 8 • 3

siyuyuan

authored a paper about 2 months ago

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Paper • 2505.19914 • Published May 26 • 44

jiangjiechen

authored 4 papers about 2 months ago

TimeArena: Shaping Efficient Multitasking Language Agents in a Time-Aware Simulation

Paper • 2402.05733 • Published Feb 8, 2024

SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals

Paper • 2406.04784 • Published Jun 7, 2024 • 2

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 134

Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

Paper • 2504.13914 • Published Apr 10 • 3

hsaest

authored a paper about 2 months ago

ARM: Adaptive Reasoning Model

Paper • 2505.20258 • Published May 26 • 44

jiangjiechen

authored a paper about 2 months ago

Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles

Paper • 2505.19914 • Published May 26 • 44

xjhuang

authored 2 papers 2 months ago

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published May 15 • 34

A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models

Paper • 2505.07591 • Published May 12 • 11

EZ-hwh

authored a paper 2 months ago

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

Paper • 2505.02735 • Published May 5 • 32

EZ-hwh

authored a paper 3 months ago

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

Paper • 2504.15415 • Published Apr 21 • 22

LibraTree

authored 3 papers 3 months ago

LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition

Paper • 2402.14568 • Published Feb 22, 2024 • 1

Uncertainty Aware Learning for Language Model Alignment

Paper • 2406.04854 • Published Jun 7, 2024

VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search

Paper • 2504.09130 • Published Apr 12 • 12

dongdong2021

authored a paper 3 months ago

TransMamba: Flexibly Switching between Transformer and Mamba

Paper • 2503.24067 • Published Mar 31 • 21

EZ-hwh

authored a paper 3 months ago

COIG-P: A High-Quality and Large-Scale Chinese Preference Dataset for Alignment with Human Values

Paper • 2504.05535 • Published Apr 7 • 44

siyuyuan

authored a paper 4 months ago

Implicit Reasoning in Transformers is Reasoning through Shortcuts

Paper • 2503.07604 • Published Mar 10 • 23