RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback Paper • 2507.15024 • Published Jul 20 • 14
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published Jun 2 • 184
Towards Better Dynamic Graph Learning: New Architecture and Unified Library Paper • 2303.13047 • Published Mar 23, 2023
Heterogeneous Graph Representation Learning with Relation Awareness Paper • 2105.11122 • Published May 24, 2021
A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models Paper • 2410.13841 • Published Oct 17, 2024 • 17
Extend Model Merging from Fine-Tuned to Pre-Trained Large Language Models via Weight Disentanglement Paper • 2408.03092 • Published Aug 6, 2024 • 1