Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 107 items • Updated 16 days ago • 99
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models Paper • 2504.03624 • Published 25 days ago • 13
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published Mar 20 • 48
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning Paper • 2503.07572 • Published Mar 10 • 44
LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization Paper • 2502.13922 • Published Feb 19 • 28
Towards Physical Understanding in Video Generation: A 3D Point Regularization Approach Paper • 2502.03639 • Published Feb 5 • 9
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models Paper • 2501.03124 • Published Jan 6 • 14
Multimodal Latent Language Modeling with Next-Token Diffusion Paper • 2412.08635 • Published Dec 11, 2024 • 45
Large Multi-modal Models Can Interpret Features in Large Multi-modal Models Paper • 2411.14982 • Published Nov 22, 2024 • 17
Multimodal-SAE Collection A collection of the SAEs hooked onto LLaVA • 5 items • Updated Mar 4 • 8
GUI agents Collection A collection of papers on GUI agents • 3 items • Updated Dec 14, 2024 • 5
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published Dec 4, 2024 • 135
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions Paper • 2411.14405 • Published Nov 21, 2024 • 62