Tom LUCAS's picture

1 38

Tom LUCAS

C0casio45

·

AI & ML interests

None yet

Recent Activity

commented on an article 9 days ago

Judge Arena: Benchmarking LLMs as Evaluators

upvoted a paper 15 days ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

upvoted a paper about 1 month ago

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

View all activity

Organizations

upvoted a paper 15 days ago

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published 17 days ago • 622

upvoted a paper about 1 month ago

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22 • 149

upvoted a paper 2 months ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 254

upvoted a paper 3 months ago

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2 • 130

upvoted a paper 4 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 293

upvoted 2 papers 5 months ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published Apr 7 • 136

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 291

upvoted 5 papers 6 months ago

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 109

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 300

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published Mar 24 • 119

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 169

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14 • 143

upvoted 3 papers 7 months ago

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published Feb 25 • 49

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 89

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 203

upvoted 4 papers 8 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 124

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published Jan 26 • 62

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 417

MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents

Paper • 2501.08828 • Published Jan 15 • 31

upvoted a paper 9 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 297