timmyhhh (tim huang)

upvoted 2 papers 2 months ago

The End of Manual Decoding: Towards Truly End-to-End Language Models

Paper • 2510.26697 • Published Oct 30, 2025 • 116

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24, 2025 • 99

upvoted a paper 8 months ago

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 88

upvoted 2 papers about 1 year ago

A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression

Paper • 2412.17483 • Published Dec 23, 2024 • 34

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

Paper • 2410.09008 • Published Oct 11, 2024 • 17

upvoted a paper over 1 year ago

BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models

Paper • 2402.13577 • Published Feb 21, 2024 • 9

upvoted an article over 1 year ago

Article

SmolLM - blazingly fast and remarkably powerful

+1

Jul 16, 2024

•

437

upvoted a paper over 1 year ago

On the Transformations across Reward Model, Parameter Update, and In-Context Prompt

Paper • 2406.16377 • Published Jun 24, 2024 • 13

upvoted 3 papers almost 2 years ago

upvoted a collection almost 2 years ago

Qwen1.5

Collection

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 9 days ago • 212

upvoted a paper almost 2 years ago

FuseChat: Knowledge Fusion of Chat Models

Paper • 2402.16107 • Published Feb 25, 2024 • 39

tim huang

AI & ML interests

Organizations

The End of Manual Decoding: Towards Truly End-to-End Language Models

DeepAgent: A General Reasoning Agent with Scalable Toolsets

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

A Silver Bullet or a Compromise for Full Attention? A Comprehensive Study of Gist Token-based Context Compression

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models

SmolLM - blazingly fast and remarkably powerful

On the Transformations across Reward Model, Parameter Update, and In-Context Prompt

Knowledge Fusion of Large Language Models

LLM Augmented LLMs: Expanding Capabilities through Composition

Neural Network Diffusion

Qwen1.5

FuseChat: Knowledge Fusion of Chat Models

tim huang

AI & ML interests

Organizations

timmyhhh's activity

SmolLM - blazingly fast and remarkably powerful