None's picture

3 3

None

tr3n1ttty

·

surkovvv

AI & ML interests

None yet

Recent Activity

liked a Space about 2 months ago

nanotron/ultrascale-playbook

liked a model 2 months ago

Qwen/Qwen3-235B-A22B

upvoted a paper 2 months ago

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

View all activity

Organizations

None yet

liked a Space about 2 months ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked a model 2 months ago

Qwen/Qwen3-235B-A22B

Text Generation • 235B • Updated Jul 26 • 169k • • 1.04k

upvoted a paper 2 months ago

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20 • 77

upvoted a collection 2 months ago

T-pro-2.0

Hybrid reasoning model based on Qwen3 32B • 12 items • Updated Jul 18 • 30

liked a model 2 months ago

t-tech/T-pro-it-2.0-eagle

Updated Jul 18 • 539 • 45

updated a Space over 1 year ago

Chatbot Demo

Generate responses to text messages in a chat interface

updated a Space over 2 years ago

Ysda Transformers Practice