Tim Wu

changtimwu

AI & ML interests

DL, IoT, DevOps

Recent Activity

New activity in PowerInfer/SmallThinker-21BA3B-Instruct about 2 months ago

Are there any other frameworks tested besides transformers that can be deployed?

#5 opened 2 months ago by

DarrenChen

liked a model 2 months ago

RedHatAI/Qwen3-32B-NVFP4

Text Generation • 19B • Updated Jun 30 • 1.35k • 3

New activity in omeng-nvidia/saved_models_Qwen3-30B-A3B_nvfp4_hf 3 months ago

Can you explain how this model was built?

#2 opened 3 months ago by

changtimwu

liked a model 4 months ago

Qwen/Qwen3-32B-FP8

Text Generation • 33B • Updated Jul 26 • 64.6k • 64

liked a Space 5 months ago

3.27k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 6 months ago

DistServe: Disaggregating Prefill and Decoding for Goodput-optimized Large Language Model Serving

Paper • 2401.09670 • Published Jan 18, 2024 • 2

upvoted an article 7 months ago

Uncensor any LLM with abliteration

Article • Jun 13, 2024 • 684

liked a model 7 months ago

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • 6B • Updated May 1 • 806k • 1.49k

upvoted a paper 8 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 129

liked a model 8 months ago

QuantFactory/Llama-3.2-Taiwan-Legal-3B-Instruct-GGUF

Text Generation • 3B • Updated Nov 2, 2024 • 321 • 11

upvoted 2 papers 8 months ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 102

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 122

liked a Space about 1 year ago

115

Llama3.1 S V0.2 Checkpoint 2024 08 20

😻

Convert text to audio and vice versa

liked a model about 1 year ago

shenzhi-wang/Llama3.1-8B-Chinese-Chat

Text Generation • 8B • Updated Jul 29, 2024 • 5.56k • 264

liked a model over 1 year ago

openbmb/MiniCPM-Llama3-V-2_5-gguf

Updated Feb 27 • 5.55k • 213

liked a Space over 1 year ago

220

Microsoft Phi-3-Vision-128k

😻

Generate text descriptions from images

liked a model over 1 year ago

google/paligemma-3b-pt-224

Image-Text-to-Text • 3B • Updated Sep 21, 2024 • 40.5k • 358

updated a model over 1 year ago

changtimwu/speaker-segmentation-fine-tuned-callhome-jpn

0.0B • Updated May 2, 2024 • 4

liked 2 models over 1 year ago

crusoeai/Llama-3-8B-Instruct-262k-GGUF

8B • Updated May 5, 2024 • 2.33k • 49

bullerwins/gradientai_Llama-3-8B-Instruct-262k_exl2_8.0bpw

Text Generation • Updated Apr 26, 2024 • 11 • 3