
Tien Dung

tiendung

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago
SparseLLM/BlockFFN-3B-SFT
liked a model 10 days ago
turboderp/ERNIE-4.5-300B-A47B-PT-exl3
reacted to Jaward's post with 😎 14 days ago
I played around with the new RXTX paper (XX^T) and was able to train nanogpt with 4x4 RXTX matmuls in both the attention layer and the optimizer 🤕 It just works (well, I had to add some guardrails) but still saves 5% of memory usage.
The Patch:
  • Computes attention scores with 4x4 blockwise RXTX matmuls (no PyTorch dot product).
  • Handles arbitrary sequence lengths by padding to the nearest multiple of 4.
  • An RXTX variant of Shampoo with params reshaped into 4x4 blocks during each optimizer step.
  • Uses 5% fewer ops.
Code: https://github.com/Jaykef/ai-algorithms/blob/main/nanogpt-rxtx.ipynb
Paper: https://arxiv.org/pdf/2505.09814

Organizations

  • Symato Team
  • Tiny Monsters
  • Vietnamese Mistral

published an article 10 months ago
Article

Interpretable Preferences via Multi-Objective Reward Modeling and Mixture-of-Experts

By tiendung • Sep 29, 2024 • 1
published an article 10 months ago
Article

The Bitter Lesson in AI

By tiendung • Sep 29, 2024 • 1