PenutChen

penut85420

AI & ML interests

LLM, Quantization

Recent Activity

upvoted an article 21 days ago

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Article • and 1 other • 23 days ago • 57

upvoted a paper about 1 month ago

Unifying Demonstration Selection and Compression for In-Context Learning

Paper • 2405.17062 • Published May 27, 2024 • 1

liked a model 2 months ago

chandar-lab/NeoBERT

Feature Extraction • 0.2B • Updated Mar 25 • 1.86k • 178

upvoted a paper 3 months ago

TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling

Paper • 2504.07053 • Published Apr 9 • 4

liked a dataset 3 months ago

institutional/institutional-books-1.0

Viewer • Updated Jun 16 • 983k • 4.88k • 251

updated a Space 3 months ago

HelloGradio

🏢

Test your Japanese kana knowledge quiz-style

published a Space 3 months ago

HelloGradio

🏢

Test your Japanese kana knowledge quiz-style

updated a Space 3 months ago

Test

📚

Take a Japanese kana quiz to test your knowledge 🚀

published a Space 3 months ago

Test

📚

Take a Japanese kana quiz to test your knowledge 🚀

updated a Space 3 months ago

JpVocab

✏

Take a Japanese vocabulary quiz

commented on 🐯 Liger GRPO meets TRL 3 months ago

Sounds perfect!

upvoted an article 3 months ago

🐯 Liger GRPO meets TRL

Article • and 5 others • May 25 • 50

commented on 🐯 Liger GRPO meets TRL 4 months ago

Does Liger Kernel affect training speed at all? Is it faster, slower, or no difference compared to regular GRPO?

updated a Space 4 months ago

KanaQuiz

📝

Take a Japanese kana quiz 📝

upvoted a paper 4 months ago

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 97

liked a Space 4 months ago

968

Computer Agent

🖥

Interact with an AI agent to perform web tasks

liked a model 4 months ago

JetBrains/Mellum-4b-base

Text Generation • 4B • Updated May 7 • 7.94k • 415

upvoted a collection 5 months ago

Tiny Series

Collection

Tiny datasets that empower the foundation of Small Language Model! • 11 items • Updated Jan 26, 2024 • 42

updated 2 Spaces 5 months ago

JpVocab

✏

Take a Japanese vocabulary quiz

JpVocab

✏

Take a Japanese vocabulary quiz
