tc lin

stuser2023

·

https://github.com/stuser

stuser

AI & ML interests

None yet

Recent Activity

liked a Space 4 days ago

TabArena/leaderboard

liked a model 4 days ago

google/tabfm-1.0.0-pytorch

liked a model 7 days ago

OpenFormosa/BlueMagpie-TTS

View all activity

Organizations

None yet

upvoted a collection about 1 month ago

Cosmos3

Omnimodal World Models for Physical AI • 18 items • Updated 4 days ago • 138

upvoted an article 3 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

+5

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 911

upvoted an article 7 months ago

Article

Getting Started with Sentiment Analysis using Python

federicopascual

•

Feb 2, 2022

• 75

upvoted a collection 7 months ago

EmoPillars

This collection contains models and a dataset for fine-grained context-aware and context-less emotion classification. • 7 items • Updated Apr 25, 2025 • 4

upvoted an article 10 months ago

Article

Introducing AI Sheets: a tool to work with datasets using open AI models!

+4

dvilasuero, Ameeeee, frascuchon, damianpumar, lvwerra, thomwolf

•

Aug 8, 2025

• 109

upvoted a paper 11 months ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17, 2025 • 264

upvoted a paper about 1 year ago

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5, 2025 • 63

upvoted an article about 1 year ago

Article

The Common Pile v0.1

stellaathena

•

Jun 6, 2025

• 54

upvoted a collection about 1 year ago

🧠 Traditional Chinese Reasoning Datasets

A curated collection of datasets designed to evaluate and train reasoning capabilities in Traditional Chinese across various domains. • 3 items • Updated May 6 • 9

upvoted an article over 1 year ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

+2

saurabhdash, olivernan, ArashAhmadian, johndang-cohere

•

Mar 4, 2025

• 78

upvoted 5 collections over 1 year ago

PaliGemma 2 Mix

13 items • Updated Mar 12 • 66

Breeze 2 Family

Llama-Breeze2 is a multi-modal language model family specifically intended for Traditional Chinese use. BreezyVoice is a Taiwan Mandarin TTS • 6 items • Updated Feb 26, 2025 • 20

Cosmos-Tokenizer1

⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/cosmos3 • 22 items • Updated 25 days ago • 44

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 16 items • Updated Mar 2 • 83

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 309

upvoted an article almost 2 years ago

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

Isayoften

•

Aug 26, 2024

• 91

upvoted a paper almost 2 years ago

AgentPoison: Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Paper • 2407.12784 • Published Jul 17, 2024 • 51

upvoted a collection almost 2 years ago

🪐 SmolLM

A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5, 2025 • 251

upvoted an article almost 2 years ago

Article

SmolLM - blazingly fast and remarkably powerful

+1

loubnabnl, anton-l, eliebak

•

Jul 16, 2024

• 460

upvoted a collection about 2 years ago

Nemotron 4 340B

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 25 days ago • 164