Shyam Sunder Kumar

theainerd

AI & ML interests

Natural Language Processing

Recent Activity

Organizations

Neuropark · Speech Recognition Community Event Version 2 · Open-Source AI Meetup · Social Post Explorers · Hugging Face Discord Community

theainerd's activity

reacted to AdinaY's post with ❤️ 1 day ago
🔥 New reasoning models from the Chinese community, by Skywork 天工-昆仑万维

Skywork/skywork-or1-67fa1bcb41b436ef2def76b9

✨Skywork-OR1-Math-7B > Optimized for math reasoning
✨Skywork-OR1-7B-preview > Excels in math & coding
✨Skywork-OR1-32B-preview > Matches Deepseek-R1 on math (AIME24/25) and coding (LiveCodeBench)

Released under the Apache 2.0 license 🥳
Final version coming in 2 weeks!
New activity in theainerd/Wav2Vec2-large-xlsr-hindi 3 days ago

Update README.md

#3 opened about 1 year ago by
indalaiart
reacted to jeffboudier's post with 🚀 10 days ago
Llama4 is out and Scout is already on the Dell Enterprise Hub to deploy on Dell systems 👉 dell.huggingface.co
reacted to aiqtech's post with 🔥 17 days ago
✨ High-Resolution Ghibli Style Image Generator ✨
🌟 Introducing FLUX Ghibli LoRA
Hello everyone! Today I'm excited to present a special LoRA model for FLUX Dev.1. Trained on high-resolution Ghibli images, it makes it easy to create beautiful Ghibli-style images with stunning detail! 🎨

space: aiqtech/FLUX-Ghibli-Studio-LoRA
model: openfree/flux-chatgpt-ghibli-lora

🔮 Key Features

Trained on High-Resolution Ghibli Images - Unlike other LoRAs, this one is trained on high-resolution images, delivering sharper and more beautiful results
Powered by FLUX Dev.1 - Utilizing the latest FLUX model for faster generation and superior quality
User-Friendly Interface - An intuitive UI that allows anyone to create Ghibli-style images with ease
Diverse Creative Possibilities - Express various themes in Ghibli style, from futuristic worlds to fantasy elements

🖼️ Sample Images

💡 Prompt Tips

Include "Ghibli style" in your prompts
Try combining nature, fantasy elements, futuristic elements, and warm emotions
Add "[trigger]" tag at the end for better results

🚀 Getting Started

Enter your prompt (e.g., "Ghibli style sky whale transport ship...")
Adjust image size and generation settings
Click the "Generate" button
In just seconds, your beautiful Ghibli-style image will be created!

🤝 Community
Want more information and tips? Join our community!
Discord: https://discord.gg/openfreeai

Create your own magical world with the LoRA trained on high-resolution Ghibli images for FLUX Dev.1! 🌈✨
reacted to clem's post with 🤗 18 days ago
What's this cool purple banner haha 😶😶😶
reacted to Kseniase's post with 👀 23 days ago
8 types of RoPE

Since we use Transformers constantly, it's helpful to understand RoPE (Rotary Position Embedding). Token order matters, so RoPE encodes it by rotating token embeddings based on their position, letting the model know which token comes first, second, and so on.

Here are 8 types of RoPE that can be implemented in different cases:

1. Original RoPE -> RoFormer: Enhanced Transformer with Rotary Position Embedding (2104.09864)
Encodes token positions by rotating token embeddings in the complex plane via a position-based rotation matrix, thereby providing the self-attention mechanism with relative positional info.

2. LongRoPE -> LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens (2402.13753)
Extends the context window of pre-trained LLMs to 2048k tokens, leveraging non-uniformities in positional interpolation with an efficient search.

3. LongRoPE2 -> LongRoPE2: Near-Lossless LLM Context Window Scaling (2502.20082)
Extends the effective context window of pre-trained LLMs to the target length, rescaling RoPE guided by "needle-driven" perplexity.

4. Multimodal RoPE (MRoPE) -> Qwen2.5-VL Technical Report (2502.13923)
Decomposes positional embedding into 3 components: temporal, height and width, so that positional features are aligned across modalities: text, images and videos.

5. Directional RoPE (DRoPE) -> DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling (2503.15029)
Adds an identity scalar, improving how angles are handled without extra complexity. It helps balance accuracy, speed, and memory usage.

6. VideoRoPE -> VideoRoPE: What Makes for Good Video Rotary Position Embedding? (2502.05173)
Adapts RoPE for video, featuring 3D structure, low-frequency temporal allocation, diagonal layout, and adjustable spacing.

7. VRoPE -> VRoPE: Rotary Position Embedding for Video Large Language Models (2502.11664)
Another RoPE variant for video, which restructures positional indices and balances encoding for uniform spatial focus.

8. XPos (Extrapolatable Position Embedding) -> https://huggingface.co/papers/2212.10
Introduces an exponential decay factor into the rotation matrix​, improving stability on long sequences.
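The core rotation behind the original RoPE (1) can be sketched in a few lines of NumPy. This is a minimal illustration under the RoFormer formulation, not a reference implementation; the function name `rope_rotate` and the toy vectors are assumptions for the demo. It shows the key property: the dot product of two rotated embeddings depends only on their relative offset.

```python
import numpy as np

def rope_rotate(x, pos, base=10000.0):
    """Rotate embedding x (even dim d) by position-dependent angles,
    pairing dims (0,1), (2,3), ... as independent 2-D planes."""
    d = x.shape[-1]
    # one frequency per 2-D pair, decaying with dimension index
    freqs = base ** (-np.arange(0, d, 2) / d)
    angles = pos * freqs
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[..., 0::2], x[..., 1::2]
    out = np.empty_like(x)
    out[..., 0::2] = x1 * cos - x2 * sin  # standard 2-D rotation
    out[..., 1::2] = x1 * sin + x2 * cos
    return out

q = np.random.default_rng(0).normal(size=8)
k = np.random.default_rng(1).normal(size=8)

# query at position 5 vs key at position 3: relative offset 2
a = rope_rotate(q, 5) @ rope_rotate(k, 3)
# query at 12 vs key at 10: same relative offset 2
b = rope_rotate(q, 12) @ rope_rotate(k, 10)
print(np.allclose(a, b))  # prints True
```

Because each 2-D rotation is orthogonal, embedding norms are preserved, and the attention score sees only the position difference m − n, which is exactly what gives self-attention relative positional information.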
reacted to onekq's post with 🤯 26 days ago
Folks, let's get ready! 🥳 We will be busy soon. 😅🤗 https://github.com/huggingface/transformers/pull/36878
reacted to sharpenb's post with 🔥🔥 27 days ago
We open-sourced the pruna package, which can be easily installed with pip install pruna :) It lets you easily compress and evaluate AI models, including transformers and diffusers.

- Github repo: https://github.com/PrunaAI/pruna
- Documentation: https://docs.pruna.ai/en/stable/index.html

With open-sourcing, people can now inspect and contribute to the code. Beyond the code itself, we provide a detailed README, tutorials, benchmarks, and documentation to make compression, evaluation, and saving/loading/serving of AI models transparent.

Happy to share it with you and always interested in collecting your feedback :)