Vaibhav Srivastav's picture

Vaibhav Srivastav PRO

reach-vb

·

https://vaibhavs10.github.io

AI & ML interests

TTS + LM performance prediction

Recent Activity

liked a model about 13 hours ago

tencent/HunyuanWorld-Voyager

liked a model 3 days ago

moonshotai/Kimi-K2-Instruct-0905

liked a model 3 days ago

google/embeddinggemma-300m

View all activity

Organizations

upvoted an article 6 days ago

Article

Make your ZeroGPU Spaces go brrr with PyTorch ahead-of-time compilation

By

and 3 others •

6 days ago

• 40

upvoted 5 articles 7 days ago

Article

MCP for Research: How to Connect AI to Research Tools

By

•

21 days ago

• 44

Article

Fine-tune Llama 2 with DPO

By

and 2 others •

Aug 8, 2023

• 61

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

By

and 4 others •

Jan 18, 2024

• 71

Article

Illustrating Reinforcement Learning from Human Feedback (RLHF)

By

and 3 others •

Dec 9, 2022

• 336

Article

🥬 TinyLettuce: Efficient Hallucination Detection with 17–68M Encoders

By

and 1 other •

7 days ago

• 9

upvoted a collection 9 days ago

FastVLM

Efficient Vision Encoding for Vision Language Models • 9 items • Updated 5 days ago • 90

upvoted a paper 9 days ago

TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Language Modeling

Paper • 2508.16790 • Published 16 days ago • 7

upvoted an article 11 days ago

Article

Open R1: How to use OlympicCoder locally for coding?

By

and 4 others •

Mar 20

• 63

upvoted 3 collections 13 days ago

InternVL3.5

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated 10 days ago • 86

AI Release Year Thread 2025

16 items • Updated 7 days ago • 6

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 5 items • Updated 7 days ago • 106

upvoted 2 articles 18 days ago

Article

Generate Images with Claude and Hugging Face

By

•

20 days ago

• 31

Article

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

By

and 4 others •

18 days ago

• 15

upvoted a collection 20 days ago

DeepSeek-V3.1

3 items • Updated 18 days ago • 222

upvoted a collection 24 days ago

DINOv3

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated 17 days ago • 278

upvoted an article 24 days ago

Article

How To Build a News Agent with GPT-OSS, Hugging Face Inference & Gradio

By

•

25 days ago

• 23

upvoted an article about 1 month ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

By

and 11 others •

Aug 5

• 489

upvoted 2 collections about 1 month ago

GPT OSS

2 items • Updated 18 days ago • 12

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 338