Aritra Roy Gosthipaty PRO

ariG23498

huggingface

https://arig23498.github.io/

AI & ML interests

Deep Representation Learning

Recent Activity

upvoted an article 1 day ago

Article

Ulysses Sequence Parallelism: Training with Million-Token Contexts

6 days ago • 20

liked 3 models 2 days ago

fishaudio/s2-pro

Text-to-Speech • 5B • Updated 4 days ago • 3.96k • 427

HumeAI/tada-1b

Text-to-Speech • 2B • Updated 1 day ago • 8.76k • 177

HumeAI/tada-3b-ml

Text-to-Speech • 4B • Updated 1 day ago • 7.69k • 115

New activity in Jackrong/Qwen3.5-9B-Claude-4.6-Opus-Reasoning-Distilled 2 days ago

Adding `transformers` as the library name

#3 opened 2 days ago by

New activity in 1Covenant/Covenant-72B 2 days ago

Adding `transformers` as the library name

#2 opened 2 days ago by

liked a model 2 days ago

1Covenant/Covenant-72B

Text Generation • 73B • Updated 5 days ago • 296 • 30

updated 2 datasets 2 days ago

model-metadata/custom-code-models

Viewer • Updated 2 days ago • 100 • 54 • 1

model-metadata/trending_models_metadata

Viewer • Updated 2 days ago • 100 • 38

upvoted an article 3 days ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

5 days ago • 49

upvoted a collection 3 days ago

Built with DGX Spark 💚

0 items • Updated 3 days ago • 4

liked a model 3 days ago

bharatgenai/Param2-17B-A2.4B-Thinking

Text Generation • 17B • Updated 1 day ago • 2.12k • 53

upvoted a collection 4 days ago

NVIDIA Nemotron v3

Open, Production-ready Enterprise Models • 12 items • Updated 4 days ago • 197

liked a model 4 days ago

nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16

Text Generation • 124B • Updated about 16 hours ago • 13.1k • 192

updated a Space 4 days ago

README

upvoted an article 5 days ago

Article

Introducing Storage Buckets on the Hugging Face Hub

5 days ago • 166

liked a model 5 days ago

tencent/Penguin-VL-8B

Text Generation • 9B • Updated 3 days ago • 3.07k • 64

upvoted an article 6 days ago

Article

Creating custom kernels for the AMD MI300

Jul 9, 2025 • 54

updated 2 datasets 9 days ago

model-metadata/code_execution_files

Viewer • Updated 9 days ago • 418 • 1.35k

model-metadata/hf_jobs_url

Viewer • Updated 9 days ago • 76 • 13