Debashish C's picture

1 6 11

Debashish C

fuzzyIntel

·

debashishc

AI & ML interests

Natural Language Processing, Speech, Text, Video, Computational Efficiency, you name it.

Recent Activity

updated a collection 8 days ago

updated a collection 8 days ago

upvoted an article 12 days ago

mmBERT: ModernBERT goes Multilingual

View all activity

Organizations

updated a collection 8 days ago

VLM training

List of VLM papers • 3 items • Updated 8 days ago

upvoted an article 12 days ago

Article

mmBERT: ModernBERT goes Multilingual

By

and 5 others •

15 days ago

• 93

liked a Space 17 days ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 18 days ago

Article

NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks

By

and 4 others •

Aug 11

• 73

upvoted a collection 21 days ago

V-JEPA 2

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 161

updated a collection 24 days ago

VLM training

List of VLM papers • 3 items • Updated 8 days ago

liked a model about 1 year ago

mms-meta/mms-zeroshot-300m

Automatic Speech Recognition • 0.3B • Updated Jul 30, 2024 • 79 • 12

upvoted a paper about 1 year ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 133

liked 3 models over 1 year ago

CohereLabs/aya-23-8B

Text Generation • 8B • Updated 13 days ago • 12.9k • 417

yl4579/StyleTTS2-LibriTTS

Updated Nov 21, 2023 • 54

CohereLabs/c4ai-command-r-v01-4bit

Text Generation • 19B • Updated Apr 16 • 48 • 175

updated a collection over 1 year ago

LLM-fundaments

1 item • Updated Feb 2, 2024

updated a model almost 2 years ago

fuzzyIntel/distil-whisper-large-v3-hi

Updated Dec 19, 2023

upvoted a collection almost 2 years ago

Training Datasets

A collection of pseudo-labelled datasets used to train the Distil-Whisper model. • 9 items • Updated Mar 21, 2024 • 14

New activity in versae/whisper-large-v3 almost 2 years ago

Update README.md

#1 opened almost 2 years ago by

upvoted a paper about 2 years ago

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 25

liked a model about 2 years ago

rsonavane/distil-whisper-large-v2-8-ls

Automatic Speech Recognition • Updated May 19, 2023 • 8 • 10

liked a Space over 2 years ago

Open LLM Leaderboard

Track, rank and evaluate open LLMs and chatbots