Shubhashis Roy Dipta PRO
AI & ML interests
Recent Activity
Organizations
-
Textbooks Are All You Need
Paper • 2306.11644 • Published • 146 -
LLaVA-φ: Efficient Multi-Modal Assistant with Small Language Model
Paper • 2401.02330 • Published • 18 -
Textbooks Are All You Need II: phi-1.5 technical report
Paper • 2309.05463 • Published • 88 -
Visual Instruction Tuning
Paper • 2304.08485 • Published • 19
-
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
Paper • 2402.13064 • Published • 50 -
Large Language Models for Data Annotation: A Survey
Paper • 2402.13446 • Published • 1 -
AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators
Paper • 2303.16854 • Published • 1 -
Open-Source Large Language Models Outperform Crowd Workers and Approach ChatGPT in Text-Annotation Tasks
Paper • 2307.02179 • Published • 7
-
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer
Paper • 2401.10208 • Published • 1 -
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Paper • 2305.11172 • Published • 2 -
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
Paper • 2302.00402 • Published -
Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities
Paper • 2308.12966 • Published • 9
-
Running122122
Open VLM Video Leaderboard
🌎VLMEvalKit Eval Results in video understanding benchmark
-
Running on CPU Upgrade13.6k13.6k
Open LLM Leaderboard
🏆Track, rank and evaluate open LLMs and chatbots
-
Running on CPU Upgrade1.08k1.08k
Open ASR Leaderboard
🏆View and request speech recognition model benchmarks
-
Running on CPU Upgrade888888
Open VLM Leaderboard
🌎VLMEvalKit Evaluation Results Collection
-
Running147147
Recommend Similar Papers
🌖Find similar papers using a link
-
Running on CPU Upgrade268268
Daily Papers
📊Complete list of past Daily Papers
-
Running8282
Semantic Hugging Face Hub Search
🔎Find datasets and models using semantic search
-
Running on CPU Upgrade888888
Open VLM Leaderboard
🌎VLMEvalKit Evaluation Results Collection
-
M^3IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning
Paper • 2306.04387 • Published • 8 -
Datasets for Large Language Models: A Comprehensive Survey
Paper • 2402.18041 • Published • 2 -
LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark
Paper • 2306.06687 • Published • 1 -
Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidents
Paper • 2201.04236 • Published
-
Proximal Policy Optimization Algorithms
Paper • 1707.06347 • Published • 11 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 63 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 151 -
Training language models to follow instructions with human feedback
Paper • 2203.02155 • Published • 23
-
uiuc-convai/CoALM-IT
Viewer • Updated • 312k • 165 • 12 -
Running9393
Nexus Function Calling Leaderboard
🐠Display benchmark results for models on various tasks
-
fireworks-ai/function-calling-eval-dataset-v0
Viewer • Updated • 212 • 58 • 14 -
NousResearch/func-calling-eval-glaive
Viewer • Updated • 100 • 41 • 7
-
Running122122
Open VLM Video Leaderboard
🌎VLMEvalKit Eval Results in video understanding benchmark
-
Running on CPU Upgrade13.6k13.6k
Open LLM Leaderboard
🏆Track, rank and evaluate open LLMs and chatbots
-
Running on CPU Upgrade1.08k1.08k
Open ASR Leaderboard
🏆View and request speech recognition model benchmarks
-
Running on CPU Upgrade888888
Open VLM Leaderboard
🌎VLMEvalKit Evaluation Results Collection
-
Textbooks Are All You Need
Paper • 2306.11644 • Published • 146 -
LLaVA-φ: Efficient Multi-Modal Assistant with Small Language Model
Paper • 2401.02330 • Published • 18 -
Textbooks Are All You Need II: phi-1.5 technical report
Paper • 2309.05463 • Published • 88 -
Visual Instruction Tuning
Paper • 2304.08485 • Published • 19
-
Running147147
Recommend Similar Papers
🌖Find similar papers using a link
-
Running on CPU Upgrade268268
Daily Papers
📊Complete list of past Daily Papers
-
Running8282
Semantic Hugging Face Hub Search
🔎Find datasets and models using semantic search
-
Running on CPU Upgrade888888
Open VLM Leaderboard
🌎VLMEvalKit Evaluation Results Collection
-
Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
Paper • 2402.13064 • Published • 50 -
Large Language Models for Data Annotation: A Survey
Paper • 2402.13446 • Published • 1 -
AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators
Paper • 2303.16854 • Published • 1 -
Open-Source Large Language Models Outperform Crowd Workers and Approach ChatGPT in Text-Annotation Tasks
Paper • 2307.02179 • Published • 7
-
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer
Paper • 2401.10208 • Published • 1 -
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities
Paper • 2305.11172 • Published • 2 -
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
Paper • 2302.00402 • Published -
Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities
Paper • 2308.12966 • Published • 9
-
M^3IT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning
Paper • 2306.04387 • Published • 8 -
Datasets for Large Language Models: A Comprehensive Survey
Paper • 2402.18041 • Published • 2 -
LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark
Paper • 2306.06687 • Published • 1 -
Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidents
Paper • 2201.04236 • Published
-
Proximal Policy Optimization Algorithms
Paper • 1707.06347 • Published • 11 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 63 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 151 -
Training language models to follow instructions with human feedback
Paper • 2203.02155 • Published • 23