Oliver Pfaffel
OliP
AI & ML interests
None yet
Recent Activity
liked
a model
10 days ago
nvidia/NVIDIA-Nemotron-Nano-9B-v2
liked
a model
15 days ago
jhu-clsp/mmBERT-base
liked
a model
16 days ago
lusxvr/nanoVLM-222M
Organizations
2024 Papers of the year
LLM Deployment
-
Running274274
Llm Pricing
📊Display a React app with TypeScript
-
Running1.01k1.01k
Can You Run It? LLM version
🚀Calculate GPU requirements for running LLMs
-
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems
Paper • 2312.15234 • Published • 3 -
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
Paper • 2407.11062 • Published • 10
Long-Context
-
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
Paper • 2407.14057 • Published • 46 -
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
Paper • 2407.14482 • Published • 26 -
NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?
Paper • 2407.11963 • Published • 44
Special LMs <10B
Evaluation
-
Self-Taught Evaluators
Paper • 2408.02666 • Published • 30 -
Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries
Paper • 2409.12640 • Published • 2 -
openai/MMMLU
Viewer • Updated • 393k • 11.5k • 499 -
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Paper • 2409.16191 • Published • 42
Coding
-
SciCode: A Research Coding Benchmark Curated by Scientists
Paper • 2407.13168 • Published • 14 -
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Paper • 2407.16741 • Published • 73 -
CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases
Paper • 2408.03910 • Published • 18 -
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
Paper • 2408.07060 • Published • 42
Leading Leaderboards
-
Running on CPU Upgrade13.6k13.6k
Open LLM Leaderboard
🏆Track, rank and evaluate open LLMs and chatbots
-
Running on CPU Upgrade6.45k6.45k
MTEB Leaderboard
🥇Embedding Leaderboard
-
Running4.62k4.62k
LMArena Leaderboard
🏆Display LMArena Leaderboard
-
Running222222
BigCodeBench Leaderboard
🥇Explore and analyze code completion benchmarks
2023 (and before) Papers of the Year
-
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Paper • 2306.00989 • Published • 1 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 63 -
Scalable Diffusion Models with Transformers
Paper • 2212.09748 • Published • 18 -
Matryoshka Representation Learning
Paper • 2205.13147 • Published • 24
Vision-Language
-
EVLM: An Efficient Vision-Language Model for Visual Understanding
Paper • 2407.14177 • Published • 44 -
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
Paper • 2407.04172 • Published • 26 -
facebook/chameleon-7b
Image-Text-to-Text • 7B • Updated • 80.9k • 190 -
vidore/colpali
Visual Document Retrieval • Updated • 12.6k • 462
Audio
🌶️ Spaces
Applications
-
Integrating Large Language Models into a Tri-Modal Architecture for Automated Depression Classification
Paper • 2407.19340 • Published • 58 -
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Paper • 2408.02900 • Published • 30 -
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper • 2408.06292 • Published • 126
NewGen small LMs
Leading Leaderboards
-
Running on CPU Upgrade13.6k13.6k
Open LLM Leaderboard
🏆Track, rank and evaluate open LLMs and chatbots
-
Running on CPU Upgrade6.45k6.45k
MTEB Leaderboard
🥇Embedding Leaderboard
-
Running4.62k4.62k
LMArena Leaderboard
🏆Display LMArena Leaderboard
-
Running222222
BigCodeBench Leaderboard
🥇Explore and analyze code completion benchmarks
2024 Papers of the year
2023 (and before) Papers of the Year
-
Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles
Paper • 2306.00989 • Published • 1 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 63 -
Scalable Diffusion Models with Transformers
Paper • 2212.09748 • Published • 18 -
Matryoshka Representation Learning
Paper • 2205.13147 • Published • 24
LLM Deployment
-
Running274274
Llm Pricing
📊Display a React app with TypeScript
-
Running1.01k1.01k
Can You Run It? LLM version
🚀Calculate GPU requirements for running LLMs
-
Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems
Paper • 2312.15234 • Published • 3 -
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
Paper • 2407.11062 • Published • 10
Vision-Language
-
EVLM: An Efficient Vision-Language Model for Visual Understanding
Paper • 2407.14177 • Published • 44 -
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
Paper • 2407.04172 • Published • 26 -
facebook/chameleon-7b
Image-Text-to-Text • 7B • Updated • 80.9k • 190 -
vidore/colpali
Visual Document Retrieval • Updated • 12.6k • 462
Long-Context
-
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
Paper • 2407.14057 • Published • 46 -
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities
Paper • 2407.14482 • Published • 26 -
NeedleBench: Can LLMs Do Retrieval and Reasoning in 1 Million Context Window?
Paper • 2407.11963 • Published • 44
Audio
Special LMs <10B
🌶️ Spaces
Evaluation
-
Self-Taught Evaluators
Paper • 2408.02666 • Published • 30 -
Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries
Paper • 2409.12640 • Published • 2 -
openai/MMMLU
Viewer • Updated • 393k • 11.5k • 499 -
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models
Paper • 2409.16191 • Published • 42
Applications
-
Integrating Large Language Models into a Tri-Modal Architecture for Automated Depression Classification
Paper • 2407.19340 • Published • 58 -
MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine
Paper • 2408.02900 • Published • 30 -
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery
Paper • 2408.06292 • Published • 126
Coding
-
SciCode: A Research Coding Benchmark Curated by Scientists
Paper • 2407.13168 • Published • 14 -
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Paper • 2407.16741 • Published • 73 -
CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases
Paper • 2408.03910 • Published • 18 -
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
Paper • 2408.07060 • Published • 42