OLMo Friends

non-profit

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

orionweller authored a paper 5 days ago

On the Theoretical Limitations of Embedding-Based Retrieval

rulins authored a paper 4 months ago

ReasonIR: Training Retrievers for Reasoning Tasks

natolambert authored a paper 5 months ago

Reinforcement Learning from Human Feedback

View all activity

orionweller

authored a paper 5 days ago

On the Theoretical Limitations of Embedding-Based Retrieval

Paper • 2508.21038 • Published 11 days ago • 16

rulins

authored a paper 4 months ago

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29 • 55

natolambert

authored a paper 5 months ago

Reinforcement Learning from Human Feedback

Paper • 2504.12501 • Published Apr 16 • 4

akshitab

authored 3 papers 5 months ago

soldni

authored a paper 5 months ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9 • 77

liujch1998

authored 2 papers 5 months ago

Efficient Test-Time Scaling via Self-Calibration

Paper • 2503.00031 • Published Feb 25 • 15

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9 • 77

yuchenlin

authored a paper 5 months ago

CrossWordBench: Evaluating the Reasoning Capabilities of LLMs and LVLMs with Controllable Puzzle Generation

Paper • 2504.00043 • Published Mar 30 • 10

orionweller

authored a paper 6 months ago

Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning

Paper • 2503.04973 • Published Mar 6 • 25

hamishivi

authored 3 papers 6 months ago

Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning

Paper • 2408.10075 • Published Aug 19, 2024

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 21

Large-Scale Data Selection for Instruction Tuning

Paper • 2503.01807 • Published Mar 3 • 13

jungok

authored a paper 6 months ago

LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation

Paper • 2502.20583 • Published Feb 27 • 13

yuchenlin

authored 2 papers 7 months ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17 • 40

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Paper • 2502.01100 • Published Feb 3 • 18

liujch1998

authored 3 papers 8 months ago

Don't throw away your value model! Making PPO even better via Value-Guided Monte-Carlo Tree Search decoding

Paper • 2309.15028 • Published Sep 26, 2023 • 1

MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts

Paper • 2310.02255 • Published Oct 3, 2023 • 2

Crystal: Introspective Reasoners Reinforced with Self-Feedback

Paper • 2310.04921 • Published Oct 7, 2023 • 1

AI & ML interests

Recent Activity

Team members 16

olmo-friends's activity