Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sewon Min's picture
4 3 4

Sewon Min

sewon
Biswa123's profile picture victor's profile picture 21world's profile picture
·
https://shmsw25.github.io/
  • sewon__min
  • shmsw25

AI & ML interests

natural language processing, language modeling

Organizations

Ai2's profile picture University of Washington NLP's profile picture OLMoE's profile picture data-delve's profile picture

authored a paper 2 months ago

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29 • 55
authored 2 papers 3 months ago

Reasoning Models Can Be Effective Without Thinking

Paper • 2504.09858 • Published Apr 14 • 12

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9 • 74
authored a paper 10 months ago

OLMoE: Open Mixture-of-Experts Language Models

Paper • 2409.02060 • Published Sep 3, 2024 • 79
authored 2 papers over 1 year ago

Infini-gram: Scaling Unbounded n-gram Language Models to a Trillion Tokens

Paper • 2401.17377 • Published Jan 30, 2024 • 38

In-Context Pretraining: Language Modeling Beyond Document Boundaries

Paper • 2310.10638 • Published Oct 16, 2023 • 30
authored a paper almost 2 years ago

SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore

Paper • 2308.04430 • Published Aug 8, 2023 • 10
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs