University of Washington

Verified

Recent Activity

ronakdm authored a paper about 1 month ago

Distributionally Robust Optimization with Bias and Variance Reduction

ronakdm authored a paper about 1 month ago

The Benefits of Balance: From Information Projections to Variance Reduction

ronakdm authored a paper about 1 month ago

A Generalization Theory for Zero-Shot Prediction

LNIU

authored 6 papers about 2 months ago

ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates

Paper • 2406.12935 • Published Jun 17, 2024 • 2

CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models

Paper • 2406.12257 • Published Jun 18, 2024

Stronger Models are NOT Stronger Teachers for Instruction Tuning

Paper • 2411.07133 • Published Nov 11, 2024 • 39

SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities

Paper • 2502.12025 • Published Feb 17 • 3

TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning

Paper • 2505.14625 • Published May 20 • 13

VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL

Paper • 2505.23977 • Published May 29 • 10

alisawuffles

in UW/OLMo2-8B-SuperBPE-t180k 5 months ago

Training code for Tokenizer

#1 opened 5 months ago by

kevinlin311tw

authored a paper 5 months ago

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Paper • 2504.07934 • Published Apr 10 • 20

yanaiela

authored a paper 5 months ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9 • 77

alisawuffles

updated a dataset 5 months ago

UW/olmo-mix-1124-subset-p99

Updated Apr 10 • 82 • 1

alisawuffles

updated a collection 5 months ago

SuperBPE

SuperBPE tokenizers and models trained with them • 8 items • Updated Apr 10 • 15

kevinlin311tw

authored a paper 6 months ago

BizGen: Advancing Article-level Visual Text Rendering for Infographics Generation

Paper • 2503.20672 • Published Mar 26 • 14

Jhayase

published a model 6 months ago

UW/OLMo2-11B-SuperBPE-t180k

Text Generation • 11B • Updated Mar 20 • 9 • 2

Jhayase

updated a model 6 months ago

UW/OLMo2-11B-SuperBPE-t180k

Text Generation • 11B • Updated Mar 20 • 9 • 2

alisawuffles

updated a collection 6 months ago

SuperBPE

SuperBPE tokenizers and models trained with them • 8 items • Updated Apr 10 • 15

alisawuffles

updated a model 6 months ago

UW/OLMo2-11B-SuperBPE-t180k

Text Generation • 11B • Updated Mar 20 • 9 • 2

alisawuffles

published 3 models 6 months ago

UW/OLMo2-8B-SuperBPE-t80k

Text Generation • 8B • Updated Mar 19 • 8

UW/OLMo2-8B-SuperBPE-t180k

Text Generation • 8B • Updated Mar 19 • 1.13k • 8

UW/OLMo2-8B-BPE

Text Generation • 8B • Updated Mar 19 • 7