Orion Weller PRO

orionweller

http://orionweller.com

AI & ML interests

None yet

Recent Activity

updated a model 1 day ago

blab-jhu/mmbert-checkpoints

updated a dataset 6 days ago

blab-jhu/mmbert-fineweb2-remaining-langs

upvoted a collection about 1 month ago

Encoders vs Decoders: the Ettin Suite

Collection

A collection of SOTA, open-data, paired encoder-only and decoder-only models ranging from 17M to 1B parameters. See the paper at https://arxiv.org/abs/250 • 32 items • Updated Jul 16 • 20

upvoted a paper about 1 month ago

Seq vs Seq: An Open Suite of Paired Encoders and Decoders

Paper • 2507.11412 • Published Jul 15 • 25

upvoted an article about 1 month ago

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

Article • Jul 16 • 59

upvoted a paper about 2 months ago

The Translation Barrier Hypothesis: Multilingual Generation with Large Language Models Suffers from Implicit Translation Failure

Paper • 2506.22724 • Published Jun 28 • 10

upvoted a paper 4 months ago

Certified Mitigation of Worst-Case LLM Copyright Infringement

Paper • 2504.16046 • Published Apr 22 • 13

upvoted a paper 5 months ago

WikiVideo: Article Generation from Multiple Videos

Paper • 2504.00939 • Published Apr 1 • 38

upvoted 4 papers 6 months ago

Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning

Paper • 2503.04973 • Published Mar 6 • 25

Rank1: Test-Time Compute for Reranking in Information Retrieval

Paper • 2502.18418 • Published Feb 25 • 28

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published Feb 19 • 38

Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering

Paper • 2502.13962 • Published Feb 19 • 29

upvoted 3 papers 8 months ago

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 96

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 154

Compressed Chain of Thought: Efficient Reasoning Through Dense Representations

Paper • 2412.13171 • Published Dec 17, 2024 • 36

upvoted a paper 10 months ago

Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements

Paper • 2410.08968 • Published Oct 11, 2024 • 14

upvoted a paper 11 months ago

Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models

Paper • 2409.11136 • Published Sep 17, 2024 • 25

upvoted a collection about 1 year ago

BM25S