OptimalScale

university

https://github.com/OptimalScale

AI & ML interests

Large foundation models, large language models.

OptimalScale's activity

lmflow-optimalscale

in OptimalScale/ClimbLab 4 days ago

Really nice contribution 👏🏻👏🏻

#2 opened 5 days ago

lmflow-optimalscale

in OptimalScale/ClimbMix 4 days ago

Erroneous Token Count Column

#2 opened 4 days ago

lmflow-optimalscale

updated 2 datasets 7 days ago

OptimalScale/ClimbMix

Viewer • Updated 7 days ago • 395M • 724 • 4

OptimalScale/ClimbLab

Viewer • Updated 7 days ago • 1.24B • 823 • 7

shizhediao

updated a dataset 7 days ago

OptimalScale/ClimbLab

Viewer • Updated 7 days ago • 1.24B • 823 • 7

shizhediao

authored a paper 9 days ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published 10 days ago • 87

lmflow-optimalscale

published 2 datasets 9 days ago

OptimalScale/ClimbLab

Viewer • Updated 7 days ago • 1.24B • 823 • 7

OptimalScale/ClimbMix

Viewer • Updated 7 days ago • 395M • 724 • 4

ksshumab

authored a paper about 2 months ago

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2 • 57

ksshumab

authored 3 papers 2 months ago

Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data

Paper • 2302.12822 • Published Feb 24, 2023

RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment

Paper • 2304.06767 • Published Apr 13, 2023 • 2

FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation

Paper • 2408.12168 • Published Aug 22, 2024

hendrydong

authored 2 papers 3 months ago

BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation

Paper • 2502.03860 • Published Feb 6 • 24

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published Jan 31 • 39

hendrydong

authored a paper 4 months ago

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 39

shizhediao

authored a paper 5 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 45

research4pan

authored a paper 7 months ago

Personalized Visual Instruction Tuning

Paper • 2410.07113 • Published Oct 9, 2024 • 71

shizhediao

authored a paper 7 months ago

Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models

Paper • 2410.03290 • Published Oct 4, 2024 • 7

hendrydong

authored a paper 7 months ago

MathHay: An Automated Benchmark for Long-Context Mathematical Reasoning in LLMs

Paper • 2410.04698 • Published Oct 7, 2024 • 13

hendrydong

authored a paper 9 months ago

ThinK: Thinner Key Cache by Query-Driven Pruning

Paper • 2407.21018 • Published Jul 30, 2024 • 33