Data Selection

community

https://github.com/microsoft/LMOps/tree/main/data_selection

AI & ML interests

Data Selection for Language Models

t1101675

in Data-Selection/PDS-470M 9 months ago

Adding `safetensors` variant of this model

#1 opened 11 months ago by

t1101675

in Data-Selection/PDS-160M 9 months ago

Adding `safetensors` variant of this model

#1 opened 11 months ago by

Add link to paper

#2 opened 9 months ago by

t1101675

in Data-Selection/PDS-470M 9 months ago

Clarify Model Description and Add Project Page Link

#2 opened 9 months ago by

t1101675

in Data-Selection/PDS-1B 9 months ago

Add link to code repository

#2 opened 9 months ago by

t1101675

in Data-Selection/PDS-1.7B 9 months ago

Add link to Github and improve description

#2 opened 9 months ago by

t1101675

in Data-Selection/BSL-1.7B 9 months ago

Add link to code

#2 opened 9 months ago by

t1101675

updated a model 12 months ago

Data-Selection/data_scorer

Updated Jan 5, 2025 • 14

t1101675

authored a paper about 1 year ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 59

t1101675

updated 8 models about 1 year ago

Data-Selection/BSL-1B

Text Generation • Updated Oct 28, 2024 • 8

Data-Selection/BSL-1.7B

Text Generation • Updated Mar 25, 2025 • 9

Data-Selection/BSL-470M

Text Generation • Updated Oct 28, 2024 • 9

Data-Selection/BSL-160M

Text Generation • Updated Oct 28, 2024 • 40

Data-Selection/PDS-1.7B

Text Generation • Updated Mar 25, 2025 • 9

Data-Selection/PDS-1B

Text Generation • Updated Mar 25, 2025 • 8

Data-Selection/PDS-470M

Text Generation • 0.5B • Updated Mar 25, 2025 • 10

Data-Selection/PDS-160M

Text Generation • 0.2B • Updated Mar 25, 2025 • 8

t1101675

authored 2 papers about 1 year ago

MiniPLM: Knowledge Distillation for Pre-Training Language Models

Paper • 2410.17215 • Published Oct 22, 2024 • 16

Data Selection via Optimal Control for Language Models

Paper • 2410.07064 • Published Oct 9, 2024 • 9

t1101675

authored a paper over 1 year ago

Direct Preference Knowledge Distillation for Large Language Models

Paper • 2406.19774 • Published Jun 28, 2024 • 22