shisa-ai's Collections
shisa-v2-research
updated
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper • 2406.08464 • Published • 67
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper • 2406.20094 • Published • 97
argilla/magpie-ultra-v1.0
Viewer • Updated • 3.22M • 3.88k • 42
Viewer • Updated • 1k • 6.68k • 84
Viewer • Updated • 817 • 7.19k • 132
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Paper • 2401.01335 • Published • 65
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper • 2404.03715 • Published • 61
Self-Boosting Large Language Models with Synthetic Preference Data
Paper • 2410.06961 • Published • 16
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Paper • 2412.11605 • Published • 18
Magpie-Align/Magpie-Reasoning-V1-150K-CoT-Deepseek-R1-Llama-70B
Viewer • Updated • 150k • 724 • 17
sbintuitions/modernbert-ja-130m
Fill-Mask • Updated • 10.3k • 38
bespokelabs/Bespoke-Stratos-17k
Viewer • Updated • 16.7k • 64.8k • 292
SymNoise: Advancing Language Model Fine-tuning with Symmetric Noise
Paper • 2312.01523 • Published