shisa-ai's Collections
shisa-v2-research
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing
Paper • 2406.08464 • 70
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper • 2406.20094 • 102
argilla/magpie-ultra-v1.0
Viewer • 3.22M • 412 • 42
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Paper • 2401.01335 • 66
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper • 2404.03715 • 62
Self-Boosting Large Language Models with Synthetic Preference Data
Paper • 2410.06961 • 17
SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models
Paper • 2412.11605 • 18
Magpie-Align/Magpie-Reasoning-V1-150K-CoT-Deepseek-R1-Llama-70B
Viewer • 150k • 215 • 17
sbintuitions/modernbert-ja-130m
Fill-Mask • 4.69k • 41
bespokelabs/Bespoke-Stratos-17k
Viewer • 16.7k • 22.3k • 307
SymNoise: Advancing Language Model Fine-tuning with Symmetric Noise
Paper • 2312.01523
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper • 2411.15124 • 63