hanspeterlyngsoeraaschoujensen/Reasoning_Data_25K_DeepScaleR_1.5B_Preview Viewer • Updated 3 days ago • 25.2k • 55
hanspeterlyngsoeraaschoujensen/reasoning_data_DeepScaleR_1.5B_Preview Viewer • Updated 3 days ago • 5.18k • 52
hanspeterlyngsoeraaschoujensen/10K_open_r1_OpenR1_Math_220k_synthetic_dataset Preview • Updated 27 days ago • 33
The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters • Running • 2.66k
Article • Model2Vec: Distill a Small Fast Model from any Sentence Transformer • By Pringled and 1 other • Oct 14, 2024 • 91
hanspeterlyngsoeraaschoujensen/week41_train_en_input_output Viewer • Updated Sep 24, 2024 • 6.41k • 24
hanspeterlyngsoeraaschoujensen/deberta-v3-base-finetuned-nlp-course Question Answering • Updated Sep 23, 2024 • 17
hanspeterlyngsoeraaschoujensen/distilbert-base-uncased-finetuned-nlp-course Question Answering • Updated Sep 23, 2024 • 14
hanspeterlyngsoeraaschoujensen/mt5-base-finetuned-nlp-course Question Answering • Updated Sep 21, 2024 • 13