-
simonycl/self-seq-Meta-Llama-3-8B-alpaca_it_llmam_70b
Text Generation • Updated • 11 -
simonycl/self-seq-Meta-Llama-3-8B-wizardlm
Text Generation • Updated • 2 -
simonycl/self-seq-Meta-Llama-3-8B-alpaca_llmam_70b-iter-2
Text Generation • Updated • 2 -
simonycl/self-seq-Meta-Llama-3-8B-flancot_full_it_llama_70b
Text Generation • Updated • 5
Hanxu Hu
HanxuHU
AI & ML interests
Multi-Modality
Recent Activity
liked
a model
3 days ago
microsoft/bitnet-b1.58-2B-4T
upvoted
a
paper
4 days ago
Learning to Reason under Off-Policy Guidance
published
a model
26 days ago
HanxuHU/Qwen2-0.5B-SFT
Organizations
Collections
2
models
14
HanxuHU/Qwen2-0.5B-SFT
Updated
HanxuHU/self-seq-Meta-Llama-3-8B-tulu100k_seq_it2_llama70b
Updated
•
3
•
1
HanxuHU/self-seq-Meta-Llama-3-8B-tulu100k_base_ours_new_llama70b
Text Generation
•
Updated
•
1
HanxuHU/sit_all_models
Updated
HanxuHU/flancot_full_it1
Updated
HanxuHU/sharegpt_filter
Updated
HanxuHU/files
Updated
HanxuHU/my-mLLMs
Updated
HanxuHU/multilingual_mmmu
Updated
HanxuHU/alpaca_topk_indices
Updated
datasets
61
HanxuHU/mt_data
Viewer
•
Updated
•
796k
•
26
HanxuHU/gemma-llama-2-9b-it-ultrafeedback-annotate-ultrafb-judge-5-maj
Viewer
•
Updated
•
60k
•
12
HanxuHU/gemma2-9B-it-ultrafeedback-annotate-ultrafb-merge-single-filtered
Viewer
•
Updated
•
56.4k
•
15
HanxuHU/gemma2-9B-it-ultrafeedback-annotate-ultrafb-judge-5-majority-filtered
Viewer
•
Updated
•
55.2k
•
18
HanxuHU/gemma2-9B-it-ultrafeedback-annotate-ultrafb-merge-single-judge
Viewer
•
Updated
•
60.7k
•
13
HanxuHU/gemma-2-9b-it-ultrafeedback-annotate-ultrafb-merge-single-judge
Viewer
•
Updated
•
1.96k
•
16
HanxuHU/gemma2-9B-it-ultrafeedback-annotate-truth-judge
Viewer
•
Updated
•
60.7k
•
19
HanxuHU/gemma-2-9b-it-ultrafeedback-annotate-truth-judge
Viewer
•
Updated
•
1.96k
•
19
HanxuHU/gemma-2-9b-it-ultrafeedback-annotate-honesty-judge
Viewer
•
Updated
•
1.96k
•
12
HanxuHU/gemma-2-9b-it-ultrafeedback-annotate-safe-judge
Viewer
•
Updated
•
1.96k
•
31