instruction-pretrain
127 followers · 5 following
https://huggingface.co/papers/2406.14491
DaixuanC45443
AI & ML interests
Synthetic Instructions for Pre-Training
Recent Activity
Upvoted a paper 21 days ago: FlowRL: Matching Reward Distributions for LLM Reasoning
Upvoted a paper 4 months ago: Reasoning with Exploration: An Entropy Perspective
New activity 4 months ago in instruction-pretrain/finance-Llama3-8B: How large is the corpus size used for pretraining the finance LLaMA?
Organizations
None yet
Models (5)
instruction-pretrain/InstructLM-500M • Text Generation • 0.6B • Updated Mar 4 • 4 • 34
instruction-pretrain/InstructLM-1.3B • Text Generation • 1B • Updated Mar 4 • 23 • 43
instruction-pretrain/finance-Llama3-8B • Text Generation • 8B • Updated Mar 1 • 3.54k • 68
instruction-pretrain/medicine-Llama3-8B • Text Generation • 8B • Updated Mar 1 • 8 • 38
instruction-pretrain/instruction-synthesizer • Text Generation • 7B • Updated Mar 1 • 20 • 79
Datasets (3)
instruction-pretrain/ft-instruction-synthesizer-collection • Viewer • Updated Mar 1 • 249k • 159 • 62
instruction-pretrain/medicine-instruction-augmented-corpora • Preview • Updated Mar 1 • 88 • 13
instruction-pretrain/general-instruction-augmented-corpora • Preview • Updated Mar 1 • 1.44k • 18