312.6 TFLOPS
92
66
183
Yaowei Zheng
hiyouga
2713 followers · 36 following
https://github.com/hiyouga
llamafactory_ai
hiyouga
AI & ML interests
LLM Training System
Recent Activity
new activity 20 days ago
google/gemma-3-270m-it: ValueError During SFT Fine-tuning with Gamma3 Model
hiyouga's activity
liked a model 12 days ago
microsoft/VibeVoice-1.5B • Text-to-Speech • 3B • Updated 7 days ago • 231k • 1.54k
liked a model 18 days ago
internlm/Intern-S1-mini • Image-Text-to-Text • 9B • Updated 14 days ago • 7.56k • 90
liked a dataset 25 days ago
nvidia/Llama-Nemotron-VLM-Dataset-v1 • Viewer • Updated 6 days ago • 2.86M • 7.21k • 139
liked a model 25 days ago
janhq/Jan-v1-4B • Text Generation • 4B • Updated 15 days ago • 12.6k • 327
liked a model 26 days ago
openbmb/MiniCPM-V-4 • Image-Text-to-Text • 4B • Updated 27 days ago • 21.6k • 459
liked a dataset 27 days ago
allenai/WildChat-4.8M • Viewer • Updated 27 days ago • 3.2M • 7.24k • 98
liked a model about 1 month ago
openai/gpt-oss-20b • Text Generation • 22B • Updated 12 days ago • 9.1M • 3.43k
liked a dataset about 1 month ago
JT-LM/JIUTIAN-TReB • Updated 1 day ago • 420 • 2
liked a Space about 2 months ago
Megatron Memory Estimator 👁 • Running • 16
Estimate GPU memory usage for Megatron models
liked a model about 2 months ago
moonshotai/Kimi-K2-Instruct • Text Generation • Updated 3 days ago • 397k • 2.14k
liked a dataset about 2 months ago
data-for-agents/insta-150k-v3 • Viewer • Updated May 28 • 146k • 169 • 15
liked a model 2 months ago
zai-org/GLM-4.1V-9B-Thinking • Image-Text-to-Text • 10B • Updated 10 days ago • 266k • 731
liked a dataset 3 months ago
Saigyouji-Yuyuko1000/dapo17k • Viewer • Updated Jun 23 • 17.9k • 214 • 2
liked 2 models 3 months ago
reducto/RolmOCR • Image-to-Text • 8B • Updated Apr 2 • 111k • 505
nanonets/Nanonets-OCR-s • Image-Text-to-Text • 4B • Updated Jun 20 • 274k • 1.49k
liked a dataset 3 months ago
open-thoughts/OpenThoughts3-1.2M • Viewer • Updated Jun 9 • 1.2M • 7.19k • 154
liked 2 models 3 months ago
open-thoughts/OpenThinker3-7B • Text Generation • 8B • Updated Jun 9 • 4.68k • 124
ByteDance-Seed/BAGEL-7B-MoT • Any-to-Any • 15B • Updated Jun 23 • 782 • 1.12k
liked a Space 3 months ago
AI Deadlines ⚡ • Running • 478
Generate project deadlines
liked a dataset 4 months ago
ByteDance-Seed/mga-fineweb-edu • Viewer • Updated May 19 • 846M • 1.92k • 33