Hugging Face
Ronan McGovern
PRO
RonanMcGovern
54 followers · 12 following
https://ronanmcgovern.com
RonanKMcGovern
AI & ML interests
Open source LLMs. Fine-tuning. Summarisation. Patents.
RonanMcGovern's activity
New activity in SGLang/DeepSeek-V3-NextN, 4 days ago: Is this MTP head just for predicting one token ahead? (#1 opened 4 days ago by RonanMcGovern)
updated a model 7 days ago: Trelis/Llama-3.2-1B-Instruct_SFT_wait (Text Generation • Updated 7 days ago)
published a model 7 days ago: Trelis/Llama-3.2-1B-Instruct_SFT_wait (Text Generation • Updated 7 days ago)
updated a model 7 days ago: Trelis/Llama-3.2-1B-Instruct_GRPO_1_chkpt100_16bit (Text Generation • Updated 7 days ago)
published a model 7 days ago: Trelis/Llama-3.2-1B-Instruct_GRPO_1_chkpt100_16bit (Text Generation • Updated 7 days ago)
updated a model 9 days ago: Trelis/Llama-3.2-1B-Instruct_SFT_2 (Text Generation • Updated 9 days ago • 2)
published a model 9 days ago: Trelis/Llama-3.2-1B-Instruct_SFT_2 (Text Generation • Updated 9 days ago • 2)
updated a model 9 days ago: Trelis/Llama-3.2-1B-Instruct_SFT_1_ORPO_2 (Text Generation • Updated 9 days ago • 3)
published a model 9 days ago: Trelis/Llama-3.2-1B-Instruct_SFT_1_ORPO_2 (Text Generation • Updated 9 days ago • 3)
updated a model 9 days ago: Trelis/Llama-3.2-1B-Instruct_SFT_1_SFT_2 (Text Generation • Updated 9 days ago)
published a model 9 days ago: Trelis/Llama-3.2-1B-Instruct_SFT_1_SFT_2 (Text Generation • Updated 9 days ago)
updated a model 9 days ago: Trelis/Llama-3.2-1B-Instruct_ORPO_1 (Text Generation • Updated 9 days ago • 3)
published a model 9 days ago: Trelis/Llama-3.2-1B-Instruct_ORPO_1 (Text Generation • Updated 9 days ago • 3)
updated a model 9 days ago: Trelis/Llama-3.2-1B-Instruct_SFT_1 (Text Generation • Updated 9 days ago • 3)
published a model 9 days ago: Trelis/Llama-3.2-1B-Instruct_SFT_1 (Text Generation • Updated 9 days ago • 3)
updated a model 10 days ago: Trelis/Llama-3.2-1B-Instruct_ORPO_1_2p5em5lr (Text Generation • Updated 10 days ago • 6)
published a model 10 days ago: Trelis/Llama-3.2-1B-Instruct_ORPO_1_2p5em5lr (Text Generation • Updated 10 days ago • 6)
New activity in Qwen/Qwen2-VL-7B-Instruct, 11 days ago: [BUG] {'use_reentrant': True} results in "Gradients will be None" • 2 (#74 opened 24 days ago by RonanMcGovern)
updated a model 15 days ago: Trelis/Llama-3.2-1B-Instruct_SFT_step1 (Text Generation • Updated 15 days ago • 9)
published a model 15 days ago: Trelis/Llama-3.2-1B-Instruct_SFT_step1 (Text Generation • Updated 15 days ago • 9)