Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
14
1
25
DAEHEEKIM
andreaKIM
Follow
EsraN's profile picture
21world's profile picture
julien-c's profile picture
4 followers
ยท
3 following
daehuikim
AI & ML interests
LLM interactive chatbot
Recent Activity
upvoted
a
paper
15 days ago
Agentic Reinforced Policy Optimization
commented
on
a paper
15 days ago
Agentic Reinforced Policy Optimization
liked
a dataset
5 months ago
lbox/kbl
View all activity
Organizations
None yet
andreaKIM
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
commented
a paper
15 days ago
Agentic Reinforced Policy Optimization
Paper
โข
2507.19849
โข
Published
19 days ago
โข
138
โข
7
New activity in
google/gemma-7b-it
over 1 year ago
Instruct Training Dataset languages
๐
1
5
#12 opened over 1 year ago by
Deniaud
New activity in
upstage/SOLAR-10.7B-Instruct-v1.0
over 1 year ago
This model ranked 1st place in open llm leader board, However this model has lower performance in supervised fine tuning.
2
#8 opened over 1 year ago by
andreaKIM
New activity in
berkeley-nest/Starling-LM-7B-alpha
over 1 year ago
What could be instruction fine tuning prompt for this model?
5
#22 opened over 1 year ago by
andreaKIM
New activity in
mistralai/Mistral-7B-v0.1
almost 2 years ago
Why adaptor_model.bin becomes much larger than llama familes?
#34 opened almost 2 years ago by
andreaKIM
New activity in
hyunseoki/ko-en-llama2-13b
almost 2 years ago
Occured problem at long context
1
#3 opened almost 2 years ago by
Se-Hun
Load more