Hugging Face
Jimin Huang
PRO
jiminHuang
15 followers · 21 following
jimin-huang-239abb261
AI & ML interests
Natural language processing and computational finance
Recent Activity
Reacted to clefourrier's post with 👍 about 4 hours ago
The Gemma3 family is out! Reading the tech report, this section was really interesting to me from a methods/scientific fairness point of view. Instead of doing over-hyped comparisons, they clearly state that **results are reported in a setup which is advantageous to their models**. (Everybody does this, but people usually don't say so.) For a tech report, it makes a lot of sense to report model performance when used optimally! On leaderboards, on the other hand, comparisons will be apples to apples, but potentially suboptimal for a given model family (just as some users interact suboptimally with models). The report also contains a cool section (6) on training data memorization rate! It is important to see whether your model will output the training data it has seen as is: always an issue for privacy/copyright/... but also very much for evaluation! Because if your model knows its evals by heart, you're not testing for generalization.
Liked a Space 2 days ago: TheFinAI/open-finllm-reasoning-leaderboard
Updated a Space 3 days ago: XplainMind/README
Organizations
Articles
2
Article
6
Plutus: Pioneering Greek Financial AI in a Global Context
Article
74
Introducing the Open FinLLM Leaderboard
Papers
10
arxiv:2502.18772
arxiv:2502.11433
arxiv:2502.08127
arxiv:2410.14059
models
1
jiminHuang/llama31-8b-sft • Updated Sep 4, 2024
datasets
2
jiminHuang/flare-sm-acl-long • Updated Aug 23, 2024 • 50 • 94
jiminHuang/flare-sm-acl-long-instruction • Updated Aug 23, 2024 • 36 • 73