Jimin Huang PRO

jiminHuang

AI & ML interests

Natural language processing and Computational finance

Recent Activity

Organizations

ChanceFocus Asset Management (Shanghai) Company's profile picture hippocrates's profile picture Clinical NLP Lab's profile picture Yale BIDS Xu Lab's profile picture Web_Novel_Trans's profile picture The Fin AI's profile picture Medical Knowledge Explorer's profile picture FINOS's profile picture

jiminHuang's activity

reacted to clefourrier's post with πŸ‘ about 10 hours ago
view post
Post
585
Gemma3 family is out! Reading the tech report, and this section was really interesting to me from a methods/scientific fairness pov.

Instead of doing over-hyped comparisons, they clearly state that **results are reported in a setup which is advantageous to their models**.
(Which everybody does, but people usually don't say)

For a tech report, it makes a lot of sense to report model performance when used optimally!
On leaderboards on the other hand, comparison will be apples to apples, but in a potentially unoptimal way for a given model family (like some user interact sub-optimally with models)

Also contains a cool section (6) on training data memorization rate too! Important to see if your model will output the training data it has seen as such: always an issue for privacy/copyright/... but also very much for evaluation!

Because if your model knows its evals by heart, you're not testing for generalization.
updated a Space 4 days ago
published a Space 4 days ago
upvoted an article 13 days ago
view article
Article

Plutus: Pioneering Greek Financial AI in a Global Context

By TheFinAI and 9 others β€’
β€’ 6
published an article 13 days ago
view article
Article

Plutus: Pioneering Greek Financial AI in a Global Context

By TheFinAI and 9 others β€’
β€’ 6