King Han

kingh0730

AI & ML interests

Code LLMs

Recent Activity

liked a model 20 days ago
KwaiVGI/LivePortrait
liked a Space 20 days ago
KwaiVGI/LivePortrait
liked a Space 20 days ago
retwpay/waiNSFWIllustrious_v110
View all activity

Organizations

UC Berkeley's profile picture Live Code Bench's profile picture Skylow, Inc.'s profile picture

kingh0730's activity

upvoted an article 8 months ago
view article
Article

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

By StringChaos and 6 others β€’
β€’ 15
reacted to clefourrier's post with ❀️πŸ”₯ about 1 year ago
view post
Post
4776
Contamination free code evaluations with LiveCodeBench! πŸ–₯️

LiveCodeBench is a new leaderboard, which contains:
- complete code evaluations (on code generation, self repair, code execution, tests)
- my favorite feature: problem selection by publication date πŸ“…

This feature means that you can get model scores averaged only on new problems out of the training data. This means... contamination free code evals! πŸš€

Check it out!

Blog: https://huggingface.co/blog/leaderboard-livecodebench
Leaderboard: livecodebench/leaderboard

Congrats to @StringChaos @minimario @xu3kev @kingh0730 and @FanjiaYan for the super cool leaderboard!
published an article about 1 year ago
view article
Article

Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs

By StringChaos and 6 others β€’
β€’ 15
liked a Space about 1 year ago