Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Juyoung Suk's picture
7 15 28

Juyoung Suk

juyoungml
doubleyyh's profile picture 21world's profile picture kyudolski's profile picture
·
https://scottsuk0306.github.io/
  • scott_sjy
  • scottsuk0306

AI & ML interests

LLM

Organizations

Pseudo Lab's profile picture Theta One's profile picture KAIST AI's profile picture Human_Eval_RLHF's profile picture prometheus-eval's profile picture multilingual-reward-bench's profile picture interview-eval's profile picture

authored 3 papers 2 months ago

MM-Eval: A Multilingual Meta-Evaluation Benchmark for LLM-as-a-Judge and Reward Models

Paper • 2410.17578 • Published Oct 23, 2024 • 1

LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation

Paper • 2412.10424 • Published Dec 10, 2024 • 2

Trillion 7B Technical Report

Paper • 2504.15431 • Published Apr 21 • 37
authored a paper 7 months ago

Evaluating Language Models as Synthetic Data Generators

Paper • 2412.03679 • Published Dec 4, 2024 • 49
authored 3 papers about 1 year ago

The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models

Paper • 2406.05761 • Published Jun 9, 2024 • 3

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 123

CLIcK: A Benchmark Dataset of Cultural and Linguistic Intelligence in Korean

Paper • 2403.06412 • Published Mar 11, 2024 • 3
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs