Feifan Song

songff

AI & ML interests

None yet

Organizations

None yet

songff's activity

upvoted a paper 24 days ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 28 days ago • 48

liked a model 3 months ago

KbsdJames/Omni-Judge

Text Generation • Updated Oct 15, 2024 • 1.16k • 10

liked a dataset 3 months ago

KbsdJames/Omni-MATH

Viewer • Updated Oct 12, 2024 • 4.43k • 2.23k • 68

upvoted 2 papers 3 months ago

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17, 2024 • 30

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 75

upvoted 2 papers 4 months ago

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10, 2024 • 28

Toward General Instruction-Following Alignment for Retrieval-Augmented Generation

Paper • 2410.09584 • Published Oct 12, 2024 • 47

authored a paper 4 months ago

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Paper • 2410.07985 • Published Oct 10, 2024 • 28

upvoted a paper 5 months ago

Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment

Paper • 2403.11124 • Published Mar 17, 2024 • 1

authored a paper 5 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 72

upvoted a paper 5 months ago

Towards a Unified View of Preference Learning for Large Language Models: A Survey

Paper • 2409.02795 • Published Sep 4, 2024 • 72

authored 5 papers 5 months ago

ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization

Paper • 2402.09320 • Published Feb 14, 2024 • 6

Making Large Language Models Better Reasoners with Alignment

Paper • 2309.02144 • Published Sep 5, 2023 • 2

Interacting with Non-Cooperative User: A New Paradigm for Proactive Dialogue Policy

Paper • 2204.07433 • Published Apr 7, 2022

API-Bank: A Comprehensive Benchmark for Tool-Augmented LLMs

Paper • 2304.08244 • Published Apr 14, 2023

Scaling Data Diversity for Fine-Tuning Language Models in Human Alignment

Paper • 2403.11124 • Published Mar 17, 2024 • 1

authored a paper over 1 year ago

Preference Ranking Optimization for Human Alignment

Paper • 2306.17492 • Published Jun 30, 2023 • 6

New activity in OpenAssistant/reward-model-deberta-v3-large-v2 almost 2 years ago

Question about evaluating this reward model on Anthropic/hh-rlhf

#4 opened almost 2 years ago by songff