Jonathan Berant

joberant

https://www.cs.tau.ac.il/~joberant/

AI & ML interests

NLP

authored a paper over 1 year ago

From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty

Paper • 2407.06071 • Published Jul 8, 2024 • 7

authored a paper almost 2 years ago

Transforming and Combining Rewards for Aligning Large Language Models

Paper • 2402.00742 • Published Feb 1, 2024 • 12

authored a paper about 2 years ago

Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking

Paper • 2312.09244 • Published Dec 14, 2023 • 9

authored a paper over 2 years ago

Long-range Language Modeling with Self-retrieval

Paper • 2306.13421 • Published Jun 23, 2023 • 16