Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Jude Herwitz's picture

Jude Herwitz

jerwitz

·

AI & ML interests

None yet

Organizations

jerwitz 's collections 4

Agent training frameworks

RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning

Paper • 2504.20073 • Published Apr 24 • 12

Reasoning techniques (at inference)

InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models

Paper • 2503.06692 • Published Mar 9 • 2
Toward Multi-Session Personalized Conversation: A Large-Scale Dataset and Hierarchical Tree Framework for Implicit Reasoning

Paper • 2503.07018 • Published Mar 10

reasoning training via RLAIF

Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

Paper • 2504.20157 • Published Apr 28 • 37

Retrieval-intelligence

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29 • 53
In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents

Paper • 2503.08026 • Published Mar 11

Agent training frameworks

RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning

Paper • 2504.20073 • Published Apr 24 • 12

reasoning training via RLAIF

Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

Paper • 2504.20157 • Published Apr 28 • 37

Reasoning techniques (at inference)

InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models

Paper • 2503.06692 • Published Mar 9 • 2
Toward Multi-Session Personalized Conversation: A Large-Scale Dataset and Hierarchical Tree Framework for Implicit Reasoning

Paper • 2503.07018 • Published Mar 10

Retrieval-intelligence

ReasonIR: Training Retrievers for Reasoning Tasks

Paper • 2504.20595 • Published Apr 29 • 53
In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents

Paper • 2503.08026 • Published Mar 11

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs