Kanishk Gandhi's picture

4 8 4

Kanishk Gandhi

obiwan96

·

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

obiwan96/qwen-cd-100

published a model about 1 month ago

obiwan96/qwen-cd-100

updated a dataset about 1 month ago

obiwan96/countdown-env

View all activity

Organizations

None yet

authored a paper 4 months ago

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Paper • 2503.01307 • Published Mar 3 • 39

authored 2 papers 6 months ago

Understanding Social Reasoning in Language Models with Language Models

Paper • 2306.15448 • Published Jun 21, 2023 • 1

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published Jan 8 • 97

authored a paper 7 months ago

Surveying the Effects of Quality, Diversity, and Complexity in Synthetic Data From Large Language Models

Paper • 2412.02980 • Published Dec 4, 2024 • 15

authored 4 papers 10 months ago

Eliciting Compatible Demonstrations for Multi-Human Imitation Learning

Paper • 2210.08073 • Published Oct 14, 2022

Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels

Paper • 2404.14313 • Published Apr 22, 2024

Human-like Affective Cognition in Foundation Models

Paper • 2409.11733 • Published Sep 18, 2024 • 6

Stream of Search (SoS): Learning to Search in Language

Paper • 2404.03683 • Published Apr 1, 2024 • 32

authored a paper about 2 years ago

Certified Reasoning with Language Models

Paper • 2306.04031 • Published Jun 6, 2023 • 2