Chain-of-Thought Reasoning Without Prompting
Paper • 2402.10200 • Published • 105
Self-Discover: Large Language Models Self-Compose Reasoning Structures
Paper • 2402.03620 • Published • 114
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper • 2404.03715 • Published • 61
Do language models plan ahead for future tokens?
Paper • 2404.00859 • Published • 2
Thomas Renkert
trenkert
AI & ML interests
None yet
Recent Activity
published a model about 23 hours ago
trenkert/p2
upvoted an article 1 day ago
FineWeb2-C: Help Build Better Language Models in Your Language
upvoted a collection about 1 month ago
EXAONE-3.5