Moon Lockwood
moonlock
0 followers · 6 following
AI & ML interests
art programming music psychology
Recent Activity
upvoted a collection about 1 month ago
DeepSeek-R1
liked a model 12 months ago
nisten/Biggie-SmoLlm-0.15B-Base
reacted to akhaliq's post with 👍 over 1 year ago
Chain-of-Thought Reasoning Without Prompting paper page: https://huggingface.co/papers/2402.10200 In enhancing the reasoning capabilities of large language models (LLMs), prior research primarily focuses on specific prompting techniques such as few-shot or zero-shot chain-of-thought (CoT) prompting. These methods, while effective, often involve manually intensive prompt engineering. Our study takes a novel approach by asking: Can LLMs reason effectively without prompting? Our findings reveal that, intriguingly, CoT reasoning paths can be elicited from pre-trained LLMs by simply altering the decoding process. Rather than conventional greedy decoding, we investigate the top-k alternative tokens, uncovering that CoT paths are frequently inherent in these sequences. This approach not only bypasses the confounders of prompting but also allows us to assess the LLMs' intrinsic reasoning abilities. Moreover, we observe that the presence of a CoT in the decoding path correlates with a higher confidence in the model's decoded answer. This confidence metric effectively differentiates between CoT and non-CoT paths. Extensive empirical studies on various reasoning benchmarks show that the proposed CoT-decoding substantially outperforms the standard greedy decoding.
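The decoding change described in the post above can be illustrated with a small sketch. This is not the paper's implementation: the toy lookup table `TOY`, the helper names `toy_step`, `greedy_continue`, and `cot_decode`, and the confidence score (the average gap between the top two token probabilities over the whole continuation) are all assumptions made for illustration. The idea shown is only the one stated in the abstract: branch on the top-k first tokens instead of the single greedy one, decode each branch greedily, and prefer the branch the model is most confident in.

```python
from typing import Callable, Dict, List, Tuple

Probs = Dict[str, float]
StepFn = Callable[[List[str]], Probs]

# Hypothetical toy "model": maps a token prefix to next-token probabilities.
TOY: Dict[Tuple[str, ...], Probs] = {
    (): {"5": 0.6, "I": 0.4},
    ("5",): {"<eos>": 0.55, "apples": 0.45},
    ("I",): {"have": 0.95, "had": 0.05},
    ("I", "have"): {"8": 0.9, "5": 0.1},
    ("I", "have", "8"): {"<eos>": 0.97, ".": 0.03},
}


def toy_step(prefix: List[str]) -> Probs:
    """Return next-token probabilities for a prefix (empty if unknown)."""
    return TOY.get(tuple(prefix), {})


def greedy_continue(step: StepFn, prefix: List[str], max_len: int):
    """Extend prefix greedily; record the top-1 minus top-2 margin per step."""
    tokens = list(prefix)
    margins: List[float] = []
    while len(tokens) < max_len:
        probs = step(tokens)
        if not probs:
            break
        ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
        best, p1 = ranked[0]
        p2 = ranked[1][1] if len(ranked) > 1 else 0.0
        margins.append(p1 - p2)
        tokens.append(best)
        if best == "<eos>":
            break
    return tokens, margins


def cot_decode(step: StepFn, k: int, max_len: int = 16):
    """Branch on the top-k first tokens and keep the most confident path."""
    first = step([])
    branches = sorted(first, key=first.get, reverse=True)[:k]
    results = []
    for tok in branches:
        tokens, margins = greedy_continue(step, [tok], max_len)
        # Assumed confidence: mean margin over the continuation tokens.
        conf = sum(margins) / len(margins) if margins else 0.0
        results.append((conf, tokens))
    return max(results, key=lambda r: r[0])


if __name__ == "__main__":
    print(cot_decode(toy_step, k=1))  # plain greedy decoding
    print(cot_decode(toy_step, k=2))  # also explores the second first token
```

With `k=1` the sketch reduces to ordinary greedy decoding and returns the short direct answer; with `k=2` it also explores the second-ranked first token, whose longer continuation scores a higher average margin in this toy table.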
Organizations
None yet
moonlock's activity
liked a model 12 months ago
nisten/Biggie-SmoLlm-0.15B-Base • Text Generation • 0.2B • Updated Aug 7, 2024 • 1.94k • 236
liked 5 datasets over 1 year ago
Locutusque/UltraTextbooks • Updated Feb 2, 2024 • 5.52M • 1.23k • 196
wikimedia/wikipedia • Updated Jan 9, 2024 • 61.6M • 61.4k • 876
meta-math/MetaMathQA • Updated Dec 21, 2023 • 395k • 7.81k • 394
CollectiveCognition/chats-data-2023-09-22 • Updated Sep 23, 2023 • 156 • 96 • 18
teknium/OpenHermes-2.5 • Updated Apr 15, 2024 • 1M • 2.73k • 740
liked 2 models about 2 years ago
m-a-p/MERT-v1-95M • Audio Classification • Updated May 25 • 1.99M • 33
megemini/shanshui_style • Text-to-Image • Updated May 5, 2023 • 2
liked 4 datasets about 2 years ago
alkzar90/rock-glacier-dataset • Updated Sep 11, 2024 • 37 • 2
nkirschi/oxford-flowers • Updated Dec 11, 2022 • 8.19k • 628 • 17
TempoFunk/tempofunk-sdance • Updated May 7, 2023 • 44.6k • 5
vucinatim/spectrogram-captions • Updated Jan 3, 2023 • 1k • 50 • 4
liked 2 models about 2 years ago
m-a-p/MERT-v1-330M • Audio Classification • Updated May 25 • 65.4k • 67
google/pix2struct-screen2words-base • Visual Question Answering • Updated May 19, 2023 • 396 • 24