ShaneSue
0 followers
·
3 following
AI & ML interests
None yet
Organizations
None yet
ShaneSue's activity
New activity in
openbmb/MiniCPM-2B-128k
13 days ago
What is the purpose of the dim_model_base parameter?
#7 opened 13 days ago by
ShaneSue
New activity in
princeton-nlp/prolong-data-64K
13 days ago
Data Text
#1 opened 13 days ago by
ShaneSue
commented
a paper
23 days ago
The Curse of Depth in Large Language Models
Paper
•
2502.05795
•
Published
Feb 9
•
35
•
5
New activity in
NamCyan/thevault-docstringstyle
9 months ago
How is the data extracted from the-stack?
#1 opened 9 months ago by
ShaneSue
New activity in
ytzi/the-stack-dedup-python-filtered-gpt2
10 months ago
Is the dataset filtered by a model?
#2 opened 10 months ago by
ShaneSue
New activity in
bigcode/the-stack-v2-train-smol-ids
about 1 year ago
Downloading a small portion of dataset
2
#4 opened about 1 year ago by
thurac2022
New activity in
allenai/dolma
about 1 year ago
Where are the download URLs?
#17 opened about 1 year ago by
ShaneSue
New activity in
bigscience/bloomz
over 1 year ago
Why is the token vocab unreadable code?
#53 opened over 1 year ago by
ShaneSue
New activity in
bigscience/bloom
almost 2 years ago
Bloom's tokenizer vocab is messy code
2
#216 opened almost 2 years ago by
ShaneSue
New activity in
bigscience/bloom-7b1
almost 2 years ago
I can't find the max_sequence_length that bloom supports?
1
#45 opened almost 2 years ago by
ShaneSue