Tingchen Fu
TingchenFu
AI & ML interests
None yet
Recent Activity
liked a model 4 days ago
Qwen/QwQ-32B
upvoted a paper about 2 months ago
Autonomy-of-Experts Models
upvoted a paper about 2 months ago
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback
Organizations
None yet
TingchenFu's activity
tokenizer configuration
1
#9 opened 8 months ago by TingchenFu
[READ IF YOU DO NOT HAVE ACCESS] Getting access to the model
39
#172 opened 10 months ago by osanseviero

The performance on HumanEval.
#5 opened about 1 year ago by TingchenFu
how to accelerate the inference speed
2
#22 opened over 1 year ago by tobywang
How to split tensors to x shards?
2
#1 opened almost 2 years ago by Ede-CH
Can I load these weights into a model using 8 gpus?
2
#2 opened over 2 years ago by bournezz
There might be something wrong with the 500,000 and 600,000 step checkpoints
1
#2 opened almost 2 years ago by TingchenFu
This works, but training does not work at all
6
#4 opened almost 2 years ago by zokica
In config.json we only have n_embed=1024 while BLOOM was trained with a sequence length of 2048.
1
#9 opened almost 2 years ago by TingchenFu
Checkpoint seems not to be loaded
2
#8 opened over 2 years ago by Yulong-W