Jingyuan Liu
toothacher17
AI & ML interests
NLP deep learning
Recent Activity
New activity about 2 months ago in moonshotai/Moonlight-16B-A3B-Instruct: why the c-eval result is 76.8 for base model but only 38.9 for instruct model?
Updated a model 2 months ago: moonshotai/Moonlight-16B-A3B
Updated a model 2 months ago: moonshotai/Moonlight-16B-A3B-Instruct
Organizations
None yet
toothacher17's activity
why the c-eval result is 76.8 for base model but only 38.9 for instruct model?
1
#8 opened about 2 months ago by xianf
Update modeling_deepseek.py
2
#4 opened 2 months ago by Daemontatox

Space to test?
7
#1 opened 2 months ago by celsowm

convert to gguf : AttributeError: TikTokenTokenizer has no attribute vocab
3
#2 opened 2 months ago by Doctor-Chad-PhD

Run this with chatllm.cpp
3
#5 opened 2 months ago by J22