Jingyuan Liu

toothacher17

AI & ML interests

NLP deep learning

Recent Activity

new activity about 2 months ago

moonshotai/Moonlight-16B-A3B-Instruct:why the c-eval result is 76.8 for base model but only 38.9 for instruct model?

updated a model 2 months ago

moonshotai/Moonlight-16B-A3B

updated a model 2 months ago

moonshotai/Moonlight-16B-A3B-Instruct

Organizations

None yet

toothacher17's activity

New activity in moonshotai/Moonlight-16B-A3B-Instruct about 2 months ago

why the c-eval result is 76.8 for base model but only 38.9 for instruct model?

#8 opened about 2 months ago by

New activity in moonshotai/Moonlight-16B-A3B-Instruct 2 months ago

Update modeling_deepseek.py

#4 opened 2 months ago by

Space to test?

#1 opened 2 months ago by

convert to gguf : AttributeError: TikTokenTokenizer has no attribute vocab

#2 opened 2 months ago by

Doctor-Chad-PhD

Run this with chatllm.cpp

#5 opened 2 months ago by