Shuyue Jia (Bruce)
shuyuej
AI & ML interests
A Ph.D. Student at @vkola-lab, Boston University. Passionate about Large Language Models (LLMs), Multimodal Foundation Models, Generative AI, and Medical AI.
Recent Activity
new activity
29 days ago
shuyuej/MedLLaMA3-70B-base-INT2-GPTQ:not run
liked
a model
about 1 month ago
ncbi/MedCPT-Query-Encoder
updated
a dataset
about 2 months ago
shuyuej/test_MRI_imaging
Organizations
shuyuej's activity
What Happens If the Prompt Exceeds 8,196 Tokens? And difference between input limit and context length limit?
1
2
#36 opened 4 months ago
by
averyyu99
quant versions?
5
1
#12 opened 4 months ago
by
apol

RecursionError: maximum recursion depth exceeded
1
#1 opened almost 2 years ago
by
WajihUllahBaig
missing model.safetensors.index.json
3
#1 opened 9 months ago
by
kresimirfijacko
Can you create gptq 8 bits quants?
1
#1 opened 9 months ago
by
rjmehta
Update quantize_config.json
1
#12 opened 9 months ago
by
shuyuej

Update config.json
1
#11 opened 9 months ago
by
shuyuej

Source codes to quantize the LLaMA 3.1 405B model
3
#10 opened 9 months ago
by
shuyuej

Request for Mistral Large Instruct GPTQ INT4
4
#2 opened 9 months ago
by
sparsh35
Missing config.json
2
5
#6 opened 9 months ago
by
wxl2001
Learning Rate during pretraining
1
#58 opened 9 months ago
by
shuyuej

Model max_seq_length
7
#6 opened 9 months ago
by
shuyuej

Model max_seq_length
1
#4 opened 9 months ago
by
shuyuej

Where can we find `eval_medical_llm.py` and `main.py`
1
#15 opened 11 months ago
by
shuyuej

Fine-Tune a gemma model for question answering
1
17
#62 opened about 1 year ago
by
Iamexperimenting
Weird Performance Issue with Gemma-7b compared to Gemma-2b with Qlora
6
#91 opened 12 months ago
by
UserDAN
What is the actual context size of mistralai/Mixtral-8x7B-Instruct-v0.1 model
4
#186 opened about 1 year ago
by
Pradeep1995

Very different results with float16. [Actually, gemma-7b-it does not work with float16]
3
6
#33 opened about 1 year ago
by
EarthWorm001