Jacky Jiang
t83714
AI & ML interests
None yet
Recent Activity
updated a model 1 day ago: t83714/qwen2.5-32b-instruct-limo-lora-adapter
updated a model 7 days ago: t83714/llama-3.1-8b-instruct-full-sft-limo
published a model 7 days ago: t83714/llama-3.1-8b-instruct-full-sft-limo
Organizations
None yet
t83714's activity
Cannot set sequence length higher than 2048 & doesn't support the optimized triton implementation of FlashAttention
#2 opened almost 2 years ago by t83714
Would it work well with sequence length > 2048?
#1 opened almost 2 years ago by SamuelAzran