Jacky Jiang
t83714
·
AI & ML interests
None yet
Recent Activity
liked
a model
26 days ago
Cohere/Cohere-embed-english-v3.0
updated
a model
about 1 month ago
t83714/llama-3.1-8b-instruct-full-sft-limo
new activity
about 1 month ago
t83714/qwen2.5-32b-instruct-limo-lora-adapter:Upload limo-lora-r4-atten-layers-only-qwen-32b-instruct-t0.0_k1_s0_e500.jsonl
Organizations
None yet
t83714's activity
Upload limo-lora-r4-atten-layers-only-qwen-32b-instruct-t0.0_k1_s0_e500.jsonl
#1 opened about 1 month ago
by
t83714
Would it work well with sequence length > 2048?
2
#1 opened almost 2 years ago
by
SamuelAzran