Tomer Ronen
tomer-nv
AI & ML interests
None yet
Recent Activity
authored a paper about 1 month ago
FFN Fusion: Rethinking Sequential Computation in Large Language Models
authored a paper 5 months ago
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
Organizations
tomer-nv's activity
Patching hf bug that creates wrong cache length if only inputs_embeds are passed to the model
#18 opened 7 months ago by tomer-nv