AI & ML interests
Knowledge Distillation, Pruning, Quantization, KV Cache Compression, Latency, Inference Speed
Recent Activity
View all activity
DistAya
's datasets
None public yet
Knowledge Distillation, Pruning, Quantization, KV Cache Compression, Latency, Inference Speed