AI & ML interests
Knowledge Distillation, Pruning, Quantization, KV Cache Compression, Latency, Inference Speed
Recent Activity
View all activity
DistAya
's models
None public yet
Knowledge Distillation, Pruning, Quantization, KV Cache Compression, Latency, Inference Speed