PISCO models are compression models for RAG. They are intended as plug-in replacement for RAG systems with x5 faster inference