AI & ML interests

Efficient machine learning for any model and hardware: pruning, quantization, compilation, and more.

Recent Activity

PrunaAI's activity

sharpenb
posted an update 4 days ago
We open-sourced the pruna package, which can be installed with pip install pruna :) It makes it easy to compress and evaluate AI models, including transformers and diffusers.

- GitHub repo: https://github.com/PrunaAI/pruna
- Documentation: https://docs.pruna.ai/en/stable/index.html

Now that the package is open source, anyone can inspect and contribute to the code. Beyond the code, we provide a detailed README, tutorials, benchmarks, and documentation to make compression, evaluation, and saving/loading/serving of AI models transparent.

Happy to share it with you and always interested in collecting your feedback :)
sharpenb
posted an update about 1 month ago
How to deploy compressed ML models in your pipeline?

We wrote a series of blog posts on this topic. We hope they are helpful:
- Standard Model Compression in ML Pipeline: https://www.pruna.ai/blog/standard-model-compression-ml-pipeline
- Boost Your Replicate Models with Pruna AI: A Step-by-Step Guide: https://www.pruna.ai/blog/guide-replicate-pruna-ai
- Pruna + Triton: A Winning Combination for High-Performance AI Deployments: https://www.pruna.ai/blog/pruna-triton-combination

Feel free to join our Discord (https://discord.com/invite/rskEr4BZJx) if you have questions ;)
sharpenb
posted an update 2 months ago