Aniket Maurya's picture
2 9

Aniket Maurya

aniketmaurya

AI & ML interests

Computer vision, NLP, MLOps

Recent Activity

upvoted an article 16 days ago
Tensor Parallelism
updated a model 26 days ago
aniketmaurya/receipt-model-2025
liked a model 26 days ago
aniketmaurya/receipt-model-2025
View all activity

Organizations

Gradio-Themes-Party's profile picture ICML2023's profile picture ZeroGPU Explorers's profile picture

aniketmaurya's activity

upvoted an article 16 days ago
replied to singhsidhukuldeep's post 26 days ago
view reply

woohoo thanks for checking LitServe @singhsidhukuldeep ! LitServe now has OpenAI API-compatible endpoint and you can also serve a LLM using vLLM engine with LitServe so you get both speed + flexibility.

reacted to singhsidhukuldeep's post with πŸš€ 26 days ago
view post
Post
894
Just tried LitServe from the good folks at @LightningAI !

Between llama.cpp and vLLM, there is a small gap where a few large models are not deployable!

That's where LitServe comes in!

LitServe is a high-throughput serving engine for AI models built on FastAPI.

Yes, built on FastAPI. That's where the advantage and the issue lie.

It's extremely flexible and supports multi-modality and a variety of models out of the box.

But in my testing, it lags far behind in speed compared to vLLM.

Also, no OpenAI API-compatible endpoint is available as of now.

But as we move to multi-modal models and agents, this serves as a good starting point. However, it’s got to become faster...

GitHub: https://github.com/Lightning-AI/LitServe
  • 1 reply
Β·
upvoted an article 9 months ago
view article
Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

β€’ 240