Clelia Astra Bertelli

as-cle-bert

https://www.cleliasportfolio.xyz

AI & ML interests

Recent Activity

replied to their post about 23 hours ago

Ever dreamt of ingesting into a vector DB that pile of CSVs, Word documents and presentations laying in some remote folders on your PC?🗂️ What if I told you that you can do it within three to six lines of code?🤯 Well, with my latest open-source project, 𝐢𝐧𝐠𝐞𝐬𝐭-𝐚𝐧𝐲𝐭𝐡𝐢𝐧𝐠 (https://github.com/AstraBert/ingest-anything), you can take all your non-PDF files, convert them to PDF, extract their text, chunk, embed and load them into a vector database, all in one go!🚀 How? It's pretty simple! 📁 The input files are converted into PDF by PdfItDown (https://github.com/AstraBert/PdfItDown) 📑 The PDF text is extracted using LlamaIndex readers 🦛 The text is chunked exploiting Chonkie 🧮 The chunks are embedded thanks to Sentence Transformers models 🗄️ The embeddings are loaded into a Qdrant vector database And you're done!✅ Curious of trying it? Install it by running: 𝘱𝘪𝘱 𝘪𝘯𝘴𝘵𝘢𝘭𝘭 𝘪𝘯𝘨𝘦𝘴𝘵-𝘢𝘯𝘺𝘵𝘩𝘪𝘯𝘨 And you can start using it in your python scripts!🐍 Don't forget to star it on GitHub and let me know if you have any feedback! ➡️ https://github.com/AstraBert/ingest-anything

replied to their post 3 days ago

posted an update 4 days ago

View all activity

Organizations

as-cle-bert's activity

replied to their post about 23 hours ago

I am working on supporting compatibility with other embeddding models, and we will have that soon, for now I had to reduce the compatibility only to Sentence Transformers.
For what concerns page numbers, I am also working toward having better and more extensive metadata: everything is a big work-in-progress and will come in future releases!

replied to their post 3 days ago

So, there are two possibilities:

If you mean customizing the embedder among the ones available within Sentence Transformers, it is very possible, you just have to change the embedding_model parameter when calling the ingest method
If you mean that you have your own embedding model (like saved on your PC), that is a tad more difficult. I think Sentence Transformer might allow loading the model from your PC as long as it is compatible with the package. I think that this guide might be useful in that regard

For now the package only supports Sentence Transformers models, in the future it will probably extend its support to other embedding models as well :)

posted an update 4 days ago

Post

2778

Ever dreamt of ingesting into a vector DB that pile of CSVs, Word documents and presentations laying in some remote folders on your PC?🗂️
What if I told you that you can do it within three to six lines of code?🤯
Well, with my latest open-source project, 𝐢𝐧𝐠𝐞𝐬𝐭-𝐚𝐧𝐲𝐭𝐡𝐢𝐧𝐠 (https://github.com/AstraBert/ingest-anything), you can take all your non-PDF files, convert them to PDF, extract their text, chunk, embed and load them into a vector database, all in one go!🚀
How? It's pretty simple!
📁 The input files are converted into PDF by PdfItDown (https://github.com/AstraBert/PdfItDown)
📑 The PDF text is extracted using LlamaIndex readers
🦛 The text is chunked exploiting Chonkie
🧮 The chunks are embedded thanks to Sentence Transformers models
🗄️ The embeddings are loaded into a Qdrant vector database

And you're done!✅
Curious of trying it? Install it by running:

𝘱𝘪𝘱 𝘪𝘯𝘴𝘵𝘢𝘭𝘭 𝘪𝘯𝘨𝘦𝘴𝘵-𝘢𝘯𝘺𝘵𝘩𝘪𝘯𝘨

And you can start using it in your python scripts!🐍
Don't forget to star it on GitHub and let me know if you have any feedback! ➡️ https://github.com/AstraBert/ingest-anything

5 replies

replied to their post 6 days ago

Hey @T-2000 , you're absolutely right! I'm in the process of making the application online so for now the repo got a bit messy, tomorrow it will be clean and ready to be spinned up also locally: sorry for the incovenient!

posted an update 8 days ago

Post

2938

Finding a job that matches with our resume shouldn't be difficult, especially now that we have AI... And still, we're drowning in unclear announcements, jobs whose skill requirements might not really fit us, and tons of material😵‍💫
That's why I decided to build 𝐑𝐞𝐬𝐮𝐦𝐞 𝐌𝐚𝐭𝐜𝐡𝐞𝐫 (https://github.com/AstraBert/resume-matcher), a fully open-source application that scans your resume and searches the web for jobs that match with it!🎉
The workflow is very simple:
🦙 A LlamaExtract agent parses the resume and extracts valuable data that represent your profile
🗄️The structured data are passed on to a Job Matching Agent (built with LlamaIndex😉) that uses them to build a web search query based on your resume
🌐 The web search is handled by Linkup, which finds the top matches and returns them to the Agent
🔎 The agent evaluates the match between your profile and the jobs, and then returns a final answer to you

So, are you ready to find a job suitable for you?💼 You can spin up the application completely locally and with Docker, starting from the GitHub repo ➡️ https://github.com/AstraBert/resume-matcher
Feel free to leave your feedback and let me know in the comments if you want an online version of Resume Matcher as well!✨

2 replies

replied to their post 18 days ago

I used good old Canva (pro :)

posted an update 22 days ago

Post

2924

Llama-4 is out and I couldn't resist but to cook something with it... So I came up with 𝐋𝐥𝐚𝐦𝐚𝐑𝐞𝐬𝐞𝐚𝐫𝐜𝐡𝐞𝐫 (https://llamaresearcher.com), your deep-research AI companion!🔎

The workflow behind 𝗟𝗹𝗮𝗺𝗮𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵𝗲𝗿 is simple:
💬 You submit a query
🛡️ Your query is evaluated by Llama 3 guard model, which deems it safe or unsafe
🧠 If your query is safe, it is routed to the Researcher Agent
⚙️ The Researcher Agent expands the query into three sub-queries, with which to search the web
🌐 The web is searched for each of the sub-queries
📊 The retrieved information is evaluated for relevancy against your original query
✍️ The Researcher Agent produces an essay based on the information it gathered, paying attention to referencing its sources

The agent itself is also built with easy-to-use and intuitive blocks:
🦙 LlamaIndex provides the agentic architecture and the integrations with the language models
⚡Groq makes Llama-4 available with its lightning-fast inference
🔎 Linkup allows the agent to deep-search the web and provides sourced answers
💪 FastAPI does the heavy loading with wrapping everything within an elegant API interface
⏱️ Redis is used for API rate limiting
🎨 Gradio creates a simple but powerful user interface

Special mention also to Lovable, which helped me build the first draft of the landing page for LlamaResearcher!💖

If you're curious and you want to try LlamaResearcher, you can - completely for free and without subscription - for 30 days from now ➡️ https://llamaresearcher.com
And if you're like me, and you like getting your hands in code and build stuff on your own machine, I have good news: this is all open-source, fully reproducible locally and Docker-ready🐋
Just go to the GitHub repo: https://github.com/AstraBert/llama-4-researcher and don't forget to star it, if you find it useful!⭐

As always, have fun and feel free to leave your feedback✨

2 replies

posted an update 26 days ago

Post

736

I heard someone saying 𝘃𝗼𝗶𝗰𝗲 assistants are the future, and someone else that 𝗠𝗖𝗣 will rule the AI world... So I decided to combine both!🚀

Meet 𝐓𝐲𝐒𝐕𝐀 (𝗧𝘆pe𝗦cript 𝗩oice 𝗔ssistant, https://github.com/AstraBert/TySVA), your (speaking) AI companion for everyday TypeScript programming tasks!🎙️

TySVA is a skilled TypeScript expert and, to provide accurate and up-to-date responses, she leverages the following workflow:
🗣️ If you talk to her, she converts the audio into a textual prompt, and use it a starting point to answer your questions (if you send a message, she'll use directly that💬)
🧠 She can solve your questions by (deep)searching the web and/or by retrieving relevant information from a vector database containing TypeScript documentation. If the answer is simple, she can also reply directly (no tools needed!)
🛜 To ease her life, TySVA has all the tools she needs available through Model Context Protocol (MCP)
🔊 Once she's done, she returns her answer to you, along with a voice summary of what she did and what solution she found

But how does she do that? What are her components?🤨

📖 Qdrant + HuggingFace give her the documentation knowledge, providing the vector database and the embeddings
🌐 Linkup provides her with up-to-date, grounded answers, connecting her to the web
🦙 LlamaIndex makes up her brain, with the whole agentic architecture
🎤 ElevenLabs gives her ears and mouth, transcribing and producing voice inputs and outoputs
📜 Groq provides her with speech, being the LLM provider behind TySVA
🎨 Gradio+FastAPI make up her face and fibers, providing a seamless backend-to-frontend integration

If you're now curious of trying her, you can easily do that by spinning her up locally (and with Docker!🐋) from the GitHub repo ➡️ https://github.com/AstraBert/TySVA

And feel free to leave any feedback!✨

posted an update about 1 month ago

Post

629

Drowning in handouts, documents and presentations from your professors and not knowing where to start?🌊😵‍💫
Well, I might have a tool for you: 𝐏𝐝𝐟𝟐𝐍𝐨𝐭𝐞𝐬 (https://github.com/AstraBert/pdf2notes) is an 𝗔𝗜-𝗽𝗼𝘄𝗲𝗿𝗲𝗱, 𝗼𝗽𝗲𝗻-𝘀𝗼𝘂𝗿𝗰𝗲 solution that lets you turn your unstructured and chaotic PDFs into nice and well-ordered notes in a matter of seconds!📝

𝗛𝗼𝘄 𝗱𝗼𝗲𝘀 𝗶𝘁 𝘄𝗼𝗿𝗸?
📄 You first upload a document
⚙️ LlamaParse by LlamaIndex extracts the text from the document, using DeepMind's Gemini 2 Flash to perform multi-modal parsing
🧠 Llama-3.3-70B by Groq turns the extracted text into notes!

The notes are not perfect or you want more in-depth insights? No problem:
💬 Send a direct message to the chatbot
⚙️ The chatbot will retrieve the chat history from a Postgres database
🧠 Llama-3.3-70B will produce the answer you need

All of this is nicely wrapped within a seamless backend-to-frontend framework powered by Gradio and FastAPI🎨

And you can even spin it up easily and locally, using Docker🐋

So, what are you waiting for? Go turn your hundreds of pages of chaotic learning material into neat and elegant notes ➡️ https://github.com/AstraBert/pdf2notes

And, if you would like an online demo, feel free to drop a comment - we'll see what we can build🚀

posted an update about 2 months ago

Post

1683

𝐑𝐀𝐆𝐜𝐨𝐨𝐧🦝 - 𝐀𝐠𝐞𝐧𝐭𝐢𝐜 𝐑𝐀𝐆 𝐭𝐨 𝐡𝐞𝐥𝐩 𝐲𝐨𝐮 𝐛𝐮𝐢𝐥𝐝 𝐲𝐨𝐮𝐫 𝐬𝐭𝐚𝐫𝐭𝐮𝐩

GitHub 👉 https://github.com/AstraBert/ragcoon

Are you building a startup and you're stuck in the process, trying to navigate hundreds of resources, suggestions and LinkedIn posts?😶‍🌫️
Well, fear no more, because 𝗥𝗔𝗚𝗰𝗼𝗼𝗻🦝 is here to do some of the job for you:

📃 It's built on free resources written by successful founders
⚙️ It performs complex retrieval operations, exploiting "vanilla" hybrid search, query expansion with an 𝗵𝘆𝗽𝗼𝘁𝗵𝗲𝘁𝗶𝗰𝗮𝗹 𝗱𝗼𝗰𝘂𝗺𝗲𝗻𝘁 approach and 𝗺𝘂𝗹𝘁𝗶-𝘀𝘁𝗲𝗽 𝗾𝘂𝗲𝗿𝘆 𝗱𝗲𝗰𝗼𝗺𝗽𝗼𝘀𝗶𝘁𝗶𝗼𝗻
📊 It evaluates the 𝗿𝗲𝗹𝗶𝗮𝗯𝗶𝗹𝗶𝘁𝘆 of the retrieved context, and the 𝗿𝗲𝗹𝗲𝘃𝗮𝗻𝗰𝘆 and 𝗳𝗮𝗶𝘁𝗵𝗳𝘂𝗹𝗻𝗲𝘀𝘀 of its own responses, in an auto-correction effort

RAGcoon🦝 is 𝗼𝗽𝗲𝗻-𝘀𝗼𝘂𝗿𝗰𝗲 and relies on easy-to-use components:

🔹LlamaIndex is at the core of the agent architecture, provisions the integrations with language models and vector database services, and performs evaluations
🔹 Qdrant is your go-to, versatile and scalable companion for vector database services
🔹Groq provides lightning-fast LLM inference to support the agent, giving it the full power of 𝗤𝘄𝗤-𝟯𝟮𝗕 by Qwen
🔹Hugging Face provides the embedding models used for dense and sparse retrieval
🔹FastAPI wraps the whole backend into an API interface
🔹𝗠𝗲𝘀𝗼𝗽 by Google is used to serve the application frontend

RAGcoon🦝 can be spinned up locally - it's 𝗗𝗼𝗰𝗸𝗲𝗿-𝗿𝗲𝗮𝗱𝘆🐋, and you can find the whole code to reproduce it on GitHub 👉 https://github.com/AstraBert/ragcoon

But there might be room for an online version of RAGcoon🦝: let me know if you would use it - we can connect and build it together!🚀

posted an update about 2 months ago

Post

2740

I just released a fully automated evaluation framework for your RAG applications!📈

GitHub 👉 https://github.com/AstraBert/diRAGnosis
PyPi 👉 https://pypi.org/project/diragnosis/

It's called 𝐝𝐢𝐑𝐀𝐆𝐧𝐨𝐬𝐢𝐬 and is a lightweight framework that helps you 𝗱𝗶𝗮𝗴𝗻𝗼𝘀𝗲 𝘁𝗵𝗲 𝗽𝗲𝗿𝗳𝗼𝗿𝗺𝗮𝗻𝗰𝗲 𝗼𝗳 𝗟𝗟𝗠𝘀 𝗮𝗻𝗱 𝗿𝗲𝘁𝗿𝗶𝗲𝘃𝗮𝗹 𝗺𝗼𝗱𝗲𝗹𝘀 𝗶𝗻 𝗥𝗔𝗚 𝗮𝗽𝗽𝗹𝗶𝗰𝗮𝘁𝗶𝗼𝗻𝘀.

You can launch it as an application locally (it's Docker-ready!🐋) or, if you want more flexibility, you can integrate it in your code as a python package📦

The workflow is simple:
🧠 You choose your favorite LLM provider and model (supported, for now, are Mistral AI, Groq, Anthropic, OpenAI and Cohere)
🧠 You pick the embedding models provider and the embedding model you prefer (supported, for now, are Mistral AI, Hugging Face, Cohere and OpenAI)
📄 You prepare and provide your documents
⚙️ Documents are ingested into a Qdrant vector database and transformed into a synthetic question dataset with the help of LlamaIndex
📊 The LLM is evaluated for the faithfulness and relevancy of its retrieval-augmented answer to the questions
📊 The embedding model is evaluated for hit rate and mean reciprocal ranking (MRR) of the retrieved documents

And the cool thing is that all of this is 𝗶𝗻𝘁𝘂𝗶𝘁𝗶𝘃𝗲 𝗮𝗻𝗱 𝗰𝗼𝗺𝗽𝗹𝗲𝘁𝗲𝗹𝘆 𝗮𝘂𝘁𝗼𝗺𝗮𝘁𝗲𝗱: you plug it in, and it works!🔌⚡

Even cooler? This is all built on top of LlamaIndex and its integrations: no need for tons of dependencies or fancy workarounds🦙
And if you're a UI lover, Gradio and FastAPI are there to provide you a seamless backend-to-frontend experience🕶️

So now it's your turn: you can either get diRAGnosis from GitHub 👉 https://github.com/AstraBert/diRAGnosis
or just run a quick and painless:

uv pip install diragnosis

To get the package installed (lightning-fast) in your environment🏃‍♀️

Have fun and feel free to leave feedback and feature/integrations requests on GitHub issues✨

posted an update 2 months ago

Post

2403

I built an AI agent app in less than 8 hours🤯
And, believe me, this is 𝗻𝗼𝘁 clickbait❌

GitHub 👉 https://github.com/AstraBert/PapersChat
Demo 👉 as-cle-bert/PapersChat

The app is called 𝐏𝐚𝐩𝐞𝐫𝐬𝐂𝐡𝐚𝐭, and it is aimed at 𝗺𝗮𝗸𝗶𝗻𝗴 𝗰𝗵𝗮𝘁𝘁𝗶𝗻𝗴 𝘄𝗶𝘁𝗵 𝘀𝗰𝗶𝗲𝗻𝘁𝗶𝗳𝗶𝗰 𝗽𝗮𝗽𝗲𝗿𝘀 𝗲𝗮𝘀𝗶𝗲𝗿.

𝐇𝐞𝐫𝐞 𝐢𝐬 𝐰𝐡𝐚𝐭 𝐭𝐡𝐞 𝐚𝐩𝐩 𝐝𝐨𝐞𝐬:

📄 Parses the papers that you upload thanks to LlamaIndex🦙 (either with LlamaParse or with simpler, local methods)
📄 Embeds documents both with a sparse and with a dense encoder to enable hybrid search
📄 Uploads the embeddings to Qdrant
⚙️ Activates an Agent based on mistralai/Mistral-Small-24B-Instruct-2501 that will reply to your prompt
🧠 Retrieves information relevant to your question from the documents
🧠 If no relevant information is found, it searches PubMed and arXiv databases
🧠 Returns a grounded answer to your prompt

𝐇𝐨𝐰 𝐝𝐢𝐝 𝐈 𝐦𝐚𝐧𝐚𝐠𝐞 𝐭𝐨 𝐦𝐚𝐤𝐞 𝐭𝐡𝐢𝐬 𝐚𝐩𝐩𝐥𝐢𝐜𝐚𝐭𝐢𝐨𝐧 𝐢𝐧 𝟖 𝐡𝐨𝐮𝐫𝐬?

Three key points:

- LlamaIndex🦙 provides countless integrations with LLM providers, text embedding models and vectorstore services, and takes care of the internal architecture of the Agent. You just plug it in, and it works!🔌⚡
- Qdrant is a vector database service extremely easy to set up and use: you just need a one-line Docker command😉
- Gradio makes frontend development painless and fast, while still providing modern and responsive interfaces🏗️

And a bonus point:

- Deploying the demo app couldn't be easier if you use Gradio-based Hugging Face Spaces🤗

So, no more excuses: build your own AI agent today and do it fast, (almost) for free and effortlessly🚀

And if you need a starting point, the code for PapersChat is open and fully reproducible on GitHub 👉 https://github.com/AstraBert/PapersChat

posted an update 2 months ago

Post

1403

𝐒𝐜𝐢𝐍𝐞𝐰𝐬𝐁𝐨𝐭 - 𝐑𝐞𝐩𝐨𝐫𝐭 𝐝𝐚𝐢𝐥𝐲 𝐒𝐜𝐢𝐞𝐧𝐜𝐞 𝐧𝐞𝐰𝐬 𝐨𝐧 𝐁𝐥𝐮𝐞𝐒𝐤𝐲

GitHub 👉 https://github.com/AstraBert/SciNewsBot
BlueSky 👉 https://bsky.app/profile/sci-news-bot.bsky.social

Hi there HF Community!🤗
I just created a very simple AI-powered bot that shares fact-checked news about Science, Environment, Energy and Technology on BlueSky :)

The bot takes news from Google News, filters out the sources that are not represented in the Media Bias Fact Check database, and then evaluates the reliability of the source based on the MBFC metrics. After that, it creates a catchy headline for the article and publishes the post on BlueSky📰

The cool thing? SciNewsBot is open-source and is cheap to maintain, as it is based on mistralai/Mistral-Small-24B-Instruct-2501 (via Mistral API). You can reproduce it locally, spinning it up on your machine, and even launch it on cloud through a comfy Docker setup🐋

Have fun and spread Science!✨

posted an update 3 months ago

Post

2766

𝐏𝐡𝐢𝐐𝐰𝐞𝐧𝐒𝐓𝐄𝐌 - 𝐚 𝐫𝐞𝐚𝐬𝐨𝐧𝐢𝐧𝐠 𝐚𝐬𝐬𝐢𝐬𝐭𝐚𝐧𝐭 𝐟𝐨𝐫 𝐲𝐨𝐮𝐫 𝐒𝐓𝐄𝐌 𝐞𝐝𝐮𝐜𝐚𝐭𝐢𝐨𝐧

Demo 👉 https://pqstem.org
GitHub 👉 https://github.com/AstraBert/PhiQwenSTEM

Hello HF community!🤗
Ever struggled with some complex Maths problem or with a very hard Physics question? Well, fear no more, because now you can rely on PhiQwenSTEM, an assistant specialized in answering STEM-related question!
The assistant can count on a knowledge base of 𝟭𝟱𝗸+ 𝘀𝗲𝗹𝗲𝗰𝘁𝗲𝗱 𝗦𝗧𝗘𝗠 𝗾𝘂𝗲𝘀𝘁𝗶𝗼𝗻-𝗮𝗻𝘀𝘄𝗲𝗿 𝗽𝗮𝗶𝗿𝘀 spanning the domains of Chemistry, Physics, Matemathics and Biochemistry (from EricLu/SCP-116K). It also relies on the combined power of microsoft/Phi-3.5-mini-instruct and Qwen/QwQ-32B-Preview to produce reliable and reasoned answers.
For the next 30 days, you will be able to try for free the web demo: https://pqstem.org
In the GitHub repo you can find all the information to reproduce PhiQwenSTEM 𝗼𝗻 𝘆𝗼𝘂𝗿 𝗹𝗼𝗰𝗮𝗹 𝗺𝗮𝗰𝗵𝗶𝗻𝗲, 𝗯𝗼𝘁𝗵 𝘃𝗶𝗮 𝘀𝗼𝘂𝗿𝗰𝗲 𝗰𝗼𝗱𝗲 𝗮𝗻𝗱 𝘄𝗶𝘁𝗵 𝗮 𝗰𝗼𝗺𝗳𝘆 𝗗𝗼𝗰𝗸𝗲𝗿🐋 𝘀𝗲𝘁𝘂𝗽: https://github.com/AstraBert/PhiQwenSTEM

posted an update 3 months ago

Post

1040

Hi HuggingFace community!🤗

I just published an article in which I try to articulate some counter-points to Dario Amodei's post "On DeepSeek and Export Control"👉 https://huggingface.co/blog/as-cle-bert/why-we-dont-need-export-control

I try to address several key passages of the third section from Amodei's post (https://darioamodei.com/on-deepseek-and-export-controls), bringing my perspective on the importance of open source, open knowledge and multipolarity in a crucial field for our future such as Artificial Intelligence.

Happy reading!✨

replied to their post 3 months ago

Hi!

I generally use LangChain + PyPDF, I leave here a code snippet:

from langchain_community.document_loaders import PyPDFLoader

def preprocess(pdf: str) -> list:
    """
    Uses LangChain's PyPDFLoader to extract text.
    """
    loader = PyPDFLoader(pdf)
    documents = loader.load()
    for doc in documents:
        print(doc.page_content)

This should give a more solid result :)

PS: Langchain is distributed under an MIT license, see their GitHub (https://github.com/langchain-ai/langchain)

posted an update 3 months ago

Post

1613

🚀𝐍𝐞𝐰 𝐝𝐞𝐦𝐨 𝐚𝐥𝐞𝐫𝐭🚀

Convert (almost) everything to PDF with 𝐏𝐝𝐟𝐈𝐭𝐃𝐨𝐰𝐧, now on Spaces! 👉 as-cle-bert/pdfitdown

You can also install it locally:

python3 -m pip install pdfitdown

Don't forget to star it on GitHub, if you find it useful! 👉 https://www.github.com/AstraBert/PdfItDown

3 replies

posted an update 3 months ago

Post

569

Hi HuggingFace Community🤗, I am thrilled to announce:

𝐪𝐝𝐮𝐫𝐥𝐥𝐦 𝚟𝟷-𝚛𝚌.𝟷 (https://github.com/AstraBert/qdurllm/tree/january-2025)

Qdurllm (𝗤𝗱rant, 𝗨𝗥Ls, 𝗟arge 𝗟anguage 𝗠odels) is a local Gradio (Gradio) application that lets you upload you web content to a local Qdrant (Qdrant) database and search through it or chat with it.

The 𝗻𝗲𝘄 𝗽𝗿𝗲-𝗿𝗲𝗹𝗲𝗮𝘀𝗲 (https://github.com/AstraBert/qdurllm/releases/tag/v1.0.0-rc.0) implements 𝘀𝗽𝗮𝗿𝘀𝗲 𝘀𝗲𝗮𝗿𝗰𝗵 (with prithivida/Splade_PP_en_v1) + 𝗿𝗲𝗿𝗮𝗻𝗸𝗶𝗻𝗴 (with nomic-ai/modernbert-embed-base by Hugging Face + Nomic AI) and 𝘀𝗲𝗺𝗮𝗻𝘁𝗶𝗰 𝗰𝗮𝗰𝗵𝗶𝗻𝗴 (based on Qdrant) and switched 𝗳𝗿𝗼𝗺 google/gemma-2-2b-it 𝘁𝗼 Qwen/Qwen2.5-1.5B-Instruct to conform to the SOTA landscape and to finally make the application based 𝗼𝗻𝗹𝘆 𝗼𝗻 𝘁𝗿𝘂𝗹𝘆 𝗼𝗽𝗲𝗻 𝗺𝗼𝗱𝗲𝗹𝘀.

The pre-release is 𝗮𝘃𝗮𝗶𝗹𝗮𝗯𝗹𝗲 𝗳𝗼𝗿 𝘁𝗲𝘀𝘁𝗶𝗻𝗴 and I would be really really happy if you wanted to give it a try and leave your feedback on the discussion thread on GitHub (https://github.com/AstraBert/qdurllm/discussions/8) or here on Hugging Face forum via comments under this post✨.
Find all the information to install and launch it here 👉 https://astrabert.github.io/qdurllm/#2-installation

replied to their post 4 months ago

Thank you so much for letting me know! This is indeed a very interesting role :)

posted an update 4 months ago

Post

1384

Hi HuggingFace community!🤗

I recently released PrAIvateSearch v2.0-beta.0 (https://github.com/AstraBert/PrAIvateSearch), my privacy-first, AI-powered, user-centered and data-safe application aimed at providing a local and open-source alternative to big AI search engines such as SearchGPT or Perplexity AI.

We have several key changes:

- New chat UI built with NextJS
- DuckDuckGo API used for web search instead of Google
- Qwen/Qwen2.5-1.5B-Instruct as a language model served on API (by FastAPI)
- Crawl4AI crawler used for web scraping
- Optimizations in the data workflow inside the application

Read more in my blog post 👉 https://huggingface.co/blog/as-cle-bert/search-the-web-with-ai

Have fun and feel free to leave feedback about how to improve the application!✨

3 replies