AI & ML interests

retrieval augmented generation, grounded generation, large language models, LLMs, question answering, chatbot

Recent Activity

ofermendΒ  updated a Space about 4 hours ago
vectara/cfpb-assistant
ofermendΒ  updated a Space about 4 hours ago
vectara/HMC-demo
ofermendΒ  updated a Space about 5 hours ago
vectara/Justice-Harvard
View all activity

vectara's activity

ofermendΒ 
posted an update 7 months ago
ofermendΒ 
posted an update 10 months ago
view post
Post
1756
If you are a debate fan or did this as an extracurricular activity as a kid, you might have fun with this demo - debate bot. Debate against AI/RAG:

vectara/debate-bot
Β·
nthakurΒ 
posted an update 10 months ago
view post
Post
3371
🦒 The SWIM-IR dataset contains 29 million text-retrieval training pairs across 27 diverse languages. It is one of the largest synthetic multilingual datasets generated using PaLM 2 on Wikipedia! πŸ”₯πŸ”₯

SWIM-IR dataset contains three subsets :
- Cross-lingual:nthakur/swim-ir-cross-lingual
- Monolingual: nthakur/swim-ir-monolingual
- Indic Cross-lingual: nthakur/indic-swim-ir-cross-lingual

Check it out:
https://huggingface.co/collections/nthakur/swim-ir-dataset-662ddaecfc20896bf14dd9b7
clefourrierΒ 
posted an update 10 months ago
view post
Post
5960
In a basic chatbots, errors are annoyances. In medical LLMs, errors can have life-threatening consequences 🩸

It's therefore vital to benchmark/follow advances in medical LLMs before even thinking about deployment.

This is why a small research team introduced a medical LLM leaderboard, to get reproducible and comparable results between LLMs, and allow everyone to follow advances in the field.

openlifescienceai/open_medical_llm_leaderboard

Congrats to @aaditya and @pminervini !
Learn more in the blog: https://huggingface.co/blog/leaderboard-medicalllm
clefourrierΒ 
posted an update 10 months ago
view post
Post
4751
Contamination free code evaluations with LiveCodeBench! πŸ–₯️

LiveCodeBench is a new leaderboard, which contains:
- complete code evaluations (on code generation, self repair, code execution, tests)
- my favorite feature: problem selection by publication date πŸ“…

This feature means that you can get model scores averaged only on new problems out of the training data. This means... contamination free code evals! πŸš€

Check it out!

Blog: https://huggingface.co/blog/leaderboard-livecodebench
Leaderboard: livecodebench/leaderboard

Congrats to @StringChaos @minimario @xu3kev @kingh0730 and @FanjiaYan for the super cool leaderboard!