🤝 Open to Collab

Marwa El Kamil

maghwa

26 21 124

AI & ML interests

None yet

Recent Activity

liked a model 16 days ago

deepseek-ai/DeepSeek-V4-Flash

liked a dataset 23 days ago

ai-conferences/CVPR2026

liked a dataset 23 days ago

Anthropic/AnthropicInterviewer

View all activity

Organizations

upvoted a paper 24 days ago

AgentSocialBench: Evaluating Privacy Risks in Human-Centered Agentic Social Networks

Paper • 2604.01487 • Published Apr 1 • 11

upvoted 3 collections over 1 year ago

upvoted an article over 1 year ago

Article

Finding Moroccan Arabic (Darija) in Fineweb 2

omarkamali

•

Dec 8, 2024

• 23

upvoted an article almost 2 years ago

Article

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

aaditya, pminervini, clefourrier

•

Apr 19, 2024

• 202

upvoted a collection almost 2 years ago

Arabic Aya DPO Datasets

Collection

Our synthetic DPO datasets for Arabic Aya. • 5 items • Updated Jun 4, 2024 • 4

upvoted a paper about 2 years ago

101 Billion Arabic Words Dataset

Paper • 2405.01590 • Published Apr 29, 2024 • 6

upvoted an article about 2 years ago

Article

Tokenization Is A Dead Weight (Tokun Part 1)

apehex

•

Jun 27, 2024

• 18

upvoted 2 papers about 2 years ago

Tokenization Falling Short: The Curse of Tokenization

Paper • 2406.11687 • Published Jun 17, 2024 • 16

CroissantLLM: A Truly Bilingual French-English Language Model

Paper • 2402.00786 • Published Feb 1, 2024 • 26

upvoted an article about 2 years ago

Article

🥐CroissantLLM: A Truly Bilingual French-English Language Model

manu

•

Feb 5, 2024

• 15

upvoted a collection about 2 years ago

FrenchBench Evaluation datasets

Collection

These datasets are used to evaluate models on French performance using: https://github.com/EleutherAI/lm-evaluation-harness (from CroissantLLM paper) • 11 items • Updated Jun 7, 2024 • 8

upvoted an article about 2 years ago

Article

Introducing the Open Arabic LLM Leaderboard

alielfilali01, Hamza-Alobeidli2, rcojocaru, basma-b, clefourrier

•

May 14, 2024

• 104

upvoted a paper about 2 years ago

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Paper • 2306.05685 • Published Jun 9, 2023 • 43

upvoted 2 articles about 2 years ago

Article

Let's talk about LLM evaluation

clefourrier

•

May 23, 2024

• 212

Article

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

asoria, tdoehmen, senwu, lorr, vpm238

•

Apr 4, 2024

• 29

upvoted a paper about 2 years ago

Dynamic Typography: Bringing Words to Life

Paper • 2404.11614 • Published Apr 17, 2024 • 46

upvoted 2 papers over 2 years ago

BloombergGPT: A Large Language Model for Finance

Paper • 2303.17564 • Published Mar 30, 2023 • 33

Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 61

Marwa El Kamil

AI & ML interests

Recent Activity

Organizations

maghwa's activity

Finding Moroccan Arabic (Darija) in Fineweb 2

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Tokenization Is A Dead Weight (Tokun Part 1)

🥐CroissantLLM: A Truly Bilingual French-English Language Model

Introducing the Open Arabic LLM Leaderboard

Let's talk about LLM evaluation

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B