Ezzaldeen Mousa

ezzaldeen

AI & ML interests

Deep Cooking :)

Recent Activity

updated a model about 1 month ago

ezzaldeen/Qwen2.5-1.5B-Open-R1-Distill

upvoted a paper about 1 month ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

updated a collection about 2 months ago

🤗 Open-R1-Distill

upvoted a paper about 1 month ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 119

upvoted 2 articles 2 months ago

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Article • Oct 7, 2024 • 43

Open R1: Update #3

Article • and 9 others • Mar 11 • 295

upvoted 2 articles 3 months ago

Mixture of Experts Explained

Article • and 5 others • Dec 11, 2023 • 722

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7 • 181

upvoted a collection 4 months ago

Multilingual LLM Evaluation

Collection • Multilingual Evaluation Benchmarks • 8 items • Updated Mar 3 • 25

upvoted an article 5 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • and 2 others • Jan 28 • 873

upvoted an article 7 months ago

Towards a Fully Arabic Retrieval-Augmented Generation (RAG) Pipeline:

Article • Nov 30, 2024 • 20

upvoted a paper over 1 year ago

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Paper • 2404.05726 • Published Apr 8, 2024 • 23

upvoted a collection over 1 year ago

Model Merging

Collection • Model Merging is a very popular technique nowadays in LLMs. Here is a chronological list of papers in the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 243

upvoted 5 papers over 1 year ago

upvoted 3 papers almost 2 years ago

LoRA: Low-Rank Adaptation of Large Language Models

Paper • 2106.09685 • Published Jun 17, 2021 • 41

Intrinsic Dimensionality Explains the Effectiveness of Language Model Fine-Tuning

Paper • 2012.13255 • Published Dec 22, 2020 • 4

3D-LLM: Injecting the 3D World into Large Language Models

Paper • 2307.12981 • Published Jul 24, 2023 • 37
