Open to Collab

18 70 48

Mohammed Hamdy

mmhamdy

AI & ML interests

TechBio | AI4Sci | NLP | Reinforcement Learning

Recent Activity

posted an update 17 days ago

The new DeepSeek Engram paper is super fun! It also integrates mHC, and I suspect they're probably releasing all these papers to make the V4 report of reasonable length😄 Here's a nice short summary from Gemini

upvoted an article 2 months ago

Continuous batching from first principles

reacted to Kseniase's post with ❤️ 2 months ago

12 Types of JEPA Since Yann LeCun together with Randall Balestriero released a new paper on JEPA (Joint-Embedding Predictive Architecture), laying out its theory and introducing an efficient practical version called LeJEPA, we figured you might need even more JEPA. Here are 7 recent JEPA variants plus 5 iconic ones: 1. LeJEPA → https://huggingface.co/papers/2511.08544 Explains a full theory for JEPAs, defining the “ideal” JEPA embedding as an isotropic Gaussian, and proposes the SIGReg objective to push JEPA toward this ideal, resulting in practical LeJEPA 2. JEPA-T → https://huggingface.co/papers/2510.00974 A text-to-image model that tokenizes images and captions with a joint predictive Transformer, enhances fusion with cross-attention and text embeddings before training loss, and generates images by iteratively denoising visual tokens conditioned on text 3. Text-JEPA → https://huggingface.co/papers/2507.20491 Converts natural language into first-order logic, with a Z3 solver handling reasoning, enabling efficient, explainable QA with far lower compute than large LLMs 4. N-JEPA (Noise-based JEPA) → https://huggingface.co/papers/2507.15216 Connects self-supervised learning with diffusion-style noise by using noise-based masking and multi-level schedules, especially improving visual classification 5. SparseJEPA → https://huggingface.co/papers/2504.16140 Adds sparse representation learning to make embeddings more interpretable and efficient. It groups latent variables by shared semantic structure using a sparsity penalty while preserving accuracy 6. TS-JEPA (Time Series JEPA) → https://huggingface.co/papers/2509.25449 Adapts JEPA to time-series by learning latent self-supervised representations and predicting future latents for robustness to noise and confounders Read further below ↓ It you like it, also subscribe to the Turing Post: https://www.turingpost.com/subscribe

View all activity

Organizations

posted an update 17 days ago

Post

3027

upvoted an article 2 months ago

Article

Continuous batching from first principles

Nov 25, 2025

•

313

reacted to Kseniase's post with ❤️ 2 months ago

Post

6180

12 Types of JEPA

Since Yann LeCun together with Randall Balestriero released a new paper on JEPA (Joint-Embedding Predictive Architecture), laying out its theory and introducing an efficient practical version called LeJEPA, we figured you might need even more JEPA. Here are 7 recent JEPA variants plus 5 iconic ones:

1. LeJEPA → LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics (2511.08544)
Explains a full theory for JEPAs, defining the “ideal” JEPA embedding as an isotropic Gaussian, and proposes the SIGReg objective to push JEPA toward this ideal, resulting in practical LeJEPA

2. JEPA-T → JEPA-T: Joint-Embedding Predictive Architecture with Text Fusion for Image Generation (2510.00974)
A text-to-image model that tokenizes images and captions with a joint predictive Transformer, enhances fusion with cross-attention and text embeddings before training loss, and generates images by iteratively denoising visual tokens conditioned on text

3. Text-JEPA → Speaking in Words, Thinking in Logic: A Dual-Process Framework in QA Systems (2507.20491)
Converts natural language into first-order logic, with a Z3 solver handling reasoning, enabling efficient, explainable QA with far lower compute than large LLMs

4. N-JEPA (Noise-based JEPA) → Improving Joint Embedding Predictive Architecture with Diffusion Noise (2507.15216)
Connects self-supervised learning with diffusion-style noise by using noise-based masking and multi-level schedules, especially improving visual classification

5. SparseJEPA → SparseJEPA: Sparse Representation Learning of Joint Embedding Predictive Architectures (2504.16140)
Adds sparse representation learning to make embeddings more interpretable and efficient. It groups latent variables by shared semantic structure using a sparsity penalty while preserving accuracy

6. TS-JEPA (Time Series JEPA) → Joint Embeddings Go Temporal (2509.25449)
Adapts JEPA to time-series by learning latent self-supervised representations and predicting future latents for robustness to noise and confounders

Read further below ↓
It you like it, also subscribe to the Turing Post: https://www.turingpost.com/subscribe

1 reply

liked a Space 3 months ago

Unlocking On-Policy Distillation for Any Model Family

📝

Improve model performance by transferring knowledge between different model families

upvoted 2 articles 3 months ago

Article

Promoter-GPT: Writing DNA Instructions with Language Models

Oct 22, 2025

•

Article

Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models

Oct 20, 2025

•

liked a dataset 4 months ago

transferable-samplers/many-peptides-md

Updated Dec 15, 2025 • 11.7k • 7

published an article 4 months ago

Article

The Next Frontier: Large Language Models In Biology

Oct 12, 2025

•

liked 3 Spaces 4 months ago

Science Release Heatmap

🔥

Explore AI4Science models and organizations from the past year

Maintain the unmaintainable

📚

Explore the complex relationships between 400+ machine learning models

Transformers Timeline

🤗

Interactive timeline to explore the 🤗Transformers models

published a Space 4 months ago

BioLLM Story

🌖

Create and render documents with code and text using Quarto

reacted to AdinaY's post with 🔥 4 months ago

Post

3538

BAAI has released ROME🔥 evaluating 30+ large reasoning models on text & visual reasoning

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions (2509.17177)

✨Tests visual reasoning, not just recognition
✨Covers capability × alignment × safety × efficiency
✨More transparent & reliable (less data contamination)
✨Helps make real-world deployment choices

updated a Space 4 months ago

README

📉

published a Space 4 months ago

README

📉

liked a model 6 months ago

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 211k • 1.22k

upvoted an article 7 months ago

Article

FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages

Jul 8, 2025

•

liked a dataset 8 months ago

nvidia/Nemotron-Personas-USA

Viewer • Updated Dec 16, 2025 • 1M • 3.31k • 248

reacted to AdinaY's post with 👍 8 months ago

Post

3193

RoboBrain 2.0🔥 OPEN embedded brain model by BAAIBeijing

BAAI/RoboBrain2.0-7B

✨ 7B - Apache 2.0 / 32B coming soon
✨ Supports multiple images, long videos, and high-resolution visuals
✨ Spatial + temporal reasoning
✨ Real-time memory & scene graphs

upvoted an article 8 months ago

Article

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

Jun 4, 2025

•

Mohammed Hamdy

AI & ML interests

Recent Activity

Organizations

mmhamdy's activity

Continuous batching from first principles

Unlocking On-Policy Distillation for Any Model Family

Promoter-GPT: Writing DNA Instructions with Language Models

Scaling Test-Time Compute to Achieve Gold Medal at IOI 2025 with Open-Weight Models

The Next Frontier: Large Language Models In Biology

Science Release Heatmap

Maintain the unmaintainable

Transformers Timeline

BioLLM Story

README

README

FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes