dame rajee

damerajee

AI & ML interests

None yet

damerajee's activity

upvoted an article about 17 hours ago

Article

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Mar 22, 2024

• 88

liked a model about 18 hours ago

mixedbread-ai/mxbai-rerank-base-v2

Text Ranking • Updated Apr 2 • 10.4k • 38

upvoted a paper 1 day ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published 2 days ago • 67

liked a model 1 day ago

unum-cloud/uform-vl-english-large

Feature Extraction • Updated Mar 28, 2024 • 12

liked a model 5 days ago

sesame/csm-1b

Text-to-Speech • Updated Mar 16 • 55k • 1.99k

liked a model 12 days ago

facebook/webssl-dino300m-full2b-224

Image Feature Extraction • Updated 14 days ago • 1.39k • 8

upvoted a collection 12 days ago

Web-SSL

Collection

17 items • Updated 14 days ago • 14

liked a model 29 days ago

RekaAI/reka-flash-3

Updated Mar 13 • 2.64k • 370

liked a model about 1 month ago

colbert-ir/colbertv2.0

Updated Apr 5, 2024 • 1.88M • 253

upvoted an article about 1 month ago

Article

🪆 Introduction to Matryoshka Embedding Models

Feb 23, 2024

• 105

liked a Space about 1 month ago

Deepseek Ai DeepSeek V3 0324

💻

Generate text with AI model

reacted to Kseniase's post with ❤️👀👀 about 1 month ago

Post

5129

8 types of RoPE

Since we use Transformers all the time, it's helpful to understand RoPE (Rotary Position Embedding). Because token order matters, RoPE encodes it by rotating token embeddings according to their position, so the model can tell which token comes first, second, and so on.

Here are 8 types of RoPE suited to different use cases:

1. Original RoPE -> RoFormer: Enhanced Transformer with Rotary Position Embedding (2104.09864)
Encodes token positions by rotating token embeddings in the complex plane via a position-based rotation matrix, thereby providing the self-attention mechanism with relative positional info.

2. LongRoPE -> LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens (2402.13753)
Extends the context window of pre-trained LLMs to 2048k tokens, leveraging non-uniformities in positional interpolation with an efficient search.

3. LongRoPE2 -> LongRoPE2: Near-Lossless LLM Context Window Scaling (2502.20082)
Extends the effective context window of pre-trained LLMs to the target length, rescaling RoPE guided by “needle-driven” perplexity.

4. Multimodal RoPE (MRoPE) -> Qwen2.5-VL Technical Report (2502.13923)
Decomposes positional embedding into 3 components: temporal, height and width, so that positional features are aligned across modalities: text, images and videos.

5. Directional RoPE (DRoPE) -> DRoPE: Directional Rotary Position Embedding for Efficient Agent Interaction Modeling (2503.15029)
Adds an identity scalar, improving how angles are handled without extra complexity. It helps balance accuracy, speed, and memory usage.

6. VideoRoPE -> VideoRoPE: What Makes for Good Video Rotary Position Embedding? (2502.05173)
Adapts RoPE for video, featuring 3D structure, low-frequency temporal allocation, diagonal layout, and adjustable spacing.

7. VRoPE -> VRoPE: Rotary Position Embedding for Video Large Language Models (2502.11664)
Another RoPE for video, which restructures positional indices and balances encoding for uniform spatial focus.

8. XPos (Extrapolatable Position Embedding) -> https://huggingface.co/papers/2212.10
Introduces an exponential decay factor into the rotation matrix, improving stability on long sequences.
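
To make the rotation idea from item 1 concrete, here is a minimal sketch of the original RoPE in plain Python. The function names `rope` and `dot`, the toy vectors, and the `base=10000.0` default are illustrative assumptions, not taken from any of the papers' code. It rotates each consecutive pair of features by an angle proportional to the token position, then checks that the query-key score depends only on the relative offset between the two positions.

```python
import math

def rope(vec, position, base=10000.0):
    """Rotate consecutive feature pairs of `vec` by position-dependent angles.

    Hypothetical helper for illustration; `base` is an assumed default.
    """
    dim = len(vec)
    if dim % 2 != 0:
        raise ValueError("feature dimension must be even")
    out = []
    for i in range(0, dim, 2):
        # Each pair gets its own frequency; later pairs rotate more slowly.
        theta = position * base ** (-i / dim)
        c, s = math.cos(theta), math.sin(theta)
        a, b = vec[i], vec[i + 1]
        # Standard 2D rotation of the pair (a, b) by angle theta.
        out.extend([a * c - b * s, a * s + b * c])
    return out

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

# Toy query and key vectors (made-up values).
q = [0.3, -1.2, 0.7, 0.5]
k = [1.1, 0.4, -0.6, 0.9]

# Same relative offset (3) at two different absolute positions.
score_near = dot(rope(q, 5), rope(k, 2))
score_far = dot(rope(q, 105), rope(k, 102))
```

Up to floating-point error, the two scores match because rotating the query by position m and the key by position n leaves a dot product that depends only on m - n. That is the relative positional information the attention mechanism receives.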

1 reply

reacted to onekq's post with 🚀🤯🔥 about 2 months ago

Post

3758

Folks, let's get ready. 🥳 We will be busy soon. 😅🤗 https://github.com/huggingface/transformers/pull/36878

liked a model about 2 months ago

ds4sd/SmolDocling-256M-preview

Image-Text-to-Text • Updated Mar 23 • 85.3k • 1.32k

upvoted a paper about 2 months ago

Words or Vision: Do Vision-Language Models Have Blind Faith in Text?

Paper • 2503.02199 • Published Mar 4 • 8