Hannibal

Hannibal046

AI & ML interests

None yet

Recent Activity

upvoted an article about 2 months ago

Article

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

and 5 others • Jul 16 • 67

liked a model 2 months ago

Tengyunw/qwen3_8b_eagle3

Updated 9 days ago • 1.81k • 20

liked a model 3 months ago

Qwen/Qwen3-Embedding-0.6B

Feature Extraction • 0.6B • Updated Jun 20 • 3.31M • 572

New activity in BAAI/bge-reranker-v2-m3 4 months ago

How to use MTEB to evaluate BAAI/bge-reranker-v2-m3 on the C-MTEB Reranking task

#33 opened 12 months ago by IeohMingChan

upvoted an article 4 months ago

Article

The Transformers Library: standardizing model definitions

and 3 others • May 15 • 117

upvoted 3 papers 8 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 418

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 298

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 284

updated 2 models about 1 year ago

Hannibal046/xrag-v1.1-7b

Text Generation • 7B • Updated Jul 13, 2024 • 5

Hannibal046/gtr_t5_nq_32_stage2

0.3B • Updated Jun 29, 2024 • 3 • 1

liked a model over 1 year ago

bosonai/Higgs-Llama-3-70B

Text Generation • 71B • Updated Aug 20, 2024 • 9.33k • 226

New activity in meta-llama/Meta-Llama-3-8B-Instruct over 1 year ago

The request to access the repo has been sent for several days, why hasn't it passed yet?

#70 opened over 1 year ago by water-cui

request to access is still pending a review

#50 opened over 1 year ago by Hoo1196

updated 2 models over 1 year ago

Hannibal046/xrag-moe

Text Generation • Updated Apr 24, 2024 • 4

Hannibal046/xrag-7b

Text Generation • Updated Apr 23, 2024 • 1.18k • 2

New activity in mistralai/Mistral-7B-Instruct-v0.1 over 1 year ago

Which padding side to choose while finetuning

👍 12

#47 opened almost 2 years ago by parikshit1619

updated 4 models over 1 year ago
