Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Mann Patel's picture
19 48

Mann Patel

manncodes
gmayank100's profile picture 6b4b86ec-928a-4b7e-9c1e-8d5f009e3272's profile picture
·
  • punsbymann
  • manncodes
  • manncodes

AI & ML interests

NLP, Mech Interp, Reasoning, MLSystems

Recent Activity

upvoted a collection 4 days ago
— Long-context post-training 🧶 —
upvoted a paper about 1 month ago
Qwen2.5-1M Technical Report
liked a model about 1 month ago
ByteDance-Seed/Seed-OSS-36B-Instruct
View all activity

Organizations

Capital One's profile picture

manncodes 's collections 3

data
  • BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

    Paper • 2508.10975 • Published Aug 14 • 59
LLMs
  • TrustLLM: Trustworthiness in Large Language Models

    Paper • 2401.05561 • Published Jan 10, 2024 • 69
  • Exponentially Faster Language Modelling

    Paper • 2311.10770 • Published Nov 15, 2023 • 119
long context
  • MiniMax-01: Scaling Foundation Models with Lightning Attention

    Paper • 2501.08313 • Published Jan 14 • 297
data
  • BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

    Paper • 2508.10975 • Published Aug 14 • 59
long context
  • MiniMax-01: Scaling Foundation Models with Lightning Attention

    Paper • 2501.08313 • Published Jan 14 • 297
LLMs
  • TrustLLM: Trustworthiness in Large Language Models

    Paper • 2401.05561 • Published Jan 10, 2024 • 69
  • Exponentially Faster Language Modelling

    Paper • 2311.10770 • Published Nov 15, 2023 • 119
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs