Pedram Rostami

PedramR
  • PedramRostami

AI & ML interests

NLP, Machine Learning

Recent Activity

liked a model 1 day ago
electroglyph/Qwen3-Embedding-0.6B-onnx-uint8
liked a model 16 days ago
cross-encoder/ms-marco-MiniLM-L12-v2
upvoted an article 23 days ago
Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

Organizations

University of Tehran, gaokerena

upvoted an article 23 days ago

Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

Article • 24 days ago • 104

upvoted 5 papers almost 2 years ago

Training LLMs over Neurally Compressed Text

Paper • 2404.03626 • Published Apr 4, 2024 • 23

The Unreasonable Ineffectiveness of the Deeper Layers

Paper • 2403.17887 • Published Mar 26, 2024 • 82

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 189

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13, 2024 • 51

Tuning Language Models by Proxy

Paper • 2401.08565 • Published Jan 16, 2024 • 22