Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Satya Saurabh Mishra's picture

6 1

Satya Saurabh Mishra

saurabhmishra9

·

satyam19mishra

AI & ML interests

Data Science, Machine Learning, AI etc

Organizations

saurabhmishra9 's collections 5

meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 6.89M • • 4.63k
meta-llama/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Oct 24, 2024 • 1.65M • • 1.71k
meta-llama/Llama-3.3-70B-Instruct

Text Generation • 71B • Updated Dec 21, 2024 • 463k • • 2.5k
meta-llama/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • 109B • Updated May 22 • 639k • • 1.09k

AI Agents: Evolution, Architecture, and Real-World Applications

Paper • 2503.12687 • Published Mar 16 • 2

Inference Optimizations

Inference Optimization of Foundation Models on AI Accelerators

Paper • 2407.09111 • Published Jul 12, 2024
A Survey on Inference Optimization Techniques for Mixture of Experts Models

Paper • 2412.14219 • Published Dec 18, 2024

Prompting and RAG

Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks

Paper • 2412.15605 • Published Dec 20, 2024 • 2
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 78
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity

Paper • 2403.14403 • Published Mar 21, 2024 • 7
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations

Paper • 2412.13171 • Published Dec 17, 2024 • 35

The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities

Paper • 2408.13296 • Published Aug 23, 2024 • 1

meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 6.89M • • 4.63k
meta-llama/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Oct 24, 2024 • 1.65M • • 1.71k
meta-llama/Llama-3.3-70B-Instruct

Text Generation • 71B • Updated Dec 21, 2024 • 463k • • 2.5k
meta-llama/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • 109B • Updated May 22 • 639k • • 1.09k

Prompting and RAG

Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks

Paper • 2412.15605 • Published Dec 20, 2024 • 2
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 78
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity

Paper • 2403.14403 • Published Mar 21, 2024 • 7
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations

Paper • 2412.13171 • Published Dec 17, 2024 • 35

AI Agents: Evolution, Architecture, and Real-World Applications

Paper • 2503.12687 • Published Mar 16 • 2

The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities

Paper • 2408.13296 • Published Aug 23, 2024 • 1

Inference Optimizations

Inference Optimization of Foundation Models on AI Accelerators

Paper • 2407.09111 • Published Jul 12, 2024
A Survey on Inference Optimization Techniques for Mixture of Experts Models

Paper • 2412.14219 • Published Dec 18, 2024

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs