Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
felixbrock 's Collections
llm-pretraining
cv-gen-ai-3D
OCR
dataset
cv-eval
cv-pre-eval
llm-model
RLHF/RLAIF
cv-embedding
llm-system
llm-gen-ai-text
text-to-image-model
llm-performance
llm-monitoring
llm-agent
llm-doc-retrieval
privacy/security
cv-performance
llm-eval
selflearning

llm-model

updated Sep 15, 2023
Upvote
-

  • One Wide Feedforward is All You Need

    Paper • 2309.01826 • Published Sep 4, 2023 • 33

  • Gated recurrent neural networks discover attention

    Paper • 2309.01775 • Published Sep 4, 2023 • 10

  • FLM-101B: An Open LLM and How to Train It with $100K Budget

    Paper • 2309.03852 • Published Sep 7, 2023 • 44

  • Large Language Models as Optimizers

    Paper • 2309.03409 • Published Sep 7, 2023 • 77

  • GPT Can Solve Mathematical Problems Without a Calculator

    Paper • 2309.03241 • Published Sep 6, 2023 • 18

  • Llama 2: Open Foundation and Fine-Tuned Chat Models

    Paper • 2307.09288 • Published Jul 18, 2023 • 243

  • NExT-GPT: Any-to-Any Multimodal LLM

    Paper • 2309.05519 • Published Sep 11, 2023 • 78

  • Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts

    Paper • 2309.04354 • Published Sep 8, 2023 • 15

  • Large Language Models for Compiler Optimization

    Paper • 2309.07062 • Published Sep 11, 2023 • 23
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs