Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wambugu Muchemi's picture
1 23

Wambugu Muchemi

FrankXII
shtefcs's profile picture
·
  • Wambugu-Muchemi

AI & ML interests

None yet

Recent Activity

new activity about 2 months ago
sentence-transformers/all-MiniLM-L6-v2:Error 422
liked a model 5 months ago
deepseek-ai/Janus-Pro-7B
liked a model 8 months ago
ibm-granite/granite-geospatial-biomass
View all activity

Organizations

SmartSurf's profile picture

Collections 1

Treasure
  • Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

    Paper • 2403.07816 • Published Mar 12, 2024 • 42
  • microsoft/phi-1_5

    Text Generation • 1B • Updated Apr 29, 2024 • 171k • • 1.34k
  • Language models scale reliably with over-training and on downstream tasks

    Paper • 2403.08540 • Published Mar 13, 2024 • 15
  • Akashpb13/Swahili_xlsr

    Automatic Speech Recognition • 0.3B • Updated Aug 27, 2023 • 88 • 8
Treasure
  • Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

    Paper • 2403.07816 • Published Mar 12, 2024 • 42
  • microsoft/phi-1_5

    Text Generation • 1B • Updated Apr 29, 2024 • 171k • • 1.34k
  • Language models scale reliably with over-training and on downstream tasks

    Paper • 2403.08540 • Published Mar 13, 2024 • 15
  • Akashpb13/Swahili_xlsr

    Automatic Speech Recognition • 0.3B • Updated Aug 27, 2023 • 88 • 8

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs