
Matt

stallone

AI & ML interests

None yet

Organizations

IBM
Hugging Face Party @ PyTorch Conference

authored a paper 11 months ago

Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler

Paper • 2408.13359 • Published Aug 23, 2024 • 25
authored a paper 12 months ago

Scaling Granite Code Models to 128K Context

Paper • 2407.13739 • Published Jul 18, 2024 • 20
authored 5 papers about 1 year ago

The infrastructure powering IBM's Gen AI model development

Paper • 2407.05467 • Published Jul 7, 2024 • 2

Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks

Paper • 2407.00121 • Published Jun 27, 2024

Diversity Measurement and Subset Selection for Instruction Tuning Datasets

Paper • 2402.02318 • Published Feb 4, 2024 • 2

Rapid Development of Compositional AI

Paper • 2302.05941 • Published Feb 12, 2023

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

Paper • 2405.04324 • Published May 7, 2024 • 23