Mayank Mishra's picture

Mayank Mishra

mayank-mishra

AI & ML interests

Large Language Models, Distributed Training and Inference

Recent Activity

Organizations

IBM's profile picture BigCode's profile picture Aurora-M/MDEL's profile picture Blog-explorers's profile picture Aurora-M's profile picture IBM Granite's profile picture IBM Research's profile picture

Posts 4

Articles 3

Article
34

Improving Hugging Face Training Efficiency Through Packing with Flash Attention