tomg-group-umd/Gemstone-256x23_cooldown
Text Generation • 48.4M • Updated • 22
AI security & privacy, algorithmic bias, foundations of ML
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Gemstones: A Model Suite for Multi-Faceted Scaling Laws