-
Attention Is All You Need
Paper • 1706.03762 • Published • 125 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper • 1810.04805 • Published • 29 -
Universal Language Model Fine-tuning for Text Classification
Paper • 1801.06146 • Published • 8 -
Language Models are Few-Shot Learners
Paper • 2005.14165 • Published • 20
Effi PRO
itseffi
AI & ML interests
None yet
Recent Activity
upvoted a changelog about 19 hours ago
Publish models from CI without HF_TOKEN liked a Space about 19 hours ago
HuggingFaceBio/carbon-tokenization upvoted a paper about 19 hours ago
Intelligence per Watt: Measuring Intelligence Efficiency of Local AI