25 6 6

Andrea Zugarini

azugarini

AI & ML interests

Natural Language Processing, Language Models, Language Model Compression

Recent Activity

upvoted a paper 22 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

updated a collection 2 months ago

Clue-instruct

liked a model 2 months ago

expertai/SLIMER-PARALLEL-LLaMA3

View all activity

Organizations

azugarini's activity

upvoted a paper 22 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 124

updated a collection 2 months ago

Clue-instruct

Collection

Clue-instruct dataset and different models fine-tuned on it. • 8 items • Updated Nov 8, 2024

liked a model 2 months ago

expertai/SLIMER-PARALLEL-LLaMA3

Text Generation • Updated Nov 5, 2024 • 13 • 4

liked a model 3 months ago

expertai/SLIMER-LLaMA3

Text Generation • Updated Nov 5, 2024 • 14 • 2

upvoted a paper 4 months ago

SLIMER-IT: Zero-Shot NER on Italian Language

Paper • 2409.15933 • Published Sep 24, 2024 • 4

New activity in azugarini/crossword-clues-QA 4 months ago

Librarian Bot: Add language metadata for dataset

#2 opened 4 months ago by

librarian-bot

updated a dataset 4 months ago

azugarini/crossword-clues-QA

Viewer • Updated Sep 24, 2024 • 1.17k • 34

upvoted a paper 5 months ago

Amuro & Char: Analyzing the Relationship between Pre-Training and Fine-Tuning of Large Language Models

Paper • 2408.06663 • Published Aug 13, 2024 • 16

upvoted a paper 6 months ago

Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses

Paper • 2408.00584 • Published Aug 1, 2024 • 6

updated a dataset 6 months ago

azugarini/CapMIT1003

Updated Jul 14, 2024 • 30

liked a model 6 months ago

expertai/LLaMAntino-3-SLIMER-IT

Text Generation • Updated Sep 25, 2024 • 30 • 3

updated a collection 6 months ago

Clue-instruct

Collection

Clue-instruct dataset and different models fine-tuned on it. • 8 items • Updated Nov 8, 2024

updated a dataset 6 months ago

azugarini/clue-instruct

Viewer • Updated Jul 11, 2024 • 44.1k • 35

updated 4 models 6 months ago

authored a paper 6 months ago

Dynamic Few-Shot Learning for Knowledge Graph Question Answering

Paper • 2407.01409 • Published Jul 1, 2024

upvoted a paper 6 months ago

DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

Paper • 2309.03883 • Published Sep 7, 2023 • 34

updated a collection 7 months ago

Tokenizer Adaptation

Collection

Collection of research on tokenizers' adaptation to specific domains and/or languages. Special focus on sequence compression directions • 4 items • Updated Jul 6, 2024