Pruned Falcon-11B variant optimized for 13 European languages through layer selection.
S. Smits
ssmits
AI & ML interests
Synthetic data, on-premise / local language-specific LLMs, optimizing embeddings models retrieval
Recent Activity
updated
a model
37 minutes ago
ssmits/Qwen2.5-7B-embed-base
updated
a model
41 minutes ago
ssmits/Qwen2.5-7B-Instruct-embed-base
upvoted
an
article
17 days ago
Fine-tune ModernBERT for text classification using synthetic data
Organizations
Collections
3
models
67
ssmits/Qwen2.5-7B-embed-base
Text Classification
•
Updated
•
7
ssmits/Qwen2.5-7B-Instruct-embed-base
Text Classification
•
Updated
•
9
•
1
ssmits/ModernBERT-base-dutch-test
Updated
•
18
•
1
ssmits/Qwen2-7B-embed-base
Feature Extraction
•
Updated
•
18
ssmits/Zamba2-1.2B
Updated
•
2
ssmits/Zamba2-1.2B-instruct-Dutch
Text Generation
•
Updated
•
28
•
1
ssmits/Llama-3.1-Nemotron-92B-Instruct-HF-late-GGUF
Updated
ssmits/Llama-3.1-Nemotron-92B-Instruct-HF-early-GGUF
Updated
ssmits/Llama-3.1-Nemotron-92B-Instruct-HF-late
Text Generation
•
Updated
•
102
•
2
ssmits/Llama-3.1-Nemotron-92B-Instruct-HF-early
Text Generation
•
Updated
•
163
•
2
datasets
6
ssmits/fineweb-2-dutch
Viewer
•
Updated
•
83.5M
•
661
•
2
ssmits/processed-falcon-dutch-dataset
Viewer
•
Updated
•
450k
•
611
ssmits/tokenized-falcon2-dutch-2048
Viewer
•
Updated
•
1.91M
•
142
ssmits/tokenized-llama3-dutch-2048
Viewer
•
Updated
•
1.27M
•
1.2k
ssmits/tokenized-falcon2-dutch-4096
Viewer
•
Updated
•
1.05M
•
534
ssmits/small-test-dataset
Viewer
•
Updated
•
21.8k
•
34