Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
22.1
TFLOPS
27
12
319
Carlo Moro
cnmoro
Follow
alamios's profile picture
gabrielmotablima's profile picture
noxinc's profile picture
38 followers
·
78 following
https://www.linkedin.com/in/carlo-moro-4a20a7132/
cnmoro
AI & ML interests
I like small & fast models Trabalhando nos menores modelos em português que existem https://ko-fi.com/cnmoro
Recent Activity
liked
a model
about 10 hours ago
teapotai/teapotllm
reacted
to
tomaarsen
's
post
with 🔥
1 day ago
‼️Sentence Transformers v4.0 is out! You can now train and finetune reranker models with multi-GPU training, bf16 support, loss logging, callbacks & much more. I also prove that finetuning on your domain helps much more than you might think. 1️⃣ Reranker Training Refactor Reranker models can now be trained using an extensive trainer with a lot of powerful features: - MultiGPU Training (Data Parallelism (DP) and Distributed Data Parallelism (DDP)) - bf16 training support; loss logging - Evaluation datasets + evaluation loss - Improved callback support + an excellent Weights & Biases integration - Gradient checkpointing, gradient accumulation - Model card generation - Resuming from a training checkpoint without performance loss - Hyperparameter Optimization and much more! Read my detailed blogpost to learn about the components that make up this new training approach: https://huggingface.co/blog/train-reranker Notably, the release is fully backwards compatible: all deprecations are soft, meaning that they still work but emit a warning informing you how to upgrade. 2️⃣ New Reranker Losses - 11 new losses: - 2 traditional losses: BinaryCrossEntropy and CrossEntropy - 2 distillation losses: MSE and MarginMSE - 2 in-batch negatives losses: MNRL (a.k.a. InfoNCE) and CMNRL - 5 learning to rank losses: Lambda, p-ListMLE, ListNet, RankNet, ListMLE 3️⃣ New Reranker Documentation - New Training Overview, Loss Overview, API Reference docs - 5 new, 1 refactored training examples docs pages - 13 new, 6 refactored training scripts - Migration guides (2.x -> 3.x, 3.x -> 4.x) 4️⃣ Blogpost Alongside the release, I've written a blogpost where I finetune ModernBERT on a generic question-answer dataset. My finetunes easily outperform all general-purpose reranker models, even models 4x as big. Finetuning on your domain is definitely worth it: https://huggingface.co/blog/train-reranker See the full release notes here: https://github.com/UKPLab/sentence-transformers/releases/v4.0.1
updated
a dataset
1 day ago
cnmoro/reasoning-v1-20m-portuguese
View all activity
Organizations
cnmoro
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
about 10 hours ago
teapotai/teapotllm
Text2Text Generation
•
Updated
3 days ago
•
5.03k
•
•
86
liked
a Space
1 day ago
Running
1
1
ISOM5240 Assignment1
🐢
Storytelling Application using Hugging Face Pipelines
liked
2 models
2 days ago
Remade-AI/Super-Saiyan
Image-to-Video
•
Updated
8 days ago
•
219
•
7
Qwen/Qwen2.5-Omni-7B
Any-to-Any
•
Updated
about 9 hours ago
•
16.3k
•
747
liked
a dataset
2 days ago
cnmoro/reasoning-v1-20m-portuguese
Viewer
•
Updated
1 day ago
•
5.51M
•
45
•
3
liked
a model
6 days ago
manycore-research/SpatialLM-Llama-1B
Text Generation
•
Updated
8 days ago
•
6.85k
•
764
liked
a model
7 days ago
cnmoro/ptt5-base-ptbr-summarization
Summarization
•
Updated
Nov 10, 2023
•
509
•
•
3
liked
a model
8 days ago
mlabonne/gemma-3-1b-it-abliterated
Image-Text-to-Text
•
Updated
7 days ago
•
196
•
3
liked
2 models
9 days ago
stabilityai/stable-virtual-camera
Image-to-Video
•
Updated
9 days ago
•
8.12k
•
147
BlinkDL/rwkv7-g1
Text Generation
•
Updated
4 days ago
•
70
liked
a dataset
10 days ago
glaiveai/reasoning-v1-20m
Viewer
•
Updated
9 days ago
•
22.2M
•
6.31k
•
122
liked
a model
11 days ago
mistralai/Mistral-Small-3.1-24B-Instruct-2503
Image-Text-to-Text
•
Updated
6 days ago
•
102k
•
1.01k
liked
a model
12 days ago
Felladrin/Qwen2-96M
Text Generation
•
Updated
13 days ago
•
803
•
3
liked
a model
13 days ago
cnmoro/Tucano-160m-Portuguese-Instruct-v2
Text Generation
•
Updated
13 days ago
•
14
•
1
liked
a model
15 days ago
CohereForAI/c4ai-command-a-03-2025
Text Generation
•
Updated
8 days ago
•
20.8k
•
312
liked
2 models
17 days ago
google/gemma-3-1b-it
Text Generation
•
Updated
7 days ago
•
290k
•
235
RekaAI/reka-flash-3
Updated
15 days ago
•
4.97k
•
339
liked
a model
19 days ago
cnmoro/Tucano-160m-Portuguese-Instruct
Text Generation
•
Updated
19 days ago
•
36
•
1
liked
a model
22 days ago
Tower-Babel/Babel-9B
Updated
23 days ago
•
4.19k
•
21
liked
a model
25 days ago
cnmoro/Qwen2.5-0.5B-Portuguese-v2
Text Generation
•
Updated
26 days ago
•
19
•
1
Load more