Gemma 3 4B Fine-Tuned for Object Detection

This model is a fine-tuned version of Gemma 3 4B for license plate object detection.

Detected License Plates (Sample 1) Detected License Plates (Sample 2)
image/png image/png
image/png image/png

Model Details

Model Description

This model aims to prove that VLMs NOT previously trained for object detection and without previous knowledge of location tokens (<locXXXX>) can still be fine tuned for object detection out of the box. This is an experimental model.

Model Sources

Uses

Follow these steps to configure, train, and run predictions (using the code repository):

  1. Configuration (config.py): All major parameters are centralized here. Before running any script, review and adjust these settings as needed.
  2. Training (train.py): This script handles the fine-tuning process.
  3. Running inference (infer.py): Run this to visualize object detection.

Citation

If you use our work, please cite us:

@misc{gosthipaty_gemma3_object_detection_2025,
  author = {Aritra Roy Gosthipaty and Sergio Paniego},
  title = {Fine-tuning Gemma 3 for Object Detection},
  year = {2025},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/ariG23498/gemma3-object-detection.git}}
}
Downloads last month
131
Safetensors
Model size
4.3B params
Tensor type
BF16
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for ariG23498/gemma-3-4b-pt-object-detection

Finetuned
(104)
this model

Dataset used to train ariG23498/gemma-3-4b-pt-object-detection

Spaces using ariG23498/gemma-3-4b-pt-object-detection 2

Collection including ariG23498/gemma-3-4b-pt-object-detection