gajeshladhar committed
Commit d4cb709 · 1 Parent(s): e28003d

readme.md updated

Files changed (2)
  1. README.md +27 -5
  2. assets/downstream-deepglobe-roads.png +0 -0
README.md CHANGED
@@ -52,16 +52,38 @@ loss = DinoSpatialLoss(student_feat, teacher_feat)
 
 ---
 
-## 📈 Performance: Latent Quality vs Supervised YOLO
+## 📈 Performance: Latent Quality & Downstream Evaluation
 
-Despite not using any labels, `core-dino` shows **higher latent alignment and semantic grouping** than supervised YOLO encoders.
+Despite being trained without any labels, `core-dino` demonstrates strong latent alignment and generalization capability, both in visual similarity and downstream tasks.
 
-📊 Embedding similarity visualizations show:
-- better clustering of semantically similar regions
-- stronger consistency across resolution changes
+### 🔍 1. Embedding Similarity (Unsupervised)
+
+Visual inspection of patch-level embedding similarity shows that `core-dino`:
+- captures finer semantic structure than supervised YOLO
+- is robust across real-world resolution shifts
 
 ![Embedding Comparison](assets/embed-sim-vs-yolo.png)
 
+---
+
+### 🛣️ 2. Downstream: Road Extraction (DeepGlobe Dataset)
+
+We evaluated `core-dino` on the [DeepGlobe Road Extraction Dataset](https://competitions.codalab.org/competitions/18467#learn_the_details), using it as a frozen backbone in a simple segmentation pipeline.
+
+- **Setup:**
+  - Both `core-dino` and YOLO backbones were **frozen**
+  - Only a **2-layer convolutional head** was trained
+  - Task: Binary road segmentation (IoU loss)
+
+- **Result:**
+  - `core-dino` consistently outperformed the supervised YOLO backbone across all epochs
+  - Shows superior feature quality in transfer settings
+
+<p align="center">
+<img src="assets/downstream-deepglobe-roads.png" alt="Downstream Performance" width="600">
+</p>
+
+
 ---
 
 ## 🗂️ Model Details
assets/downstream-deepglobe-roads.png ADDED
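
The README change above names an "IoU loss" for binary road segmentation but does not spell out its form. As a rough illustration only, the sketch below uses the common soft IoU (Jaccard) formulation on flattened per-pixel road probabilities. The function names `soft_iou_loss` and `hard_iou`, the `eps` smoothing term, and the 0.5 threshold are assumptions made for this example and are not taken from the `core-dino` repository.

```python
from typing import Sequence


def soft_iou_loss(probs: Sequence[float], targets: Sequence[float], eps: float = 1e-6) -> float:
    """Return 1 - soft IoU between predicted road probabilities and a binary mask.

    Assumed formulation (not confirmed by the README):
    intersection = sum(p * t), union = sum(p + t - p * t).
    """
    if len(probs) != len(targets):
        raise ValueError("probs and targets must have the same length")
    intersection = sum(p * t for p, t in zip(probs, targets))
    union = sum(p + t - p * t for p, t in zip(probs, targets))
    # eps keeps the ratio defined when both prediction and mask are empty.
    return 1.0 - (intersection + eps) / (union + eps)


def hard_iou(probs: Sequence[float], targets: Sequence[float], threshold: float = 0.5) -> float:
    """Return the thresholded IoU, the kind of metric a per-epoch comparison might report."""
    preds = [1.0 if p >= threshold else 0.0 for p in probs]
    intersection = sum(p * t for p, t in zip(preds, targets))
    union = sum(max(p, t) for p, t in zip(preds, targets))
    # Two empty masks are treated as a perfect match.
    return intersection / union if union else 1.0


if __name__ == "__main__":
    # Four pixels: one true road pixel, one false positive.
    probs = [0.9, 0.2, 0.7, 0.1]
    mask = [1.0, 0.0, 0.0, 0.0]
    print(soft_iou_loss(probs, mask))
    print(hard_iou(probs, mask))
```

In a setup like the one described, the same arithmetic would presumably be applied to the output of the small convolutional head, with the backbone weights excluded from gradient updates so that only the head is trained.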