twins_svt-gravit-c2 / README.md

parlange

Upload Twins_SVT model from experiment c2

63b9bb2 verified 29 days ago

preview code

raw

history blame contribute delete

5.67 kB

metadata

license: apache-2.0
tags:
  - image-classification
  - pytorch
  - timm
  - twins_svt
  - vision-transformer
  - transformer
  - gravitational-lensing
  - strong-lensing
  - astronomy
  - astrophysics
datasets:
  - parlange/gravit-c21-j24
metrics:
  - accuracy
  - auc
  - f1
paper:
  - title: 'GraViT: A Gravitational Lens Discovery Toolkit with Vision Transformers'
    url: https://arxiv.org/abs/2509.00226
    authors: Parlange et al.
model-index:
  - name: Twins_SVT-c2
    results:
      - task:
          type: image-classification
          name: Strong Gravitational Lens Discovery
        dataset:
          type: common-test-sample
          name: Common Test Sample (More et al. 2024)
        metrics:
          - type: accuracy
            value: 0.8012
            name: Average Accuracy
          - type: auc
            value: 0.8481
            name: Average AUC-ROC
          - type: f1
            value: 0.5502
            name: Average F1-Score

🌌 twins_svt-gravit-c2

🔭 This model is part of GraViT: Transfer Learning with Vision Transformers and MLP-Mixer for Strong Gravitational Lens Discovery

🔗 GitHub Repository: https://github.com/parlange/gravit

🛰️ Model Details

🤖 Model Type: Twins_SVT
🧪 Experiment: C2 - C21+J24-half
🌌 Dataset: C21+J24
🪐 Fine-tuning Strategy: half

💻 Quick Start

import torch
import timm

# Load the model directly from the Hub
model = timm.create_model(
    'hf-hub:parlange/twins_svt-gravit-c2',
    pretrained=True
)
model.eval()

# Example inference
dummy_input = torch.randn(1, 3, 224, 224)
with torch.no_grad():
    output = model(dummy_input)
    predictions = torch.softmax(output, dim=1)
print(f"Lens probability: {predictions[0][1]:.4f}")

⚡️ Training Configuration

Training Dataset: C21+J24 (Cañameras et al. 2021 + Jaelani et al. 2024)
Fine-tuning Strategy: half

🔧 Parameter	📝 Value
Batch Size	192
Learning Rate	AdamW with ReduceLROnPlateau
Epochs	100
Patience	10
Optimizer	AdamW
Scheduler	ReduceLROnPlateau
Image Size	224x224
Fine Tune Mode	half
Stochastic Depth Probability	0.1

📈 Training Curves

🏁 Final Epoch Training Metrics

Metric	Training	Validation
📉 Loss	0.2950	0.2921
🎯 Accuracy	0.8781	0.8943
📊 AUC-ROC	0.9481	0.9535
⚖️ F1 Score	0.8778	0.8953

☑️ Evaluation Results

ROC Curves and Confusion Matrices

Performance across all test datasets (a through l) in the Common Test Sample (More et al. 2024):

📋 Performance Summary

Average performance across 12 test datasets from the Common Test Sample (More et al. 2024):

Metric	Value
🎯 Average Accuracy	0.8012
📈 Average AUC-ROC	0.8481
⚖️ Average F1-Score	0.5502

📘 Citation

If you use this model in your research, please cite:

@misc{parlange2025gravit,
      title={GraViT: Transfer Learning with Vision Transformers and MLP-Mixer for Strong Gravitational Lens Discovery}, 
      author={René Parlange and Juan C. Cuevas-Tello and Octavio Valenzuela and Omar de J. Cabrera-Rosas and Tomás Verdugo and Anupreeta More and Anton T. Jaelani},
      year={2025},
      eprint={2509.00226},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2509.00226}, 
}

Model Card Contact

For questions about this model, please contact the author through: https://github.com/parlange/