nubahador commited on
Commit
e545d00
·
verified ·
1 Parent(s): 0a34130

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +71 -28
README.md CHANGED
@@ -2,33 +2,6 @@
2
  license: mit
3
  ---
4
 
5
- ### Vision Transformer (ViT) with LoRA for Spectrogram Regression
6
-
7
- ---
8
- ### Fine-Tuning Details
9
-
10
- | Category | Specification |
11
- |-----------------------|---------------------------------------------------------------------------------------------------|
12
- | **Framework** | PyTorch |
13
- | **Architecture** | Pre-trained Vision Transformer (ViT) |
14
- | **Adaptation Method** | LoRA (Low-Rank Adaptation) |
15
- | **Task** | Regression on time-frequency representations |
16
- | **Target Variables** | 1. Chirp start time (ms)<br>2. Start frequency (kHz)<br>3. End frequency (kHz) |
17
- | **Training Protocol** | • Automatic Mixed Precision (AMP)<br>• Early stopping<br>• Learning Rate scheduling |
18
- | **Output** | Quantitative predictions + optional natural language descriptions |
19
-
20
- ---
21
- ### Resource Details
22
-
23
- | Resource | Description | Link |
24
- |----------|-------------|------|
25
- | Trained Vision Transformer Model | Access to a pre-trained Vision Transformer model fine-tuned on synthetic spectrograms for chirp localization | [HuggingFace Model Hub](https://huggingface.co/nubahador/Fine_Tuned_Transformer_Model_for_Chirp_Localization/tree/main) |
26
- | Synthetic Spectrogram Dataset | Download link for 100,000 synthetic spectrograms with corresponding labels for chirp localization | [HuggingFace Dataset Hub](https://huggingface.co/datasets/nubahador/ChirpLoc100K___A_Synthetic_Spectrogram_Dataset_for_Chirp_Localization/tree/main) |
27
- | PyTorch Implementation | Repository containing the PyTorch code for fine-tuning the Vision Transformer on spectrograms | [Implementation GitHub Repository](https://github.com/nbahador/Train_Spectrogram_Transformer) |
28
- | Synthetic Chirp Generator | Python package for generating synthetic chirp spectrograms (images with corresponding labels) | [Dataset GitHub Repository](https://github.com/nbahador/chirp_spectrogram_generator) |
29
-
30
- ---
31
-
32
  <div style="display: flex; flex-wrap: wrap; gap: 15px; margin-top: 15px;">
33
  <div style="flex: 1; min-width: 200px; background: white; border-radius: 8px; padding: 15px; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
34
  <h4 style="margin-top: 0; color: #5f6368;">🧑‍💻 Curated by</h4>
@@ -43,6 +16,76 @@ license: mit
43
  <p>MIT</p>
44
  </div>
45
  </div>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
46
  </div>
47
 
48
  <div style="background: #f8f9fa; border-radius: 8px; padding: 20px; margin-bottom: 20px; border-left: 4px solid #ea4335;">
@@ -86,7 +129,7 @@ license: mit
86
 
87
  <div style="background: #f8f9fa; border-radius: 8px; padding: 20px; border-left: 4px solid #00bcd4;">
88
  <h2 style="margin-top: 0;">ℹ️ More Information</h2>
89
- <p>For more information and generation code, visit the <a href="https://github.com/nbahador/chirp_spectrogram_generator/tree/main">GitHub repository</a>.</p>
90
 
91
  <div style="margin-top: 15px; padding-top: 15px; border-top: 1px solid #e0e0e0;">
92
  <h4 style="margin-bottom: 5px;">Dataset Card Authors</h4>
 
2
  license: mit
3
  ---
4
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
  <div style="display: flex; flex-wrap: wrap; gap: 15px; margin-top: 15px;">
6
  <div style="flex: 1; min-width: 200px; background: white; border-radius: 8px; padding: 15px; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
7
  <h4 style="margin-top: 0; color: #5f6368;">🧑‍💻 Curated by</h4>
 
16
  <p>MIT</p>
17
  </div>
18
  </div>
19
+
20
+ <div style="background: #f8f9fa; border-radius: 8px; padding: 20px; margin: 20px 0; border-left: 4px solid #4285f4;">
21
+ <h2 style="margin-top: 0;">🔍 Model Architecture</h2>
22
+ <div style="background: white; border-radius: 8px; padding: 15px; box-shadow: 0 2px 4px rgba(0,0,0,0.1);">
23
+ <h3 style="margin-top: 0;">Vision Transformer (ViT) with LoRA for Spectrogram Regression</h3>
24
+
25
+ <div style="margin-bottom: 15px;">
26
+ <h4 style="margin-bottom: 10px;">Fine-Tuning Details</h4>
27
+ <table style="width: 100%; border-collapse: collapse;">
28
+ <tr>
29
+ <td style="padding: 8px; border-bottom: 1px solid #eee; width: 30%;"><strong>Framework</strong></td>
30
+ <td style="padding: 8px; border-bottom: 1px solid #eee;">PyTorch</td>
31
+ </tr>
32
+ <tr>
33
+ <td style="padding: 8px; border-bottom: 1px solid #eee;"><strong>Architecture</strong></td>
34
+ <td style="padding: 8px; border-bottom: 1px solid #eee;">Pre-trained Vision Transformer (ViT)</td>
35
+ </tr>
36
+ <tr>
37
+ <td style="padding: 8px; border-bottom: 1px solid #eee;"><strong>Adaptation Method</strong></td>
38
+ <td style="padding: 8px; border-bottom: 1px solid #eee;">LoRA (Low-Rank Adaptation)</td>
39
+ </tr>
40
+ <tr>
41
+ <td style="padding: 8px; border-bottom: 1px solid #eee;"><strong>Task</strong></td>
42
+ <td style="padding: 8px; border-bottom: 1px solid #eee;">Regression on time-frequency representations</td>
43
+ </tr>
44
+ <tr>
45
+ <td style="padding: 8px; border-bottom: 1px solid #eee;"><strong>Target Variables</strong></td>
46
+ <td style="padding: 8px; border-bottom: 1px solid #eee;">
47
+ 1. Chirp start time (ms)<br>
48
+ 2. Start frequency (kHz)<br>
49
+ 3. End frequency (kHz)
50
+ </td>
51
+ </tr>
52
+ <tr>
53
+ <td style="padding: 8px; border-bottom: 1px solid #eee;"><strong>Training Protocol</strong></td>
54
+ <td style="padding: 8px; border-bottom: 1px solid #eee;">
55
+ • Automatic Mixed Precision (AMP)<br>
56
+ • Early stopping<br>
57
+ • Learning Rate scheduling
58
+ </td>
59
+ </tr>
60
+ <tr>
61
+ <td style="padding: 8px;"><strong>Output</strong></td>
62
+ <td style="padding: 8px;">Quantitative predictions + optional natural language descriptions</td>
63
+ </tr>
64
+ </table>
65
+ </div>
66
+
67
+ <div>
68
+ <h4 style="margin-bottom: 10px;">Resource Details</h4>
69
+ <table style="width: 100%; border-collapse: collapse;">
70
+ <tr>
71
+ <td style="padding: 8px; border-bottom: 1px solid #eee; width: 30%;"><strong>Trained Vision Transformer Model</strong></td>
72
+ <td style="padding: 8px; border-bottom: 1px solid #eee;"><a href="https://huggingface.co/nubahador/Fine_Tuned_Transformer_Model_for_Chirp_Localization/tree/main">HuggingFace Model Hub</a></td>
73
+ </tr>
74
+ <tr>
75
+ <td style="padding: 8px; border-bottom: 1px solid #eee;"><strong>Synthetic Spectrogram Dataset</strong></td>
76
+ <td style="padding: 8px; border-bottom: 1px solid #eee;"><a href="https://huggingface.co/datasets/nubahador/ChirpLoc100K___A_Synthetic_Spectrogram_Dataset_for_Chirp_Localization/tree/main">HuggingFace Dataset Hub</a></td>
77
+ </tr>
78
+ <tr>
79
+ <td style="padding: 8px; border-bottom: 1px solid #eee;"><strong>PyTorch Implementation</strong></td>
80
+ <td style="padding: 8px; border-bottom: 1px solid #eee;"><a href="https://github.com/nbahador/Train_Spectrogram_Transformer">Implementation GitHub Repository</a></td>
81
+ </tr>
82
+ <tr>
83
+ <td style="padding: 8px;"><strong>Synthetic Chirp Generator</strong></td>
84
+ <td style="padding: 8px;"><a href="https://github.com/nbahador/chirp_spectrogram_generator">Dataset GitHub Repository</a></td>
85
+ </tr>
86
+ </table>
87
+ </div>
88
+ </div>
89
  </div>
90
 
91
  <div style="background: #f8f9fa; border-radius: 8px; padding: 20px; margin-bottom: 20px; border-left: 4px solid #ea4335;">
 
129
 
130
  <div style="background: #f8f9fa; border-radius: 8px; padding: 20px; border-left: 4px solid #00bcd4;">
131
  <h2 style="margin-top: 0;">ℹ️ More Information</h2>
132
+ <p>For more information and generation code, visit the <a href="https://github.com/nbahador/Train_Spectrogram_Transformer">GitHub repository</a>.</p>
133
 
134
  <div style="margin-top: 15px; padding-top: 15px; border-top: 1px solid #e0e0e0;">
135
  <h4 style="margin-bottom: 5px;">Dataset Card Authors</h4>