Upload README.md with huggingface_hub
Browse files
README.md
ADDED
@@ -0,0 +1,9 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
|
2 |
+
# EnhancedViTUNet for Front-to-BEV Prediction
|
3 |
+
|
4 |
+
This model takes a **front-view RGB image** and predicts a **Bird’s-Eye View (BEV)** image.
|
5 |
+
|
6 |
+
- **Architecture**: Vision Transformer (ViT) encoder + U-Net style decoder
|
7 |
+
- **Training**: On synthetic Gazebo11 simulation dataset with ROI-masked L1 + perceptual VGG loss
|
8 |
+
- **Input size**: 384×384 RGB
|
9 |
+
- **Output size**: 384×384 RGB BEV
|