Wilbur1240 commited on
Commit
258be65
·
verified ·
1 Parent(s): b5256f3

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+
2
+ # EnhancedViTUNet for Front-to-BEV Prediction
3
+
4
+ This model takes a **front-view RGB image** and predicts a **Bird’s-Eye View (BEV)** image.
5
+
6
+ - **Architecture**: Vision Transformer (ViT) encoder + U-Net style decoder
7
+ - **Training**: On synthetic Gazebo11 simulation dataset with ROI-masked L1 + perceptual VGG loss
8
+ - **Input size**: 384×384 RGB
9
+ - **Output size**: 384×384 RGB BEV