scfive
/

gladi8behavev1

Model card Files Files and versions

xet

Community

scfive commited on May 30

Commit

6c17d81

verified ·

1 Parent(s): 0638912

Upload README.md with huggingface_hub

Browse files

Files changed (1) hide show

README.md +57 -15

README.md CHANGED Viewed

@@ -1,15 +1,57 @@
-typing-extensions>=4.5.0
-ultralytics==8.0.196
-opencv-python==4.8.1.78
-numpy==1.24.3
-torch==2.1.0
-torchvision==0.16.0
-transformers==4.30.2
-Pillow==10.0.0
-networkx==3.1
-openai==1.3.0
-torch-geometric==2.3.1
-protobuf==3.20.3
-tensorboard==2.13.0
-gradio==4.0.0
-supervision==0.3.0

+---
+title: Glad8tr Video Analysis
+emoji: 🎥
+colorFrom: blue
+colorTo: indigo
+sdk: gradio
+sdk_version: 4.0.0
+app_file: app.py
+pinned: false
+---
+# Glad8tr Video Analysis
+This is a video analysis application that uses computer vision to detect objects, analyze poses, and provide cognitive state analysis. The application is deployed on Hugging Face Spaces.
+## Features
+- Object detection and tracking
+- Pose estimation and analysis
+- Scene context analysis
+- Cognitive state analysis
+- Real-time video processing
+## Usage
+1. Upload a video file (supported formats: MP4, AVI, MOV)
+2. The application will process the video and provide:
+   - Object detection results
+   - Pose analysis
+   - Scene context information
+   - Cognitive state analysis
+3. Download the processed video with annotations
+## Technical Details
+- Built with PyTorch and YOLOv8
+- Uses Gradio for the web interface
+- Optimized for Hugging Face Spaces deployment
+- Processes videos in real-time with frame sampling
+## Limitations
+- Processing is limited to the first 100 frames for demo purposes
+- Maximum video resolution: 1920x1080
+- Processing time depends on video length and complexity
+## Model Information
+The application uses two YOLOv8 models:
+1. Object detection model (glad8trv8s.pt)
+2. Pose estimation model (glad8trv8s-pose.pt)
+If the custom models are not available, the application will fall back to the base YOLOv8 models.
+## License
+This project is licensed under the MIT License.