Wi-zz
/

florence-2-auto-crop

Model card Files Files and versions

xet

Community

Wi-zz commited on Jun 10

Commit

3cd611d

verified ·

1 Parent(s): 42ce57b

Upload README.md

Browse files

Files changed (1) hide show

README.md +199 -0

README.md ADDED Viewed

	@@ -0,0 +1,199 @@

+# Magic Crop ✨🪄
+A powerful AI-powered image cropping tool that intelligently crops images to preserve maximum pixels while removing unwanted elements like watermarks. Uses the Florence-2-large vision-language model for accurate object detection and smart cropping decisions.
+## 🌟 Features
+- **AI-Powered Detection**: Uses Florence-2-large model for accurate object detection
+- **Smart Cropping**: Automatically determines the best crop direction (above, below, left, or right)
+- **Object-Aware Cropping**: Optional feature to avoid cropping other important objects
+- **Batch Processing**: Process multiple images efficiently with configurable batch sizes
+- **Recursive Folder Processing**: Handle entire directory structures
+- **Customizable Prompts**: Detect any object type, not just watermarks
+- **Crop Threshold Protection**: Skip images that would require excessive cropping
+- **Error Handling**: Robust error handling with optional file organization
+- **GPU Acceleration**: Automatic CUDA detection for faster processing
+## 📋 Requirements
+- Python 3.7+
+- PyTorch
+- Transformers
+- PIL (Pillow)
+- tqdm
+- Florence-2-large model (local model provided)
+## 🛠️ Installation
+1. **Install Python dependencies**:
+```bash
+pip install requirements.txt
+```
+2. **Download Florence-2-large model**:
+    - Place the Florence-2-large model files in a `./Florence-2-large/` directory relative to the script (automatically downloaded with huggingface-cli)
+    - The script expects local model files (`local_files_only=True`)
+## 🚀 Usage
+### Basic Usage
+```bash
+python crop.py input_image.jpg --prompt "watermark"
+```
+### Process Multiple Images
+```bash
+python crop.py image1.jpg image2.jpg image3.jpg -o output_folder
+```
+### Process Folders Recursively
+```bash
+python crop.py /path/to/images/ -r -o /path/to/output/
+```
+### Advanced Usage with Object-Aware Cropping
+```bash
+python crop.py /path/to/images/ -r -o output/ --object-aware --crop-threshold 15 --prompt "logo"
+```
+## 📝 Command-Line Arguments
+| Argument | Type | Default | Description |
+| :-- | :-- | :-- | :-- |
+| `input_paths` | str+ | Required | Paths to input images or folders |
+| `-r, --recursive` | flag | False | Process folders recursively |
+| `-o, --output_folder` | str | Input path | Output folder path |
+| `--bs` | int | 1 | Batch size for processing |
+| `--prompt` | str | "Watermark" | Object detection prompt |
+| `--object-aware` | flag | False | Enable object-aware cropping |
+| `--crop-threshold` | float | 20.0 | Max crop area percentage threshold |
+| `--move-skipped` | flag | False | Copy skipped files to '_Skipped_' folder |
+| `--move-errored` | flag | False | Copy errored files to '_Errored_' folder |
+| `--debug` | flag | False | Enable debug output |
+## 🎯 How It Works
+### 1. Object Detection
+The script uses Florence-2-large to detect objects matching your prompt in the image.
+### 2. Smart Cropping Logic
+- **Corner Distance Priority**: Prefers objects closer to image corners
+- **Size Consideration**: Among corner objects, prefers smaller ones
+- **Crop Direction**: Calculates crop area for all four directions (above, below, left, right)
+- **Maximum Preservation**: Chooses the direction that preserves the most pixels
+### 3. Object-Aware Mode
+When enabled, the script:
+- Detects all objects in the image
+- Calculates how many object pixels would be lost with each crop direction
+- Chooses the crop that minimizes object pixel loss
+### 4. Safety Thresholds
+- Skips images where cropping would remove more than the threshold percentage
+- Default threshold: 20% of total image area
+## 📁 Output Structure
+```
+output_folder/
+├── original_image_crop_above.jpg
+├── another_image_crop_left.jpg
+├── _Skipped_/           # (if --move-skipped enabled)
+│   └── skipped_files...
+└── _Errored_/           # (if --move-errored enabled)
+    └── errored_files...
+```
+## 💡 Examples
+### Remove Watermarks
+```bash
+python crop.py watermarked_images/ -r -o clean_images/ --prompt "watermark"
+```
+### Remove Logos with Object Protection
+```bash
+python crop.py branded_images/ -r --object-aware --prompt "logo" --crop-threshold 10
+```
+### Process with Error Organization
+```bash
+python crop.py images/ -r --move-skipped --move-errored --debug
+```
+### High-Performance Batch Processing
+```bash
+python crop.py large_dataset/ -r --bs 4 -o processed/ --crop-threshold 25
+```
+## ⚠️ Important Notes
+- **Model Requirement**: Ensure Florence-2-large model is properly installed in `./Florence-2-large/`
+- **Memory Usage**: Larger batch sizes require more GPU/CPU memory
+- **Quality**: Output images are saved as JPEG with 98% quality
+- **File Formats**: Supports PNG, JPG, JPEG, GIF, BMP input formats
+- **GPU Recommended**: CUDA-capable GPU significantly speeds up processing
+## 🐛 Troubleshooting
+### Common Issues
+1. **Model Loading Error**: Ensure Florence-2-large is in the correct directory
+2. **CUDA Out of Memory**: Reduce batch size (`--bs 1`)
+3. **No Objects Detected**: Try different prompts or check image quality
+4. **Large Crop Areas**: Adjust `--crop-threshold` value
+### Debug Mode
+Use `--debug` flag to see:
+- Detailed object detection results
+- Crop calculations
+- Processing decisions
+## 📊 Performance Tips
+- Use GPU for faster processing
+- Increase batch size for bulk processing (if memory allows)
+- Enable `--object-aware` only when necessary (slower but more accurate)
+- Use appropriate crop thresholds to avoid processing unsuitable images
+## 🤝 Contributing
+Feel free to submit issues, feature requests, or pull requests to improve this tool!
+## 📄 License
+This project uses the Florence-2-large model. Please check the model's license terms for usage restrictions.
+<div style="text-align: center">⁂</div>