Spaces:

tuandunghcmut
/

corgi-qwen3-vl-demo

Runtime error

corgi-qwen3-vl-demo / PROJECT_PLAN.md

dung-vpt-uney

Deploy latest CoRGI Gradio demo

b6a01d6 18 days ago

2.84 kB

CoRGI Custom Demo — Project Plan

Objective: ship a runnable CoRGI demo (CLI + Gradio) powered entirely by Qwen3-VL for structured reasoning, ROI evidence extraction, and answer synthesis.
Scope: stay within the corgi_custom package, reuse Qwen3-VL cookbooks where possible, keep dependency footprint minimal (no extra detectors/rerankers).
Environment: Conda env pytorch, default VLM Qwen/Qwen3-VL-8B-Thinking.

Status	Milestone	Notes
✅	Core pipeline skeleton (dataclasses, parsers, Qwen client wrappers)	Already merged in repo.
✅	Project documentation & progress tracking scaffolding	Plan + progress log committed.
✅	CLI runner that prints step-by-step pipeline output	Supports overrides + JSON export.
✅	Gradio demo mirroring CLI functionality	Blocks UI with markdown report messaging.
✅	Automated tests for new modules	CLI + Gradio helpers covered with unit tests.
✅	HF Space deployment automation	Bash script + app harness for zerogpu Spaces.
🟡	Final verification (unit tests, smoke instructions)	Document how to run `pytest` and the demos.

Docs & Tracking
- Finalize plan and progress log templates.
- Document environment setup expectations.
Pipeline UX
- Implement CLI entrypoint (corgi.cli:main).
- Provide structured stdout for steps/evidence/answer.
- Allow optional JSON dump for downstream tooling.
Interactive Demo
- Build Gradio app harness (image upload + question textbox).
- Stream progress (optional) and display textual reasoning/evidence.
- Handle model loading errors gracefully.
Testing & Tooling
- Add fixture-friendly helpers to avoid heavy model loads in tests.
- Write unit tests for CLI argument parsing + formatting.
- Add regression test for pipeline serialization.
Docs & Hand-off
- Update README/demo instructions.
- Provide sample command sequences for CLI/Gradio.
- Capture open risks & future enhancements.
Deployment & Ops
- Add Hugging Face Space entrypoint (app.py).
- Write deployment helper script (scripts/push_space.sh).
- Add automated checklists/logs for Space updates.

Model loading latency / VRAM → expose config knobs and mention 4B fallback.
Parsing drift from Qwen outputs → keep parser tolerant; add debug flag to dump raw responses.
Test runtime → mock Qwen client via fixtures; avoid loading real model in unit tests.