dung-vpt-uney committed on
Commit
454a159
·
1 Parent(s): f1e4aa7

Deploy latest CoRGI Gradio demo

Browse files
PROGRESS_LOG.md CHANGED
@@ -13,6 +13,7 @@
13
  - Added ZeroGPU support: cached model/processor globals live on CUDA when available, a `@spaces.GPU`-decorated executor handles pipeline runs, and requirements now include the `spaces` SDK.
14
  - Introduced structured logging for the app (`app.py`) and pipeline execution to trace model loads, cache hits, and Gradio lifecycle events on Spaces.
15
  - Reworked the Gradio UI to show per-step panels with annotated evidence galleries, giving each CoRGI reasoning step its own window alongside the final synthesized answer.
 
16
 
17
  ## 2024-10-21
18
  - Updated default checkpoints to `Qwen/Qwen3-VL-8B-Thinking` and verified CLI/Gradio/test coverage.
 
13
  - Added ZeroGPU support: cached model/processor globals live on CUDA when available, a `@spaces.GPU`-decorated executor handles pipeline runs, and requirements now include the `spaces` SDK.
14
  - Introduced structured logging for the app (`app.py`) and pipeline execution to trace model loads, cache hits, and Gradio lifecycle events on Spaces.
15
  - Reworked the Gradio UI to show per-step panels with annotated evidence galleries, giving each CoRGI reasoning step its own window alongside the final synthesized answer.
16
+ - Preloaded the default Qwen3-VL model/tokenizer at import so Spaces load the GPU weights before serving requests.
17
 
18
  ## 2024-10-21
19
  - Updated default checkpoints to `Qwen/Qwen3-VL-8B-Thinking` and verified CLI/Gradio/test coverage.
corgi/__pycache__/cli.cpython-313.pyc CHANGED
Binary files a/corgi/__pycache__/cli.cpython-313.pyc and b/corgi/__pycache__/cli.cpython-313.pyc differ
 
corgi/__pycache__/gradio_app.cpython-313.pyc CHANGED
Binary files a/corgi/__pycache__/gradio_app.cpython-313.pyc and b/corgi/__pycache__/gradio_app.cpython-313.pyc differ
 
corgi/gradio_app.py CHANGED
@@ -50,6 +50,20 @@ def _default_factory(model_id: Optional[str]) -> CoRGIPipeline:
50
  return CoRGIPipeline(vlm_client=Qwen3VLClient(config=config))
51
 
52
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
53
  def _get_pipeline(model_id: str, factory: Callable[[Optional[str]], CoRGIPipeline]) -> CoRGIPipeline:
54
  pipeline = _PIPELINE_CACHE.get(model_id)
55
  if pipeline is None:
@@ -387,6 +401,11 @@ def build_demo(
387
  global _GLOBAL_FACTORY
388
  _GLOBAL_FACTORY = factory
389
  logger.info("Registering pipeline factory %s", factory)
 
 
 
 
 
390
 
391
  with gr.Blocks(title="CoRGI Qwen3-VL Demo") as demo:
392
  state = gr.State() # stores PipelineState
 
50
  return CoRGIPipeline(vlm_client=Qwen3VLClient(config=config))
51
 
52
 
53
+ def _warm_default_pipeline() -> None:
54
+ if DEFAULT_MODEL_ID in _PIPELINE_CACHE:
55
+ return
56
+ try:
57
+ logger.info("Preloading default pipeline for model_id=%s", DEFAULT_MODEL_ID)
58
+ _PIPELINE_CACHE[DEFAULT_MODEL_ID] = _default_factory(DEFAULT_MODEL_ID)
59
+ except Exception as exc: # pragma: no cover - defensive
60
+ logger.exception("Failed to preload default model %s: %s", DEFAULT_MODEL_ID, exc)
61
+
62
+
63
+ _GLOBAL_FACTORY = _default_factory # type: ignore[assignment]
64
+ _warm_default_pipeline()
65
+
66
+
67
  def _get_pipeline(model_id: str, factory: Callable[[Optional[str]], CoRGIPipeline]) -> CoRGIPipeline:
68
  pipeline = _PIPELINE_CACHE.get(model_id)
69
  if pipeline is None:
 
401
  global _GLOBAL_FACTORY
402
  _GLOBAL_FACTORY = factory
403
  logger.info("Registering pipeline factory %s", factory)
404
+ try:
405
+ logger.info("Preloading pipeline with factory for model_id=%s", DEFAULT_MODEL_ID)
406
+ _PIPELINE_CACHE[DEFAULT_MODEL_ID] = factory(DEFAULT_MODEL_ID)
407
+ except Exception as exc: # pragma: no cover - defensive
408
+ logger.exception("Unable to preload pipeline via factory: %s", exc)
409
 
410
  with gr.Blocks(title="CoRGI Qwen3-VL Demo") as demo:
411
  state = gr.State() # stores PipelineState