Commit History

Update model lookup to use filename instead of ID in get_model function
9f9c6d5

alexfremont commited on

Improve model unloading with explicit GPU memory cleanup and CUDA cache clearing
8a8fe7c

alexfremont commited on

Add timestamps to memory monitoring logs and display outputs
8580232

alexfremont commited on

Improve Docker container memory monitoring with cgroup v1/v2 support and better logging
80a3544

alexfremont commited on

Improve read on machine stats
1c5ef91

alexfremont commited on

Improve system memory info formatting by using newlines instead of pipes
fa98542

alexfremont commited on

Remove periodic memory status updates and related helper function
1c12662

alexfremont commited on

Remove system status endpoint and monitoring functionality
e241d94

alexfremont commited on

Add memory status display with auto-refresh every 10 seconds
078f3c3

alexfremont commited on

Add system monitoring features and memory usage tracking for loaded models
6ba6dc7

alexfremont commited on

Disable prepared statement cache for pgbouncer compatibility
db280f4

alexfremont commited on

Remove DELETE endpoint for model unload, keep POST alternative only
28b854e

alexfremont commited on

Add POST endpoint alternative for unloading models from memory
ca20804

alexfremont commited on

Add API endpoint to unload models from memory without database deletion
db789ea

alexfremont commited on

Improve model info formatting for cleaner display in API responses
1445bc9

alexfremont commited on

Add enhanced Gradio UI with tabs, table view and model details formatting
df3bf97

alexfremont commited on

Move fetch_models_for_group from queries to models module
f85309c

alexfremont commited on

Fix import path from api.routes to api.router
1b3980d

alexfremont commited on

Replace gradio import syntax and remove unnecessary whitespace
77f06e1

alexfremont commited on

Remove unused imports and simplify FastAPI/Gradio dependencies
68a1cf9

alexfremont commited on

Change gradio import syntax from direct to aliased import
927cd6e

alexfremont commited on

Refactor model loading to store metadata alongside pipelines in model_pipelines dict
dfb1c84

alexfremont commited on

Add force_download param to model pipeline loader for manual cache refresh
638ed9d

alexfremont commited on

Fix model logging to use display_name instead of name field
c2cd706

alexfremont commited on

Remove name field from model queries in db/models.py
ec6e669

alexfremont commited on

Update model logging to use hf_filename instead of model_name
419e526

alexfremont commited on

Move API key middleware to main.py and add startup/shutdown lifecycle management
5df1f2d

alexfremont commited on

Clean up imports and remove unused code across API modules
3635acb

alexfremont commited on

Update type hint from Dict|None to Union[Dict, None] in fetch_model_by_id
48b9b47

alexfremont commited on

Refactor API auth and add management endpoints for model loading/updating
bccef3b

alexfremont commited on

Update requirements.txt with new package dependencies
832b9bb

alexfremont commited on

Add model management endpoints and database fetch functionality
0053356

alexfremont commited on

Add theme.css and queue endpoints to static asset routing whitelist
2c455c2

alexfremont commited on

Add exception for Gradio static assets in API authentication middleware
97bf0f2

alexfremont commited on

Merge Gradio UI into FastAPI app and standardize port to 7860
b26a6dc

alexfremont commited on

Add Gradio UI service and update Docker config for dual API/UI setup
5160659

alexfremont commited on

Add os import and update API key validation in router.py
c35fb5d

alexfremont commited on

Skip auth middleware for debug mode and HF Space container logs
ba5d6d2

alexfremont commited on

Refactor API architecture with modular design and database integration
e109700

alexfremont commited on

Update README.md
5559385
verified

alexfremont commited on

update model name
b5d6f86

alexfremont commited on

update api with new models
4b470c5

alexfremont commited on

add new and remove models
41517aa

alexfremont commited on

try new version of API
37e8687

alexfremont commited on