runtime error
Exit code: 1. Reason: thon==0.2.11) (4.12.2)
Requirement already satisfied: numpy>=1.20.0 in /usr/local/lib/python3.10/site-packages (from llama-cpp-python==0.2.11) (1.22.0)
Requirement already satisfied: diskcache>=5.6.1 in /usr/local/lib/python3.10/site-packages (from llama-cpp-python==0.2.11) (5.6.3)
[nltk_data] Downloading package punkt to /home/user/nltk_data...
[nltk_data]   Unzipping tokenizers/punkt.zip.
Loading Whisper ASR
config.json: 100%|██████████| 2.39k/2.39k [00:00<00:00, 13.8MB/s]
model.bin: 100%|█████████▉| 3.09G/3.09G [00:07<00:00, 413MB/s]
preprocessor_config.json: 100%|██████████| 340/340 [00:00<00:00, 2.11MB/s]
tokenizer.json: 100%|██████████| 2.48M/2.48M [00:00<00:00, 27.2MB/s]
vocabulary.json: 100%|██████████| 1.07M/1.07M [00:00<00:00, 20.0MB/s]
Traceback (most recent call last):
  File "/home/user/app/app.py", line 45, in <module>
    whisper_model = WhisperModel("large-v3", device="cuda", compute_type="float16")
  File "/usr/local/lib/python3.10/site-packages/faster_whisper/transcribe.py", line 133, in __init__
    self.model = ctranslate2.models.Whisper(
RuntimeError: CUDA failed with error CUDA driver version is insufficient for CUDA runtime version
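
The final RuntimeError is raised when WhisperModel is constructed with device="cuda". This message usually means the container sees no GPU driver, or one older than the CUDA runtime in the image. One possible workaround, sketched below under assumptions, is to choose the device at startup instead of hard-coding it. The helper names choose_whisper_settings and detect_cuda_device_count are hypothetical and do not come from app.py, and the CPU compute type "int8" is an assumed default that may not suit every deployment.

```python
from typing import Tuple


def choose_whisper_settings(cuda_device_count: int) -> Tuple[str, str]:
    """Map the number of usable CUDA devices to (device, compute_type).

    Hypothetical helper: "int8" on CPU is an assumed default, not taken
    from the original app.py.
    """
    if cuda_device_count > 0:
        return "cuda", "float16"
    return "cpu", "int8"


def detect_cuda_device_count() -> int:
    """Return the number of CUDA devices ctranslate2 reports, or 0.

    Any failure (ctranslate2 not installed, driver missing or too old)
    is treated as "no usable GPU".
    """
    try:
        import ctranslate2  # imported lazily so the helper works without it

        return int(ctranslate2.get_cuda_device_count())
    except Exception:
        return 0


if __name__ == "__main__":
    device, compute_type = choose_whisper_settings(detect_cuda_device_count())
    print(f"device={device} compute_type={compute_type}")
    # In app.py the failing call on line 45 could then become:
    # whisper_model = WhisperModel("large-v3", device=device, compute_type=compute_type)
```

Falling back to CPU only keeps the app from crashing on startup. large-v3 will run much more slowly there, so assigning GPU hardware to the container, or matching the image's CUDA runtime to the host driver, remains the real fix if GPU inference is required.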