FP8 quant made with llm-compressor. The archive is compressed with zstd due to slow internet.

Linux

Install zstd:

# Ubuntu/Debian
sudo apt install zstd

# RHEL/CentOS/Fedora
sudo dnf install zstd

Extract (multi-threaded):

# Single command (GNU tar; -I pipes the archive through zstd)
tar -I 'zstd -T0' -xf Austral-FP8.tar.zst

# Two-step method
zstd -d -T0 Austral-FP8.tar.zst
tar -xf Austral-FP8.tar
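
The two-step method writes a full intermediate .tar to disk, which roughly doubles the space needed during extraction. As a minimal sketch, the hypothetical helper below (the name `extract_zst` and its arguments are assumptions, not part of this repo) first checks the archive with `zstd -t` and then streams the decompressed data straight into tar:

```shell
# Hypothetical helper: verify a .tar.zst archive, then stream it into tar
# without writing the intermediate .tar file to disk.
extract_zst() {
    archive="$1"        # e.g. Austral-FP8.tar.zst
    dest="${2:-.}"      # target directory, defaults to the current one

    # zstd -t tests archive integrity without writing any output
    zstd -t "$archive" || return 1
    mkdir -p "$dest" || return 1

    # -d decompress, -c write to stdout; tar reads the stream from stdin
    zstd -d -c "$archive" | tar -xf - -C "$dest"
}
```

Usage would look like `extract_zst Austral-FP8.tar.zst ./Austral-FP8`. The integrity check reads the whole archive once before extraction, so it costs extra time in exchange for catching a truncated download early.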

macOS

Install zstd:

brew install zstd

Extract (multi-threaded):

# Single command (the bundled bsdtar does not treat -I like GNU tar, so pipe instead)
zstd -d -c -T0 Austral-FP8.tar.zst | tar -xf -

# Two-step method
zstd -d -T0 Austral-FP8.tar.zst
tar -xf Austral-FP8.tar

Windows

Install zstd:

  • Download from Facebook/zstd releases
  • Or use Chocolatey: choco install zstandard
  • Or use WSL with Linux instructions

Extract (multi-threaded):

PowerShell/CMD:

zstd.exe -d -T0 Austral-FP8.tar.zst
tar -xf Austral-FP8.tar

WSL:

zstd -d -T0 Austral-FP8.tar.zst
tar -xf Austral-FP8.tar

Options Explained

  • -d: Decompress
  • -T0: Let zstd choose the thread count from the detected CPU cores
  • -T4: Use 4 threads (any number can be given)
  • --rm: Delete the source .zst file after successful decompression
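
To show how these options combine, here is a small sketch of the two-step method that cleans up after itself. The function name `unpack_and_clean` and its parameters are illustrative assumptions. Note that `--rm` deletes the downloaded archive, so use it only when a re-download is acceptable.

```shell
# Hypothetical helper: two-step extraction that removes the .zst and the
# intermediate .tar once each step has succeeded.
unpack_and_clean() {
    archive="$1"              # e.g. Austral-FP8.tar.zst
    threads="${2:-0}"         # 0 lets zstd pick based on detected cores
    tarball="${archive%.zst}" # zstd writes the output next to the source

    # -d decompress, -T<n> thread count, --rm delete the .zst on success
    zstd -d "-T${threads}" --rm "$archive" || return 1

    # Extract into the current directory, then drop the intermediate .tar
    tar -xf "$tarball" && rm -f "$tarball"
}
```

Usage would look like `unpack_and_clean Austral-FP8.tar.zst 4`.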

Notes

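  • The archive uses a high compression level for a smaller download; with zstd this mostly costs time when compressing, and decompression speed stays roughly the same across levels
  • -T0 mainly speeds up compression; zstd decompression is largely single-threaded, so extraction time often depends more on disk speed than on core count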