thepushkarp
/

Dia-1.6B-safetensors-fp16

Model card Files Files and versions

thepushkarp commited on 6 days ago

Commit

e5962ab

·

verified ·

1 Parent(s): e9a998d

Update README.md

Files changed (1) hide show

README.md +32 -1

README.md CHANGED Viewed

@@ -22,6 +22,37 @@ Maximum relative tensor difference: 0.229572
 Average absolute tensor difference: 0.000010
 ```
 <center>
 <a href="https://github.com/nari-labs/dia">
 <img src="https://github.com/nari-labs/dia/raw/main/dia/static/images/banner.png">
@@ -80,7 +111,7 @@ import soundfile as sf
 from dia.model import Dia
-model = Dia.from_pretrained("thepushkarp/Dia-1.6B-safetensors-fp16")
 text = "[S1] Dia is an open weights text to dialogue model. [S2] You get full control over scripts and voices. [S1] Wow. Amazing. (laughs) [S2] Try it now on Git hub or Hugging Face."

 Average absolute tensor difference: 0.000010
 ```
+To use the safetensors file, use this custom script which allows loading from safetensors:
+First install the library:
+```
+git clone https://github.com/thepushkarp/dia.git
+cd dia
+python -m venv .venv
+source .venv/bin/activate
+```
+Then run:
+```
+import soundfile as sf
+from dia.model import Dia
+model = Dia.from_pretrained(
+    "thepushkarp/Dia-1.6B-safetensors-fp16",
+    config_path="config.json",
+    checkpoint_path="dia-v0_1-fp16.safetensors",
+)
+text = "[S1] Dia is an open weights text to dialogue model. [S2] You get full control over scripts and voices. [S1] Wow. Amazing. (laughs) [S2] Try it now on Git hub or Hugging Face."
+output = model.generate(text)
+sf.write("simple.mp3", output, 44100)
+```
+---
 <center>
 <a href="https://github.com/nari-labs/dia">
 <img src="https://github.com/nari-labs/dia/raw/main/dia/static/images/banner.png">
 from dia.model import Dia
+model = Dia.from_pretrained("nari-labs/Dia-1.6B")
 text = "[S1] Dia is an open weights text to dialogue model. [S2] You get full control over scripts and voices. [S1] Wow. Amazing. (laughs) [S2] Try it now on Git hub or Hugging Face."