An end-to-end (e2e) Voice Language Model by Fish Audio.
Generate realistic dialogue from a script, using Dia!
Use the FLUX-Pro model as much as you want.