zip2zip: Inference-Time Adaptive Vocabularies for Language Models via Token Compression Paper • 2506.01084 • Published Jun 1
Generating Structured Outputs from Language Models: Benchmark and Studies Paper • 2501.10868 • Published Jan 18