add paper link
README.md CHANGED
@@ -84,6 +84,8 @@ A fine-tuned version of [google/long-t5-tglobal-base](https://huggingface.co/goo
- 20+ epochs of fine-tuning from the base model on V100/A100 GPUs
- all training used 16384 token input / 1024 max output

+Read the paper by Guo et al. here: [LongT5: Efficient Text-To-Text Transformer for Long Sequences](https://arxiv.org/pdf/2112.07916.pdf)
+
## How-To in Python

Install/update transformers `pip install -U transformers`
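The diff cuts off after the install step, so the usage that follows in the README is not shown here. Below is a minimal sketch of how a checkpoint like this is typically loaded through the `transformers` summarization pipeline; the `MODEL_ID` placeholder is an assumption standing in for this repo's actual model ID (which this excerpt does not name), and the generation settings simply echo the 16384-token input / 1024-token output figures from the training notes above.

```python
# Minimal sketch, assuming MODEL_ID is replaced with this repo's model ID.
from transformers import pipeline

summarizer = pipeline(
    "summarization",
    model="MODEL_ID",  # placeholder, not the real checkpoint name
)

# LongT5 variants like this one accept very long inputs (up to 16384
# tokens per the training notes); max_length caps the generated summary.
long_text = "Replace this with the document you want to summarize."
result = summarizer(long_text, max_length=1024)
print(result[0]["summary_text"])
```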