Update README.md
README.md
CHANGED
@@ -34,6 +34,8 @@ The model takes a paragraph as input and generates a list of keywords or key phrases
 **Limitations:**
 - The model may sometimes generate irrelevant keywords
 - Performance may vary depending on the length and complexity of the input text
+- For best results, use long clean texts
+- Length limit is 512 tokens due to Flan-T5 architecture
 - The model is trained on English text and may not perform well on other languages
 
 ## Training and Evaluation
@@ -63,13 +65,11 @@ print(keywords)
 
 Example input paragraph:
 
-```
-In the heart of the bustling city, a hidden gem awaits discovery: a quaint little bookstore that seems to have escaped the relentless march of time. As you step inside, the scent of aged paper and rich coffee envelops you, creating an inviting atmosphere that beckons you to explore its shelves. Each corner is adorned with carefully curated collections, from classic literature to contemporary bestsellers, inviting readers of all tastes to lose themselves in the pages of a good book. The soft glow of warm lighting casts a cozy ambiance, while the gentle hum of conversation among fellow book lovers adds to the charm. This bookstore is not just a place to buy books; it's a sanctuary for those seeking solace, inspiration, and a sense of community in the fast-paced world outside.
-```
+```In the heart of the bustling city, a hidden gem awaits discovery: a quaint little bookstore that seems to have escaped the relentless march of time. As you step inside, the scent of aged paper and rich coffee envelops you, creating an inviting atmosphere that beckons you to explore its shelves. Each corner is adorned with carefully curated collections, from classic literature to contemporary bestsellers, inviting readers of all tastes to lose themselves in the pages of a good book. The soft glow of warm lighting casts a cozy ambiance, while the gentle hum of conversation among fellow book lovers adds to the charm. This bookstore is not just a place to buy books; it's a sanctuary for those seeking solace, inspiration, and a sense of community in the fast-paced world outside.```
 
 Example output keywords:
 
-`['
+`['old paper coffee scent', 'cosy hum of conversation', 'quaint bookstore', 'community in the fast-paced world', 'solace inspiration', 'curated collections']`
 
 ## Limitations and Bias
 
@@ -79,7 +79,6 @@ This model has been trained on English Wikipedia paragraphs, which may introduce
 
 - **Training Data:** dataset of Wikipedia paragraphs and keywords
 - **Training Procedure:** Fine-tuning of google/flan-t5-small
-- **Hyperparameters:** Not specified
 
 ### Training hyperparameters
 
@@ -94,10 +93,10 @@ The following hyperparameters were used during training:
 
 ### Framework versions
 
-- Transformers 4.
-- Pytorch 2.
-- Datasets
-- Tokenizers 0.
+- Transformers 4.45.1
+- Pytorch 2.4.1+cu121
+- Datasets 3.0.1
+- Tokenizers 0.20.0
 
 ## Ethical Considerations
 
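The example output added in this diff is a Python-style list rendered as a string. Assuming the model's decoded generation arrives as such a string (the README's usage snippet ends with `print(keywords)`, so the exact return type is not shown here), it can be parsed defensively — a minimal sketch, with a fallback for truncated generations:

```python
import ast

def parse_keywords(raw: str) -> list[str]:
    """Parse a model output that looks like a Python list literal.

    Falls back to a plain comma split if the string is not a valid
    literal (e.g. a generation cut off mid-list).
    """
    try:
        parsed = ast.literal_eval(raw.strip())
        if isinstance(parsed, list):
            return [str(k).strip() for k in parsed]
    except (ValueError, SyntaxError):
        pass
    # Fallback: strip brackets/quotes from the ends and split on commas
    return [k.strip(" '\"") for k in raw.strip("[] \n").split(",") if k.strip(" '\"")]

raw = "['old paper coffee scent', 'quaint bookstore', 'curated collections']"
print(parse_keywords(raw))  # ['old paper coffee scent', 'quaint bookstore', 'curated collections']
```

The fallback matters because seq2seq decoding can stop at the token budget mid-string, which makes `ast.literal_eval` raise `SyntaxError`.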
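The 512-token limit added in the limitations bullet is counted in Flan-T5 subword tokens, so the authoritative check is the model's own tokenizer (e.g. `AutoTokenizer.from_pretrained("google/flan-t5-small")`). As a dependency-free sketch, a rough pre-check can estimate the count from whitespace words; the ~1.3 tokens-per-word ratio is an assumption for English prose, not a property of the model:

```python
def fits_context(text: str, max_tokens: int = 512, tokens_per_word: float = 1.3) -> bool:
    """Rough pre-check against Flan-T5's 512-token input limit.

    Estimates the subword count from a whitespace word count scaled by
    an assumed ratio; use the real tokenizer for an exact answer.
    """
    estimated = int(len(text.split()) * tokens_per_word)
    return estimated <= max_tokens

print(fits_context("a short paragraph"))  # True
print(fits_context("word " * 600))        # False: ~780 estimated tokens
```

Inputs that fail this check should be split or truncated before generation, since tokens beyond the limit are silently cut off and never influence the extracted keywords.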