Update README.md
README.md
@@ -9,10 +9,10 @@ tags:
 
 [Github](https://github.com/fpgaminer/joycaption)
 
-JoyCaption is an image captioning Visual Language Model (VLM)
+JoyCaption is an image captioning Visual Language Model (VLM) built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
 
 Key Features:
-- **Free and Open**:
+- **Free and Open**: Always released for free, open weights, no restrictions, and just like [bigASP](https://www.reddit.com/r/StableDiffusion/comments/1dbasvx/the_gory_details_of_finetuning_sdxl_for_30m/), will come with training scripts and lots of juicy details on how it gets built.
 - **Uncensored**: Equal coverage of SFW and NSFW concepts. No "cylindrical shaped object with a white substance coming out on it" here.
 - **Diversity**: All are welcome here. Do you like digital art? Photoreal? Anime? Furry? JoyCaption is for everyone. Pains are being taken to ensure broad coverage of image styles, content, ethnicity, gender, orientation, etc.
 - **Minimal Filtering**: JoyCaption is trained on large swathes of images so that it can understand almost all aspects of our world. almost. Illegal content will never be tolerated in JoyCaption's training.
@@ -39,7 +39,7 @@ from transformers import AutoProcessor, LlavaForConditionalGeneration
 
 IMAGE_PATH = "image.jpg"
 PROMPT = "Write a long descriptive caption for this image in a formal tone."
-MODEL_NAME = "fancyfeast/llama-joycaption-
+MODEL_NAME = "fancyfeast/llama-joycaption-beta-one-hf-llava"
 
 
 # Load JoyCaption
@@ -79,7 +79,7 @@ with torch.no_grad():
     # Generate the captions
     generate_ids = llava_model.generate(
         **inputs,
-        max_new_tokens=
+        max_new_tokens=512,
         do_sample=True,
         suppress_tokens=None,
         use_cache=True,