fancyfeast commited on
Commit
3101114
·
verified ·
1 Parent(s): 537fbaf

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -9,10 +9,10 @@ tags:
9
 
10
  [Github](https://github.com/fpgaminer/joycaption)
11
 
12
- JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
13
 
14
  Key Features:
15
- - **Free and Open**: It will be released for free, open weights, no restrictions, and just like [bigASP](https://www.reddit.com/r/StableDiffusion/comments/1dbasvx/the_gory_details_of_finetuning_sdxl_for_30m/), will come with training scripts and lots of juicy details on how it gets built.
16
  - **Uncensored**: Equal coverage of SFW and NSFW concepts. No "cylindrical shaped object with a white substance coming out on it" here.
17
  - **Diversity**: All are welcome here. Do you like digital art? Photoreal? Anime? Furry? JoyCaption is for everyone. Pains are being taken to ensure broad coverage of image styles, content, ethnicity, gender, orientation, etc.
18
  - **Minimal Filtering**: JoyCaption is trained on large swathes of images so that it can understand almost all aspects of our world. almost. Illegal content will never be tolerated in JoyCaption's training.
@@ -39,7 +39,7 @@ from transformers import AutoProcessor, LlavaForConditionalGeneration
39
 
40
  IMAGE_PATH = "image.jpg"
41
  PROMPT = "Write a long descriptive caption for this image in a formal tone."
42
- MODEL_NAME = "fancyfeast/llama-joycaption-alpha-two-hf-llava"
43
 
44
 
45
  # Load JoyCaption
@@ -79,7 +79,7 @@ with torch.no_grad():
79
  # Generate the captions
80
  generate_ids = llava_model.generate(
81
  **inputs,
82
- max_new_tokens=300,
83
  do_sample=True,
84
  suppress_tokens=None,
85
  use_cache=True,
 
9
 
10
  [Github](https://github.com/fpgaminer/joycaption)
11
 
12
+ JoyCaption is an image captioning Visual Language Model (VLM) built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
13
 
14
  Key Features:
15
+ - **Free and Open**: Always released for free, open weights, no restrictions, and just like [bigASP](https://www.reddit.com/r/StableDiffusion/comments/1dbasvx/the_gory_details_of_finetuning_sdxl_for_30m/), will come with training scripts and lots of juicy details on how it gets built.
16
  - **Uncensored**: Equal coverage of SFW and NSFW concepts. No "cylindrical shaped object with a white substance coming out on it" here.
17
  - **Diversity**: All are welcome here. Do you like digital art? Photoreal? Anime? Furry? JoyCaption is for everyone. Pains are being taken to ensure broad coverage of image styles, content, ethnicity, gender, orientation, etc.
18
  - **Minimal Filtering**: JoyCaption is trained on large swathes of images so that it can understand almost all aspects of our world. almost. Illegal content will never be tolerated in JoyCaption's training.
 
39
 
40
  IMAGE_PATH = "image.jpg"
41
  PROMPT = "Write a long descriptive caption for this image in a formal tone."
42
+ MODEL_NAME = "fancyfeast/llama-joycaption-beta-one-hf-llava"
43
 
44
 
45
  # Load JoyCaption
 
79
  # Generate the captions
80
  generate_ids = llava_model.generate(
81
  **inputs,
82
+ max_new_tokens=512,
83
  do_sample=True,
84
  suppress_tokens=None,
85
  use_cache=True,