bghira
/

pseudo-flex-base

@@ -63,11 +63,49 @@ Ergo, at 1300 steps, the decision was made to cease training on the original LAI
 This consisted of 17,800 images at a base resolution of 1024x1024, with about 700 samples in portrait and 700 samples in landscape.
-## Improvement in quality
 Similar to the text encoder swap, the images showed a marked improvement over the next several checkpoints.
-# Test release
 This model has been packaged up in a test form so that it can be thoroughly assessed by users.

 This consisted of 17,800 images at a base resolution of 1024x1024, with about 700 samples in portrait and 700 samples in landscape.
+## Contrast issues
+As the checkpoint 3275 was tested, a common observation was that darker images were washed out, and brighter images seemed "meh".
+Various CFG rescale and guidance levels were tested, with the best dark images occurring around `guidance_scale=9.2` and `guidance_rescale=0.0` but they remained "washed out".
+## Dataset change number two
+A new LAION subset was prepared with unique images and no square images - just a limited collection of aspect ratios:
+* 16:9
+* 9:16
+* 2:3
+* 3:2
+This was intended to speed up the understanding of the model, and prevent overfitting on captions.
+This LAION subset contained 17,800 images, evenly distributed through aspect ratios.
+The images were then captioned using T5 Flan with BLIP2, to obtain highly accurate results.
+## Contrast fix: offset noise / SNR gamma to the rescue?
+Offset noise and SNR gamma were applied experimentally to the checkpoint **4250**:
+* `snr_gamma=5.0`
+* `noise_offset=0.2`
+* `noise_pertubation=0.1`
+Within 25 steps of training, the contrast was back, and the prompt `a solid black square` once again produced a reasonable result.
+At 50 steps of offset noise, things really seemed to "click" and `a solid black square` had the fewest deformities I've seen.
+Step 75 checkpoint was broken. The SNR gamma math results in numeric instability and was disabled. The offset noise parameters were untouched.
+## Success! Improvement in quality and contrast.
 Similar to the text encoder swap, the images showed a marked improvement over the next several checkpoints.
+It was left to its own devices, and at step 4475, enough improvement was observed that another revision in this repository was created.
+# Status: Test release
 This model has been packaged up in a test form so that it can be thoroughly assessed by users.