Make refiner switchover based on model timesteps instead of sampling steps #14978

drhead · 2024-02-20T21:57:51Z

Description

This aligns the default behavior of refiner switchover with model timesteps instead of sampling steps. This is easier to use: setting refiner switchover to 0.8 (for a refiner trained on the last 200 timesteps, like SDXL's) will now always switch to the refiner as soon as we're sampling from timesteps the refiner was trained on. Previously, using img2img or different noise schedules could lead to unexpected model behavior (examples below).
I changed this to work off of the existing refiner switchover slider. The new default behavior will be to derive timesteps from sigma (or in DDIM's case take them directly) and use that to determine whether it is time to switch over. Old behavior is supported by a compatibility option.
Implements [Feature Request]: Refiner switchover should be controlled by (fraction of) training timesteps and not by fraction of sampling #14970
~~edit: converted to draft while I troubleshoot an off-by-one error~~ There is a bug where model alphas_cumprod changes (compatibility casting option or zero terminal snr) are reverted during the timestep the refiner is applied. This will have to be fixed separately.
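As a rough illustration of the timestep-based trigger described above, here is a minimal sketch, not the code from this PR. It assumes a 1000-timestep model with an SD-style "scaled linear" beta schedule; the function names `make_sigmas`, `sigma_to_timestep`, and `should_switch_to_refiner` are hypothetical, and the nearest-sigma lookup is just one plausible way to derive a timestep from sigma.

```python
import bisect
import math


def make_sigmas(num_train_timesteps=1000, beta_start=0.00085, beta_end=0.012):
    """Per-timestep sigmas for an assumed SD-style 'scaled linear' beta schedule."""
    sigmas = []
    alpha_cumprod = 1.0
    for i in range(num_train_timesteps):
        frac = i / (num_train_timesteps - 1)
        sqrt_beta = math.sqrt(beta_start) + frac * (math.sqrt(beta_end) - math.sqrt(beta_start))
        alpha_cumprod *= 1.0 - sqrt_beta ** 2
        sigmas.append(math.sqrt((1.0 - alpha_cumprod) / alpha_cumprod))
    return sigmas  # strictly increasing with the timestep index


def sigma_to_timestep(sigma, sigmas):
    """Index of the training timestep whose sigma is closest to `sigma`."""
    idx = bisect.bisect_left(sigmas, sigma)
    if idx == 0:
        return 0
    if idx == len(sigmas):
        return len(sigmas) - 1
    # Choose the nearer of the two neighbouring timesteps.
    return idx if sigmas[idx] - sigma < sigma - sigmas[idx - 1] else idx - 1


def should_switch_to_refiner(sigma, sigmas, switch_at=0.8):
    """True once the current timestep is inside the last (1 - switch_at) share of timesteps."""
    threshold = round((1.0 - switch_at) * len(sigmas))  # 200 for switch_at=0.8
    return sigma_to_timestep(sigma, sigmas) < threshold


sigmas = make_sigmas()
print(should_switch_to_refiner(sigmas[500], sigmas))  # mid-schedule sigma
print(should_switch_to_refiner(sigmas[100], sigmas))  # sigma inside the refiner's range
```

Because the decision depends only on the current sigma, the same slider value means the same thing regardless of step count, schedule type, or denoising strength.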

Screenshots/videos:

Examples of old behavior in txt2img:

The refiner model used here was trained for the last 200 timesteps. The Karras schedule type, especially with zero terminal SNR, drastically changes which model timesteps are called during this 50-step sampling process. As a result, 0.8, which is the correct setting on the default noise schedule, switches to the refiner too early. Under the old configuration, the effectively correct setting for Karras samplers with this refiner is 0.88.
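To see why a fixed fraction of sampling steps does not map to a fixed model timestep, the sketch below builds Karras-style schedules and reports the fraction of sampling steps completed when sigma first drops below the refiner's range. All numeric values here (`SIGMA_MIN`, the two `sigma_max` values, and `SIGMA_AT_REFINER_START` as a stand-in for the sigma near timestep 200) are illustrative assumptions, not measurements from the models in this PR.

```python
def karras_sigmas(n, sigma_min, sigma_max, rho=7.0):
    """Karras-style schedule: n sigmas decreasing from sigma_max to sigma_min."""
    max_r = sigma_max ** (1.0 / rho)
    min_r = sigma_min ** (1.0 / rho)
    return [(max_r + i / (n - 1) * (min_r - max_r)) ** rho for i in range(n)]


def first_refiner_step_fraction(sigmas, sigma_threshold):
    """Fraction of sampling steps completed when sigma first falls below the threshold."""
    for i, sigma in enumerate(sigmas):
        if sigma < sigma_threshold:
            return i / len(sigmas)
    return None  # the schedule never enters the refiner's range


STEPS = 50
SIGMA_MIN = 0.03                 # assumed
SIGMA_AT_REFINER_START = 0.6     # assumed sigma near the start of the refiner's range

# A moderate sigma_max versus a very large one, as a zero terminal SNR model would have.
regular = first_refiner_step_fraction(
    karras_sigmas(STEPS, SIGMA_MIN, 14.6), SIGMA_AT_REFINER_START)
zero_snr = first_refiner_step_fraction(
    karras_sigmas(STEPS, SIGMA_MIN, 4500.0), SIGMA_AT_REFINER_START)
print(regular, zero_snr)
```

Under these assumptions, a larger `sigma_max` pushes the crossing point later in sampling, so a step-based slider value that is right for one schedule is wrong for another.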

Now for the fixed version:

With this fix, the behavior of the refiner is consistent with the same settings across different schedules, and it no longer triggers too early. 0.8 is reliably a correct setting.

Examples of old behavior in img2img/inpainting (inpainting mask is over the head, adding a hat, 0.75 denoising strength):

This one is more complicated, and the differences are subtle. The effectively correct setting is to switch over at 0.75 for the normal schedule and at 0.85 for Karras. Using the expected setting of 0.8 is therefore too late for normal schedules and too early for Karras ones. As denoising strength gets lower, the problem becomes more severe.

This grid shows the behavior after the fix. Switch at 0.8 is now correct for both.
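The img2img shift under the old behavior can be approximated with simple arithmetic. The sketch below is an idealization: it assumes sampling starts at the model-timestep fraction equal to the denoising strength and that steps are evenly spaced in timesteps, which only roughly matches a real sampler. The function name `effective_step_fraction` is hypothetical.

```python
def effective_step_fraction(switch_at, denoising_strength):
    """Sampling-step fraction at which the refiner's timestep range begins.

    Idealized assumptions: sampling starts at the timestep fraction equal to
    `denoising_strength`, and sampling steps are evenly spaced in timesteps.
    """
    refiner_share = 1.0 - switch_at  # e.g. 0.2 for the last 200 of 1000 timesteps
    if denoising_strength <= refiner_share:
        return 0.0  # sampling already starts inside the refiner's range
    return 1.0 - refiner_share / denoising_strength


for strength in (1.0, 0.75, 0.5, 0.25):
    print(strength, round(effective_step_fraction(0.8, strength), 3))
```

Under this model the step-based value needed to hit the same timestep drops as denoising strength drops, which is consistent with the problem becoming more severe at lower strengths.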

Checklist:

I have read contributing wiki page
I have performed a self-review of my own code
My code follows the style guidelines
My code passes tests

drhead · 2024-02-21T03:31:44Z

I will note that there are two remaining cases that are handled poorly by the refiner:

DPM Adaptive -- Will switch to the refiner on the first timestep under the threshold, but can then sample a higher timestep, producing incorrect output. Previously, nothing was able to trigger refiner switchover, since DPM Adaptive has no definite step size. This can be mitigated by choosing a slightly higher trigger point or by not using the refiner.
Restart -- Will switch to the refiner the first time timesteps fall under the threshold (around the halfway point of sampling). Timesteps then go over the threshold, below it again, above it, and below it once more.

There is no proper way to handle these cases without having the refiner switch on and off multiple times during sampling. Neither worked particularly well with refiners to begin with, and Restart is the only one that could be argued to be a regression. I can look into ways to get Restart to switch to the refiner only on the last "restart", which would have Restart working as well as it could have before, but I don't think that robust handling of both of these cases would have benefits that justify the extra complexity required.
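The difference between switching on the first crossing and switching only on the last "restart" can be sketched as follows. This is a hypothetical illustration, not code from this PR: the timestep sequence is made up, the function names are invented, and the second approach assumes the whole timestep sequence is known before sampling starts, which would not hold for DPM Adaptive.

```python
def first_crossing(timesteps, threshold):
    """Index of the first sampling step whose timestep is below the threshold."""
    for i, t in enumerate(timesteps):
        if t < threshold:
            return i
    return None


def final_crossing(timesteps, threshold):
    """Index of the first step after the last timestep at or above the threshold.

    From this step onward sampling never leaves the refiner's range.
    """
    last_above = -1
    for i, t in enumerate(timesteps):
        if t >= threshold:
            last_above = i
    nxt = last_above + 1
    return nxt if nxt < len(timesteps) else None


# Made-up Restart-like sequence: descends, jumps back up twice, descends again.
timesteps = [900, 600, 300, 150, 50, 400, 150, 50, 400, 150, 50]
print(first_crossing(timesteps, 200), final_crossing(timesteps, 200))
```

Switching at the index returned by `first_crossing` leaves the refiner active while timesteps climb back above the threshold; switching at `final_crossing` avoids that at the cost of skipping the refiner during earlier descents.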

drhead added 3 commits February 20, 2024 16:18

Add compatibility option for refiner switching

f4869f8

Pass sigma to apply_refiner

09d2e58

Allow refiner to be triggered by model timestep instead of sampling

25eeeaa

drhead requested a review from AUTOMATIC1111 as a code owner February 20, 2024 21:57

fix missing arg

bf34803

drhead marked this pull request as draft February 20, 2024 22:46

drhead marked this pull request as ready for review February 20, 2024 23:04

drhead mentioned this pull request Feb 21, 2024

Protect alphas_cumprod during refiner switchover #14979

Merged

4 tasks

AUTOMATIC1111 approved these changes Mar 2, 2024


AUTOMATIC1111 merged commit aabedcb into AUTOMATIC1111:dev Mar 2, 2024
3 checks passed

AUTOMATIC1111 added a commit that referenced this pull request Mar 2, 2024

infotext support for #14978

bb24c13
