@isidentical
@samusenps
already posted my flow and I encourage you to try it but I would like to comment on one thing:
"We have been also testing face embeddings, but even with multiple samples the quality is not anywhere close to what we expect."
If you want to achieve the best results then mixing models is the way to go IMHO, but I've had quite good results with only the embeddings. You can check my models and filter by embeddings to see some examples: https://civitai.com/user/malcolmrey
If you like what you see, there is an article on how exactly (all parameters) I train those embeddings :)