ComfyUI. I get my workflows from this dude who makes super optimized ComfyUI workflows: https://markdkberry.com/workflows/research/
If you have at least 12 GB of VRAM you can run the Q4 GGUF models with a tiny bit of noise. I upgraded to 24 GB recently so I could run the Q8s. My last 15.ai post was made with the Q4s, but I never tried upscaling it, so you could potentially get higher quality gens that way. You will need to figure out how to install Triton and SageAttention into ComfyUI, however.
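Not an install guide, but here's a rough sketch for checking whether the two packages are actually visible once you think you've installed them. It assumes the import names are `triton` and `sageattention` (same as the pip package names), and `check_packages` is just a helper name I made up. Run it with the same Python that ComfyUI runs with, otherwise it tells you nothing.

```python
import importlib.util


def check_packages(names=("triton", "sageattention")):
    """Return {name: bool} saying whether each package can be found.

    The default names are an assumption: they are the import names
    I would expect for Triton and SageAttention.
    """
    # find_spec returns None when a top-level module is not installed,
    # and it does not actually import the module.
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    for name, found in check_packages().items():
        print(f"{name}: {'found' if found else 'missing'}")
```

If either one shows up as missing, it got installed into a different Python environment than the one you ran the check with (or not installed at all).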