Stable Diffusion is great at taking source images and applying a style to them. In this example, I turn Google Maps into anime-style backgrounds by using a section of my screen as input and generating a new image whenever a change is detected.
ComfyUI is a node-based GUI for Stable Diffusion. I like ComfyUI because of my game development experience with node editors (Unreal Blueprints, Unity Shader Graph, Substance Designer, Blender Shader Editor). Most people who start learning Stable Diffusion go with Stable Diffusion web UI, known as Automatic1111 to avoid ambiguity. There’s also Fooocus. It’s great for generating good images quickly, but you can’t see exactly what is happening under the hood.
ComfyUI workflow
Here is the workflow:
- Checkpoint model: meinamix_meinaV11
- Positive Prompt: day, noon, (blue sky:1.0), clear sky
- Negative Prompt: (worst quality, low quality:1.4), (zombie, sketch, interlocked fingers, comic)
- Resolution: 768 x 512
- ControlNet model: control_v11p_sd15_canny.pth
Depending on the Google Maps location, I add a country or city name to the positive prompt (e.g. Japan, New York, Paris). I used toyxyz’s custom webcam node to capture a section of the screen and feed the output into a ControlNet canny model.
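For illustration, here is a minimal Python sketch of what that capture step boils down to: grab a region of the screen, check whether it changed since the last frame, and turn it into the edge map that the canny ControlNet consumes. The region coordinates, change threshold, and Canny thresholds are my own placeholder values, not anything taken from toyxyz’s node.

```python
# Minimal sketch (not toyxyz's actual node): watch a screen region and
# produce a Canny edge map whenever the captured pixels change.
# pip install mss opencv-python numpy
import time

import cv2
import numpy as np
from mss import mss

REGION = {"top": 100, "left": 100, "width": 768, "height": 512}  # assumed maps window area
CHANGE_THRESHOLD = 2.0  # assumed mean pixel difference that counts as "changed"


def watch_screen(on_new_edges, poll_seconds=0.5):
    """Poll the region; when it changes, hand a 3-channel edge map to the callback."""
    previous = None
    with mss() as sct:
        while True:
            frame = np.array(sct.grab(REGION))               # BGRA screenshot
            gray = cv2.cvtColor(frame, cv2.COLOR_BGRA2GRAY)
            changed = previous is None or np.abs(
                gray.astype(np.int16) - previous.astype(np.int16)
            ).mean() > CHANGE_THRESHOLD
            if changed:
                edges = cv2.Canny(gray, 100, 200)            # assumed Canny thresholds
                on_new_edges(cv2.cvtColor(edges, cv2.COLOR_GRAY2RGB))
            previous = gray
            time.sleep(poll_seconds)
```

In the actual workflow this logic lives inside the screen-capture node and ComfyUI’s Canny preprocessor, so no extra scripting is needed; the sketch just shows what the data looks like by the time it reaches the ControlNet.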
KSampler:
- seed: 1
- control_after_generate: fixed
- steps: 15
- cfg: 4.0
- sampler_name: euler_ancestral
- scheduler: normal
- denoise: 1.00
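If you would rather drive the graph from a script than from the GUI, ComfyUI exposes an HTTP endpoint that accepts a workflow exported with “Save (API Format)”. Below is a hedged sketch that patches the KSampler settings above into such an export and queues it; the file name and server address are assumptions (the port shown is ComfyUI’s default).

```python
# Sketch: queue the workflow through ComfyUI's /prompt endpoint.
# "maps_to_anime_api.json" is a placeholder for a graph exported via
# "Save (API Format)" in the ComfyUI menu.
import json
import urllib.request

COMFY_URL = "http://127.0.0.1:8188/prompt"  # default local ComfyUI address

with open("maps_to_anime_api.json") as f:
    workflow = json.load(f)

# Apply the sampler settings listed above to every KSampler node in the graph.
# (control_after_generate is a UI-only field, so it has no API input here.)
for node in workflow.values():
    if node.get("class_type") == "KSampler":
        node["inputs"].update({
            "seed": 1,
            "steps": 15,
            "cfg": 4.0,
            "sampler_name": "euler_ancestral",
            "scheduler": "normal",
            "denoise": 1.0,
        })

request = urllib.request.Request(
    COMFY_URL,
    data=json.dumps({"prompt": workflow}).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
urllib.request.urlopen(request)
```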
You might be asking, “Why didn’t you use LCM-LoRA like in toyxyz’s example workflow?” I tried it, but the final image looked too similar to the webcam capture. I wanted it to be more creative, and modifying the prompts alone wasn’t giving me the results I wanted.
It is possible to optimize this further for better and faster generations, perhaps by using StreamDiffusion, TouchDesigner, or a model based on SDXL-Lightning. Check out FĂ©lix Sanz’s articles on the SDXL-Lightning comparison and on optimizing Stable Diffusion XL.