@Thoryn
Glad I’m not the only one. My uploads have slowed dramatically.
@Equum Unfortunately, those are limitations of CLIP, the text encoder a lot of older models use, including PonyV6, Illustrious, and pretty much every model that competently renders ponies. One of CLIP’s major limitations is that it doesn’t really comprehend spatial relationships; it can only handle one when that specific arrangement shows up a lot in the training data. It can’t dynamically put together a new spatial relationship without a LoRA to assist.
Newer models have a much better understanding of spatial relationships (and most of them also handle natural-language prompts), which makes them much more useful for prompts like the one you’re describing. Unfortunately, most of them suck at general pony knowledge; some might vaguely get major characters like Twilight or Celestia right, but many can’t even comprehend the CMC. They’re also generally heavier models that require better hardware just to get tolerable generation times.