Playing Around with Stable Diffusion

Suggest an Edit

If you have been following for a while now, you may have noticed that I sometimes include images in my posts. Some of these have been generated by AI, specifically Midjourney. Using Midjourney for the first time completely blew me away.

I’m currently avidly playing around with this cutting-edge technology. Primarily, I use the Stable Diffusion checkpoints (which the AI art generation community have been sharing) for this stuff. I’ve got AUTO1111’s client setup locally and ~60GB (which is actually very little) of various checkpoint models and a bunch of other things like embeddings and LoRA models.

My GTX1660 6GB GPU has put up with more than I expected. These workloads can easily consume >8GB VRAM and my barebones, unoptimized workflow doesn’t help - if I can even call it that.

Honestly, I’m just kind of staggering about. Generate an amazing image here and there, play around with tools people have open sourced, explore the huge library of fine-tuned models people are training, learn about prompt engineering. There is just so much! Things in this space are also moving crazy fast - it is unsurprising if a guide created only a month ago no longer works.

Some notable things I’ve spent my time on:

I plan to continue all this, but I think my GPU is reaching its limits. I’m unsure as to whether I should fork out some 💰 for a more powerful GPU or to simply get a google colab subscription.

I’ll probably just wait and see until after I finish the fast.ai course.


If you’ve made it this far, here’s a little showcase 😉

Interior Design?

Rustic interior room. Unusual wooden beams, wooden table

Civil Engineering Marvels

Distant future industrial hive world, landscape, long building, flames

Man standing by himself admiring a colossal scale structure inside an enormous room with huge doorways. Regal architecture

メカ

Futuristic quadrupedal mecha in a fighting stance, rust, industrial wasteland background

Large mecha suit leaning over a spacious room with office seats to each side

Large mecha suit resembling Lagann on an alien world

The Unknown

A pillar surrounded by a black moat, depressing uncomfortable atmosphere

Quadrupedal alien inside an immense castle interior with portals beneath the ceiling, starry skies

Sharp

Abstract rendition a futuristic fighter in a still-frame

Abstract

Boy amused by a bird flying through a zentangle

Fantasy

Monarch

Regal Princess, ornate luxurious castle chamber

Princess wearing a crown with ornate jewellery, abstract bacakground

Female knight staring at the viewer infront of a iris background