- GPU: Nvidia GeForce RTX 3060 with 12GB of VRAM

# Txt2Img

Stable Diffusion can generate images from two primary inputs: a standalone text description of what you want the image to be, or an image accompanied by text that provides more information about the desired output. These two types of generation are referred to, respectively, as "txt2img" and "img2img" (pretty straightforward so far).
In most interfaces you'll encounter, the primary panel will be txt2img. When Automatic1111's Web UI is loaded, this is the first tab at the top left. So let's start by looking at the parameters you can tweak when converting text to an image.
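Throughout this guide I'll be working in the Web UI, but every parameter we'll cover also maps to an argument in Hugging Face's diffusers library if you prefer scripting. Here's a minimal sketch, assuming you have diffusers and torch installed, a CUDA GPU, and access to a v1.5 checkpoint (the model id and output filename below are just illustrative):

```python
# Minimal txt2img sketch using Hugging Face's diffusers library.
# Assumes a CUDA-capable GPU and that the checkpoint id below is available.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5",   # assumed checkpoint id
    torch_dtype=torch.float16,
).to("cuda")

# Generate a single image from a text prompt with mostly default settings.
image = pipe("A painting of a Kyoto cityscape by Satoshi Kon").images[0]
image.save("kyoto.png")  # illustrative output filename
```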
# Prompt

When you initialize Stable Diffusion (on your computer or through an online provider), the first feature you will likely notice is a text box titled "Prompt". As obvious as it sounds, this box is where you describe the picture you want to see so the AI knows what to make. This is perhaps the single most important parameter in Stable Diffusion. The difference between a good image and a heap of meaningless pixels often comes down to the text prompt you write.

While prompt writing is a whole topic unto itself, I'll give you the basics of how to use the Prompt box. To create an image from text, you generally need to tell the AI three things:
1. What **content** should be in the image (what subject matter, background, etc. do you want to see?)
2. What **medium** the image should look like (do you want a painting? Or a photograph?)
3. What **style** the image should emulate (do you want it to look like a specific artist?)

You can be as general or as specific in your text description as you want. A big part of playing with AI art is learning which prompt structures work for you and which generate passable results. Let's look at an example prompt; for this example, we'll leave all of the other parameters at their defaults for now.
### Our practice prompt is:

*A painting of a Kyoto cityscape by Satoshi Kon*

When we hit generate, the AI will use what it knows about this text description's elements to create an image. Let's break down our prompt and look at each phrase's impact on the end result.
**A painting of a Kyoto cityscape by Satoshi Kon**

**"A painting" is our medium modifier;** we are telling Stable Diffusion to draw on what it knows about paintings when making the piece of art. This is a very general modifier, as paintings come in many different styles (impressionist, digital, abstract, etc.) and can be done in a variety of mediums (watercolor, acrylics, etc.).

**"Kyoto cityscape" is our subject qualifier;** Stable Diffusion will ruminate on all the images it has seen of Kyoto, cityscapes, and cityscapes in Kyoto to determine what such an image would look like. This term is moderately specific, as we've noted which city the AI should take inspiration from rather than just asking for any old cityscape.

**"by Satoshi Kon" is our style modifier;** Stable Diffusion will use this text to take whatever image we're asking for and try to make it match that specific artist's style. A good tip here is to know what an artist's style looks like and what kinds of mediums and subjects are most common for that artist. Doing this will help the AI make a more believable image.

If, for example, we asked Stable Diffusion to make a pencil sketch in the style of Wes Anderson (a film director), it may have trouble anticipating what such an image would look like because it doesn't have pencil sketches by Wes Anderson in its references. It might then spit out something garbled or unrelated to your prompt.

*(Actually, this example worked quite well! Never underestimate the power of AI learning!)*

# Negative Prompt

Directly underneath the main Prompt box is another text box titled "Negative Prompt". This is the area where we can specify things we do not want to see in our output image. In our last generation, you'll notice that the image came out in black and white. I like the style of the image, but I would prefer something in color. We could change the regular prompt to specify a "colored pencil sketch", but first let's test out the negative prompt.
I will add the terms "monochrome" and "black and white" to the negative prompt, separated by a comma to mark them as two separate terms. While these terms have very similar meanings, I want to exclude both of them because I don't know which phrase the AI is more likely to associate with this image style. Let's look at the results when I generate the image with this negative prompt:

Not too bad… but what if I want to avoid having people appear in the image? Let's add the term "people" to the negative prompt as well and see what happens:

As you can see, even one word added to the positive or negative prompt can affect the results that Stable Diffusion gives us. Let's move on to the other parameters before we get too bogged down in this topic. Prompt writing really is an entire lesson unto itself.

*The controls for image parameters in Automatic1111's Web UI*

# Sampling Steps

The first slider you'll see in this web UI is labeled "Sampling Steps". Steps refer to how many passes the AI makes when taking visual noise and refining it into your desired image. **Think of it like layers of paint put down by a painter.** A painter will start with a wash that has no details at all; next, he will add large blocks and shapes of paint to get the overall design of the painting onto the canvas; once that dries, he can take a smaller brush and paint in the details of the piece.

Stable Diffusion operates in a similar fashion: when you give the AI a prompt to generate, it starts with nothing but a canvas of latent space. With each step, it puts another layer of "paint" down on that latent space, first with blurry blocks of color to define what goes where. With every extra step, the AI adds more and more detail to the image until it reaches the number of steps that you specified.
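For the script-minded, both the negative prompt and the step count are just extra arguments on the same pipeline call from the earlier sketch (still assuming diffusers; `pipe` is the pipeline created above, and the values are the ones used in this section):

```python
# Negative prompt and step count are additional arguments to the same call.
image = pipe(
    "A painting of a Kyoto cityscape by Satoshi Kon",
    negative_prompt="monochrome, black and white, people",  # things we do NOT want
    num_inference_steps=20,                                  # the "Sampling Steps" slider
).images[0]
image.save("kyoto_color.png")  # illustrative filename
```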
## Steps vs Quality

Now, this explanation may lead you to believe that more steps mean better pictures. But that's not always the case. At a certain point, adding more steps doesn't improve the quality of an image; **it can actually cause the image to start looking overworked.**

If we go back to the painter analogy, the painting he puts on canvas only needs so many layers before it forms a coherent image. If he keeps piling on layers of paint, the image will start to look messy. Let's say this hypothetical painter is making a landscape with pine trees. If you ask him to keep adding more and more detail after the landscape is already there, all he can do is add increasingly minute details to the canvas. If he spends long enough doing that, he might brush individual pine needles onto every tree in the whole forest… and lose his mind in the process!
Stable Diffusion works in a similar way: at a certain point, the noise becomes coherent enough to form a clear image. If you keep pushing for more steps beyond that point, the AI has no choice but to keep looking for tiny details to throw in here and there. Push too hard and it will start making up weird details just for the sake of adding more to the image.
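One easy way to see this for yourself is to render the same prompt at several step counts and compare the results. A sketch, again assuming the diffusers pipeline from earlier (seeds are covered later in this article, but fixing one here keeps the comparison apples-to-apples):

```python
import torch

prompt = "A painting of a Kyoto cityscape by Satoshi Kon"
for steps in (2, 20, 100):
    # Re-seed the generator each time so the only variable is the step count.
    generator = torch.Generator(device="cuda").manual_seed(2691516055)
    image = pipe(prompt, num_inference_steps=steps, generator=generator).images[0]
    image.save(f"kyoto_{steps}_steps.png")  # illustrative filenames
```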
## Goldilocks Zone of Denoising

Suffice it to say, there is a Goldilocks zone of denoising. The number of steps you should aim for will depend on your preferred subject matter and art style. We will discuss that topic further in another article. For beginners, I would recommend using 20 or 30 steps.

All three images below share the same settings — prompt: *A painting of a Kyoto cityscape by Satoshi Kon*, sampling method: Euler, size: 512 × 512 pixels, CFG Scale: 7, seed: 2691516055 — only the step count changes:

| Steps | Time to Make |
|---|---|
| 2 | 0.81 seconds |
| 20 | 3.84 seconds |
| 100 | 25.37 seconds |

**As you can see from this comparison, there are diminishing returns when increasing steps.** The difference between 2 and 20 steps is much greater than the difference between 20 and 100. The generation time also goes up with every additional step you add. Keep that in mind if you have limited CPU or GPU resources.

# Sampling Method

Just below Sampling Steps, you will find several selection bubbles labeled "Sampling Method". I was not a mathematics major, so I can't explain exactly how these operate. But, in layman's terms, this image generation process works by giving a computer images that have been reduced to noise and teaching it how to recreate the original image. There are fancy and confusing mathematical equations involved in accomplishing this.
**In short, each Sampling Method is just a different approach to solving the equations that make image generation possible.** As a result, each method will give you a final image based on your prompt, but with small nuances of difference, because each one arrives at the end result in a slightly different way.

*Prompt: analog style photo of a chocolate cake sitting on a countertop in a country kitchen, detailed, realistic, by Wes Anderson (initial size was 768×512)*

I liken this to baking a chocolate cake. You and your friend start with the same ingredients, but you each follow your own set of directions that vary slightly. Both of you will end up with a chocolate cake, but the way in which you mixed and baked yours will make it somewhat distinct from the one your friend finished. Sampling Methods are like different techniques for baking the same chocolate cake.
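In a scripted workflow, choosing a sampling method corresponds to swapping the pipeline's scheduler. A sketch under the same diffusers assumption (the two Euler schedulers below are the library's rough counterparts to the Web UI's "Euler" and "Euler a" options):

```python
from diffusers import EulerDiscreteScheduler, EulerAncestralDiscreteScheduler

# Roughly "Euler" in the Web UI:
pipe.scheduler = EulerDiscreteScheduler.from_config(pipe.scheduler.config)

# Roughly "Euler a" (ancestral) in the Web UI:
pipe.scheduler = EulerAncestralDiscreteScheduler.from_config(pipe.scheduler.config)

# Same prompt and seed with a different scheduler -> a subtly different image.
```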
In Stable Diffusion, there are a variety of sampling methods available. We'll discuss their differences at length in another article. For now, I would recommend starting with **Euler** or **Euler a** as you begin your image generation journey.

# Image Size

Next, we see the Width and Height parameters. These are quite self-explanatory: with these sliders, you control the size of your output image. It sounds simple enough, but keep these two points in mind when changing the output size:
1. **Divisible Numbers.** Please note that, when you drag the sliders around, the numbers on the right change in increments of 64. This is quite important, as all images generated by Stable Diffusion must have dimensions divisible by 64. If you enter a custom non-divisible size (for example, a video-sized thumbnail at 1920×1080 pixels) and hit generate, you will get an error instead of an image. So if you're after a specific image size, generate an image with the closest aspect ratio and crop it to your desired size in a photo editor later.
2. **Size Limits.** Just as importantly, you need to understand that the image size the AI can generate depends on the hardware capabilities of your GPU. The larger the image, the more pixels involved and, as a result, the more time and processing power it takes to make the image. If you choose a height and/or width that is too big for your GPU to process, you will get an error instead of an image. Thankfully, there are many AI upscaling solutions you can use to increase the resolution of your creations later. We'll talk about that in another guide.

The **aspect ratio** of your image will not only impact the processing time: it can also change the way Stable Diffusion composes the image. I've seen it happen many times now where a widescreen aspect ratio, when paired with a human portrait prompt, generates odd results or duplicated faces because the AI is trying to fill in the extra space. As a general rule of thumb, stick to square or vertical sizes for better portrait results.

# Batch Count & Batch Size

Batches refer to how many images you want Stable Diffusion to generate in one go with your current prompt and settings. It's like putting multiple pans of cookies in the oven. **Batch count is how many cookie sheets you put in the oven, while batch size is how many cookies (i.e. images) are on each sheet.**

*Can you tell I was hungry when I wrote this article? Prompt: analog style photograph of a diner, a plate of cookies and a cup of coffee on a table, detailed, realistic, evening light, by Wes Anderson (initial size was 768×512)*

Just keep in mind that the more images you ask Stable Diffusion to generate in one sitting, the longer your wait time will be. I would recommend starting with small batch sizes, between one and four images, as you get a feel for which prompts give you what results.
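Both the size and batch controls also map to pipeline arguments in a scripted setup. A sketch, still assuming the diffusers pipeline from earlier (the `round_to_64` helper is a hypothetical convenience, not part of any library):

```python
def round_to_64(n: int) -> int:
    """Hypothetical helper: snap a dimension to the nearest multiple of 64."""
    return max(64, round(n / 64) * 64)

width, height = round_to_64(768), round_to_64(512)

# `num_images_per_prompt` plays the role of batch size;
# running the call in a loop plays the role of batch count.
batch_count = 2
for i in range(batch_count):
    images = pipe(
        "A painting of a Kyoto cityscape by Satoshi Kon",
        width=width,
        height=height,
        num_images_per_prompt=4,  # batch size
    ).images
    for j, image in enumerate(images):
        image.save(f"batch{i}_{j}.png")  # illustrative filenames
```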
# CFG Scale

**CFG Scale** stands for "Classifier Free Guidance Scale". That sounds overly technical, but this slider simply controls how closely the AI follows your prompt when generating an image. The higher you set the scale, the more strictly Stable Diffusion interprets your prompt; the lower you set it, the more creative it gets in making the output image. Let's look at two extreme examples while building on our initial practice prompt.

Here are the results of *A painting of a Kyoto cityscape by Satoshi Kon* with a low CFG Scale (at 1.5, the AI gets a lot of freedom with this prompt) and then a high one (at 28, the AI tries to capture exactly what the prompt says):

| Prompt | CFG Scale |
|---|---|
| A painting of a Kyoto cityscape by Satoshi Kon | 1.5 |
| A painting of a Kyoto cityscape by Satoshi Kon | 28 |

As you can see, an extreme in either direction can make the results look a bit… odd. A good middle ground is to leave the CFG Scale between about 7 and 12. For txt2img, I rarely tweak the CFG unless the AI is throwing a lot of extra stuff into the image that doesn't belong. Here is another generation with the CFG Scale set to 7.5, giving a more comprehensible but still creative result:

# Seed

A **seed** is a string of numbers that identifies each individual image the AI generates. Every image that comes out of Stable Diffusion has a seed number. But seeds don't end there. **If you like the visual style of a specific image, you can re-use that same seed number with a different prompt to generate an image with similar visual characteristics to the original image from which that seed was pulled.**

Let's say I really liked the look of our very first image and I want to use different prompts with a similar aesthetic. I will first click on the green recycle symbol next to the seed box. This will show the seed of whatever image we last generated. Next, I will copy and paste the seed from our original generation into this box in place of the one it's currently showing. (Stable Diffusion saves output images with the seed number in the file name by default, so to retrieve that seed I go to my output folder and copy it from the image's file name.)
For this example, my seed is: 2323820377
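In script form (assuming the same diffusers pipeline), pinning a seed just means passing a seeded random generator alongside the other settings; `guidance_scale` below is the CFG Scale slider we just covered:

```python
import torch

# Reproduce (or riff on) the original image by pinning its seed.
generator = torch.Generator(device="cuda").manual_seed(2323820377)
image = pipe(
    "A painting of a Kyoto cityscape by Satoshi Kon",
    num_inference_steps=20,
    guidance_scale=7.5,      # the CFG Scale slider
    generator=generator,     # the Seed box
).images[0]
image.save("kyoto_seeded.png")  # illustrative filename
```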
With that seed now locked in, let's change our prompt a bit. I will change the subject matter from "a Kyoto cityscape" to "Los Angeles". Here are the results side by side for comparison:

| | Original | Same seed, new subject |
|---|---|---|
| Prompt | A painting of a Kyoto cityscape by Satoshi Kon | A painting of Los Angeles by Satoshi Kon |
| Steps | 20 | 20 |
| Sampler | Euler a | Euler a |
| Seed | 2323820377 | 2323820377 |
| CFG Scale | 7.5 | 7.5 |

Now that we've covered the basic features of Stable Diffusion's txt2img technology, let's look at some of the features specific to this Web interface (again, I'm using Automatic1111's Web UI). These include:
- Restore Faces
- Tiling
- Highres Fix

## Restore Faces

When checked, Restore Faces instructs Stable Diffusion to apply an additional algorithm to the generation process that is designed to improve the appearance of human faces. In the Web UI settings, you can specify which algorithm you want to use: Codeformer or GFPGAN. Codeformer is a model that was quite literally designed to restore damaged faces in old photos; GFPGAN is an AI model (a GAN model, to be exact) that does the same thing in a slightly different way.
Generally, both models are good but differ in the way they handle eyes. Codeformer adds a bit more highlight or sparkle to the eyes for a photorealistic look, while GFPGAN can produce more cartoon- or anime-style eyes. In my personal experience so far, I usually prefer the results from Codeformer.
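If you're curious what the Web UI does under the hood when that box is checked, face restoration is essentially a post-processing pass over the finished image. A rough sketch using the standalone gfpgan package (the weights path and filenames are assumptions — you'd download the model separately — and exact constructor arguments can vary between GFPGAN releases):

```python
import cv2
from gfpgan import GFPGANer

# Load the GFPGAN restorer; the weights file below is an assumed local download.
restorer = GFPGANer(model_path="GFPGANv1.4.pth", upscale=1)

# Run restoration as a post-processing pass on a finished generation.
img = cv2.imread("portrait.png", cv2.IMREAD_COLOR)
_, _, restored = restorer.enhance(
    img, has_aligned=False, only_center_face=False, paste_back=True
)
cv2.imwrite("portrait_restored.png", restored)
```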
Below is a side-by-side-by-side comparison of three portraits with the same seed: first with no face restoration, then with GFPGAN, and then with Codeformer. All three use the prompt *A photo portrait of a beautiful young Dutch woman by Marta Bevacqua, detailed, realistic, 50mm lens*, 20 steps, the Euler sampler, seed 1973059559, and a CFG Scale of 7.5:

| Portrait | Face Restoration |
|---|---|
| 1 | None |
| 2 | GFPGAN at 0.15 weight |
| 3 | Codeformer at 0.15 weight |

Here is another portrait example, same side-by-side comparison. These use the prompt *A close-up photo portrait of a handsome young American man in a park by Marta Bevacqua, detailed, realistic, 50mm lens*, 20 steps, the Euler sampler, seed 1364391388, and a CFG Scale of 7.5:

| Portrait | Face Restoration |
|---|---|
| 1 | None |
| 2 | GFPGAN at 0.15 weight |