You have a picture in your head — a moody landscape, a professional headshot, a whimsical illustration for a project — and you type a few words into an AI image generator. What comes back looks nothing like what you imagined. It is generic, oddly proportioned, or just off in a way that is hard to articulate. It's frustrating, especially when other people seem to be producing stunning visuals with the same tools.
The short answer: AI image generators do not read your mind. They read your words. The gap between what you imagined and what appeared on screen is almost always a prompting gap — and closing it is simpler than you might think. A good image prompt gives the AI the same kind of visual direction you would give a human artist: what to draw, how it should look, and what mood to create.
Why Your AI Images Look Wrong
AI image generators — like those built into ChatGPT, Midjourney, and Leonardo — are trained on billions of images paired with text descriptions. When you type "a cat," the AI averages everything in its training data about cats and produces something statistically probable — which usually means generic and forgettable.
Think of it this way. If you walked into a design studio and said "make me a picture of a cat," the designer would have questions. What kind of cat? What style — photorealistic, watercolour, comic? What is the mood? Without answers, they would default to something safe and bland. AI works the same way. Vague prompts produce vague images. The more specific your description, the more the output matches what you actually see in your head.
The Five Building Blocks of a Good Image Prompt
Every strong image prompt contains some combination of these five elements — not a rigid formula, but a set of ingredients that, combined, produce dramatically better results.
Subject
This is the what — the main focus of your image. Specificity makes an enormous difference here.
- Vague: "a dog"
- Specific: "a golden retriever puppy with muddy paws"
The more precise your subject, the less the AI fills in by default. "A woman" gives the AI almost nothing. "A woman in her 60s with silver hair and reading glasses, wearing a linen shirt" gives it a character.
Style
Style tells the AI how to render the image. Without a style direction, most generators default to a generic photorealistic look — which may not be what you want at all. Some keywords to explore: watercolour, oil painting, pencil sketch, digital illustration, 35mm film photography, anime, flat vector art, vintage poster, cinematic still.
In our experience, style is the single building block that makes the biggest visual difference. Adding "watercolour style" or "cinematic photograph" to an otherwise identical prompt produces completely different results.
Composition
Composition describes the camera angle, framing, and spatial arrangement. This is the element most beginners skip — and it is often the reason an image feels "off" even when the subject and style are right. Useful terms: close-up, wide shot, bird's-eye view, viewed from below, centred, rule of thirds, shallow depth of field, negative space on the left.
If you are not sure what composition to choose, "close-up" and "wide shot" are reliable starting points that most generators handle well.
Lighting
Lighting sets the mood. A portrait in harsh midday sun feels entirely different from the same portrait in soft golden-hour light. AI generators respond strongly to lighting descriptions, and even a short phrase changes the atmosphere dramatically. Try: soft morning light, dramatic side lighting, warm golden hour, neon glow, overcast diffused light, candlelight, studio lighting with a dark background.
Details
Details are the finishing touches — colour palette, texture, atmosphere, or specific elements that complete the scene. Not essential for every prompt, but they take an image from "good" to "exactly what I wanted."
Examples: "muted earth tones," "raindrops on the window," "steam rising from a coffee cup," "autumn leaves scattered on the ground," "bokeh lights in the background."
Here is what all five look like together. Instead of "a cat," we might write: "a ginger tabby cat sitting on a windowsill, watercolour style, soft morning light, viewed from slightly below, with rain streaking down the glass behind it." That single sentence gives the AI a subject, style, lighting, composition, and atmospheric detail. The result will be worlds apart from "a cat."
Prompt Examples That Actually Work
Here are four common use cases, each with a vague prompt and an improved version you can copy and adapt.
Portrait
Vague: "a professional headshot"
Improved: "A professional headshot of a man in his 40s with short dark hair and a warm smile, wearing a navy blazer over a white shirt. Studio lighting with a soft grey background, shot at eye level with shallow depth of field. Clean, modern, corporate style."
Landscape
Vague: "a mountain scene"
Improved: "A misty mountain valley at dawn, with a winding river reflecting pale pink and gold light. Viewed from an elevated ridge, wide shot. Photorealistic style, soft diffused lighting, muted greens and blues with warm highlights on the peaks."
Product Shot
Vague: "a coffee mug on a table"
Improved: "A handmade ceramic coffee mug in warm terracotta, filled with black coffee, sitting on a weathered oak table. Close-up, slightly overhead angle. Soft natural window light from the left, with steam rising gently. Minimal background, muted earth tones, 35mm film photography style."
Illustration
Vague: "a person working at a computer"
Improved: "A flat vector illustration of a young woman with curly hair working at a laptop in a cosy home office. Warm colour palette — soft oranges, creams, and olive greens. Plants on the desk, a cat curled up nearby. Overhead view, clean lines, minimal detail, modern editorial illustration style."
Each improved prompt is only two to three sentences, but it addresses most of the five building blocks. The specificity does not make the prompt harder to write — it just means you spend 30 seconds thinking about what you actually want before you type.
🧠 Quick Challenge: Which of these two prompts will produce a more useful AI image?
- A) "A woman sitting in a café, watercolour style, bright colours"
- B) "A woman in her 30s with short dark hair sitting in a Parisian café, watercolour style with visible brushstrokes, soft morning light from a nearby window, close-up, muted pastels with a warm amber accent"
Answer: B) Prompt B covers all five building blocks — subject (woman in her 30s, short dark hair), style (watercolour with visible brushstrokes), composition (close-up), lighting (soft morning light from a window), and details (muted pastels, amber accent, Parisian café). Prompt A has a subject and style but leaves composition, lighting, and details entirely to the AI's defaults.
Common Mistakes (And Quick Fixes)
If your images still are not landing, one of these is usually the culprit.
Too many subjects
"A dog and a cat and a bird and a fish in a garden with a fountain and a castle in the background" overwhelms the generator. AI image tools handle one or two clear subjects far better than a crowded scene. Focus on one main subject and let the background support it.
Contradictory styles
"A photorealistic watercolour painting" sends the AI in two directions at once. Choose one style and commit. If you want a blend, be specific — "a watercolour painting with photorealistic facial details" gives the AI a clear priority.
Forgetting the aspect ratio
Most generators default to a square format, but a landscape scene crammed into a square often looks awkward. Check whether your tool supports aspect ratios — 16:9 for landscapes, 9:16 for portraits, 3:2 for a balanced frame. This one adjustment fixes a surprising number of composition problems. The OpenAI image generation documentation covers supported sizes, and Midjourney's parameter list includes aspect ratio flags.
Not iterating
Our first AI images were, to put it kindly, not great. The breakthrough was realising that the first result is rarely the final one. Treat each generation as a draft — look at what worked, adjust what did not, and regenerate. Most impressive AI images you see online went through three to five iterations. Getting comfortable with iteration is the single biggest skill in AI image creation.
The five building blocks — subject, style, composition, lighting, details — are not a formula to memorise. They are a way of thinking about visual description that becomes instinctive with a bit of practice.
The images in your head deserve to make it onto the screen the way you imagined them. Start with one prompt today, iterate once or twice, and see what happens. If you want to go deeper, our Prompt Library has ready-made image prompts you can adapt, and the full guide to writing AI prompts that actually work covers the principles behind every type of prompt — not just images. For broader starting points, learning AI from scratch and creating something new with AI are both worth exploring.
You have everything you need to start creating images that match what you see in your head. The rest is practice — and honestly, it is the fun kind.
Ready to try it? Browse the Prompt Library for image prompt templates you can copy, customise, and use right away.



