The Power of Subtext
How to write what isn't said — the meaning that lives beneath the dialogue.
I printed out my first feature script and read it with a highlighter. Yellow for dialogue, pink for action lines. Twelve pages in, I put the highlighters down. The page was almost entirely yellow. I had written a screenplay that was, functionally, a radio play with occasional stage directions. There was almost no visual storytelling — just people talking to each other, at length, about everything they felt and thought and wanted. I had written a hundred conversations and called it a movie.
That realization was expensive in time but cheap in terms of what it cost to fix: the problem with visual storytelling isn't learning a set of difficult techniques. It's unlearning the instinct to reach for dialogue first. Film is a visual medium. Your job as a screenwriter is to think in images before you think in words — and most of us have to actively train ourselves to do it.
Writing dialogue feels like writing. Words appear. Characters speak. The page fills up. Writing visual action lines requires you to think in spatial terms — what's in the frame, what's moving, what the image means — and that takes more cognitive effort, especially in a first draft.
There's also a fear underneath it, I think. Dialogue feels safe because it's explicit. You can write "I'm angry at you" and know the emotion is in the script. Visual storytelling requires trusting that an actor's hands, a slammed cabinet, a held gaze will carry something that you can't literally verify on the page. That trust is hard to develop. But it's the core muscle of writing for the screen.
The instinct I try to follow now: before I write a line of dialogue, I ask whether an image or action could carry the same weight. Often it can, and often it carries it better.
The most reliable visual storytelling tool available to a screenwriter is character behavior. What a person does — not what they say about themselves — is what reveals who they are. And behavior is inherently visual.
Instead of a line of dialogue establishing that a character is meticulous, show them straightening the objects on a stranger's desk while waiting for a meeting. Instead of a character announcing their anxiety, show them checking the lock three times before leaving. Instead of dialogue establishing that a marriage has gone cold, show the couple at breakfast moving around each other with the practiced efficiency of people who have stopped noticing each other. The avoidance is the story.
This is the difference between a character being described and a character being revealed. Description tells an audience what to think about someone. Behavior asks them to form their own conclusion, and the conclusions they form themselves are the ones they believe. The screenwriter's job is to set up the behavior and trust the audience to read it.
Here's a test: find a scene in your draft where a character announces how they feel in dialogue. Now ask: is there a physical behavior — something they could do, something they could choose or refuse, something in the room they could interact with — that would communicate the same thing without a word? If yes, that behavior is almost always more powerful than the line.
One of the most powerful tools in a visual storyteller's kit is the recurring image — the visual motif that means something small at first and means something enormous by the end. An empty chair that's present three times at different points in the story. A window that opens and closes. An object that passes between characters as the balance of power shifts.
The motif works because visual meaning accumulates. The first time the audience sees the empty chair, it's just a chair. The second time, it starts to carry weight. The third time, in the right context, it can deliver an emotional payload that a page of dialogue couldn't match — because the audience has been carrying the meaning themselves, building it, waiting for it to matter.
This is one of the things the power of subtext comes down to at the visual level: images can carry subtext better than words can, because images don't announce themselves. A character saying "I miss you" is a statement. An empty chair in the frame where they used to sit is something the audience feels without being told to feel it.
Planting a visual motif doesn't require directing the camera — you just need to describe the same detail appearing in different contexts across the script, letting the writer in the room see the pattern. The director and production designer will find their way to the image. Your job is to put it in the story.
Cinema's greatest moments often contain no dialogue at all. The image does the work — and on the page, the screenwriter's description sets that image up.
Not every visual moment needs to be symbolic or loaded with motif. Much of visual storytelling is simpler than that: it's just choosing action over explanation. A character making a decision and immediately acting on it tells us more than a character deliberating out loud about their options. The screen rewards doing over discussing.
This becomes especially important in exposition-heavy scenes — the scenes where the story needs to deliver information. When I find myself writing characters explaining things to each other, I now ask: can this information come out of action instead of speech? Can the audience infer it from what happens rather than what's said?
A character opening their refrigerator and staring at the nearly empty shelves tells the audience about their financial situation more economically — and more viscerally — than any line of dialogue could. A character arriving at a party and scanning faces before smiling tells us they're looking for someone specific. The image does double work: it delivers information and it shows character simultaneously.
This is exactly the medicine for dialogue that's carrying too much exposition. Before cutting or restructuring dialogue, ask whether the information could be moved from speech to action. The answer is often yes, and the action version almost always reads faster.
There's a particular way that bad action writing sabotages visual storytelling. Dense, over-specified action blocks — four paragraphs describing the exact layout of a room before anything happens in it — kill the reading pace and bury the visual story in words.
Good visual action writing is lean and specific. It describes what matters and cuts everything else. "The kitchen is a mess" tells us something. "Three days of dishes" tells us more, more efficiently. The key is choosing the single most telling detail over the comprehensive inventory. A screenwriter who describes a character's apartment in three lines of specific detail creates a clearer image than one who devotes twelve lines to cataloguing the furniture.
The other thing lean action writing does is create white space — and white space controls reading pace in ways that word count can't. A single line of action followed by a single line of dialogue reads fast. A dense paragraph reads slow. That rhythm is part of pacing at the sentence level, and it affects how the script feels in the hand before the reader has even absorbed what's on the page.
The exercise that did the most for my visual sense was the audio-off pass. I went through my draft and asked: if you could watch this scene with no sound — no dialogue, no music — what would the audience understand? What would they feel?
For some scenes, the answer was surprisingly rich: the performances, the space, the behavior carried most of the meaning even without words. For others — most of them, early in my learning — the answer was "almost nothing." If the scene required sound to exist, I had written a radio play scene, not a film scene. That scene needed more visual life before the dialogue could earn its place.
Run this pass on five scenes from your current draft. The ones that survive the audio-off test are already doing some visual work. The ones that go silent and empty are the ones to rebuild from the inside, starting with the behavior and the physical world of the scene before writing a single line of dialogue.
An audience experiencing cinema is seeing before they're hearing. Write for the eye first — let dialogue earn its place by carrying what the image can't.
If I could give a writer only one tool for strengthening their visual storytelling, it would be a question to ask at the start of every scene: what do these characters do with their bodies while they talk?
Scenes where characters sit and exchange dialogue are the hardest scenes to write well and the easiest to write badly. The slightest weakness in the lines becomes exposed without anything else happening. But give those same characters something physical to be doing — cooking, packing, building something, taking something apart — and the scene gains texture, subtext, and rhythm almost automatically. The physical activity gives the dialogue a counterpoint, and the interplay between what's being said and what's being done is where visual storytelling lives at its most practical level.
This is also why strong scene structure and visual storytelling are connected — a well-built scene almost always has a physical life because the conflict and obstacle tend to manifest in the physical world. Characters pursuing goals and running into obstacles are usually doing something with their bodies. That physical doing is your visual story.
When I finally made the shift — when writing visually became the instinct rather than the afterthought — the unexpected result wasn't that my scripts had less dialogue. It's that my dialogue got better. When images were doing the heavy work, the lines that remained had a specific job: to carry what an image genuinely couldn't. The dialogue became more selective, more charged, more surprising — because it wasn't being asked to do everything.
The discipline of asking "what would this look like?" before asking "what would this character say?" doesn't produce wordless scripts. It produces scripts where every word earns its place because the images around it are doing their share of the work. That's the version of visual storytelling worth chasing — not silence for its own sake, but a story told by two fully functioning languages, each doing what it does best.