A deep read of Kallaway's I Studied 100 Viral Hooks, These 6 Will Make You Go Viral (21:56).
Why this analysis exists
This is one of the source layers behind the BuildOS hook-craft-short-form skill. The skill encodes the six archetypes and the four-component alignment as agent-runnable selection rules. This post is the long form: the catalog, the worked examples, and the universality principle.
Core thesis
The hook is the 80/20 of short-form video. All winning hooks are doing one job: opening a curiosity loop by creating contrast between what the viewer believes (A) and what the video introduces (B). The bigger the contrast, the deeper the curiosity, and you have ~5 seconds to install it.
Across thousands of videos, the winning patterns reduce to six hook archetypes. Any video idea can be expressed through any of the six — the choice is a strategic decision about which archetype best fits the key visual you have available, because the viewer's subconscious processes video in a visual → audio → visual sandwich.
The six archetypes
1. The Fortune Teller
Mechanism: Positions the present against the future. Triggers the brain's built-in compulsion to predict and prepare for what comes next.
Formula:
- A = current reality the viewer takes for granted.
- B = a future state implied or asserted by the video subject.
- Frame as a question or declaration teasing that future as inevitable or imminent.
Examples:
- "That new microwave is going to change cooking forever."
- "We're witnessing one of the largest breakthroughs in marketing."
- (Kallaway's 15M-view example) "I think this is the future of how people are going to design their homes."
Best fit: Breaking news, product innovations, anyone building authority around predicting where a category goes.
2. The Experimenter
Mechanism: Peer-to-peer demo. One friend showing another what they tried and what happened. The visual is the demo.
Formula:
- A = baseline pain or default method.
- B = the experiment / new tool / new method, shown live.
Examples:
- "Does anything look weird about this shot to you? I actually don't have a camera in my hands. I'm recording with these glasses on my face." (Colin & Samir Ray-Ban Meta — 50M views; the glasses' camera is the subject being filmed with, so visual and verbal hook self-prove.)
- "This right here is the brand-new ChatGPT Agent that literally controls your browser…"
Best fit: Product demos, new framework launches, B2B "watch our tool work," elaborate process content.
3. The Teacher
Mechanism: Same contrast spine as Experimenter, but framed teacher-to-student instead of peer-to-peer. The proof is extracted lessons, not a live experiment.
Formula:
- A = base pain point the viewer is failing at.
- B = a lesson or method that solves it.
Examples:
- "Three things you can learn from Aritzia."
- "Here's how I got this shot."
- "I made this Ecom brand a million dollars in 12 hours using something called the product drop method."
Best fit: Authority-building, expertise-as-product, anybody monetizing know-how.
4. The Magician
Mechanism: A visual stun gun. Not a contrast format on its own — a precursor or layer that forcibly stops the scroll using either (a) outlandish visuals, (b) directing language ("look at this", "check this out") that yanks the eye, or (c) a visual pacifier: a passive activity (makeup, cup-stacking, GTA gameplay split-screen) that holds enough subconscious attention to keep the viewer parked.
Formula: Find a visual, sound, or signature thing atypical enough to break the scroll, then attach one of the other five archetypes underneath.
Examples:
- "Yo, check this out."
- The neon-sign creator who opens as Trump (wig + voice), rips off the wig, then talks about neon signs.
- Carl Shakur's "snap-click" signature.
- Alex Earl doing makeup while talking; the cup-stacking creator; GTA gameplay split-screens.
Best fit: Always-on layer. Especially valuable for creators with weak topical b-roll.
Critical: Magician is a modifier, not a standalone choice. Without a contrast layer it is empty spectacle.
5. The Investigator
Mechanism: Contrast against an unknown secret. Curiosity loop = "today you don't know X; I show you X; now you know."
Formula:
- A = viewer doesn't know the thing.
- B = creator has unearthed the thing.
Examples:
- "This is one of the sneakier marketing campaigns I've seen — Central Cee's brand sells out in seconds, and what's crazy is that it's not just because he's a rapper."
- "Check this out — this is a secret Japanese city built at the base of Mount Fuji, but it's not just any city."
Best fit: Research-driven creators, "leading edge" positioning, anyone whose value is being early.
6. The Contrarian
Mechanism: The most direct contrast format. The creator immediately states a belief that opposes conventional wisdom. No hiding, no setup — A and B are stated explicitly.
Examples:
- "I think this is probably the most important thing that you need to be doing if you run a brand that you are probably not doing — and this is having one-on-one conversations with your customers."
- "You have no creative ideas because your space sucks to live in."
Best fit: Smarty-pants experts, anyone trying to differentiate in a crowded space.
Decision matrix
| Archetype | Contrast spine | Best when you have... | Risk if misused |
|---|---|---|---|
| Fortune Teller | Present → future | A real forward-looking claim and a "future-feeling" visual | Empty trend-chasing |
| Experimenter | Old method → new method, shown live | Live demo b-roll, a real test, a tool to show | Clout demos with no real result |
| Teacher | Failing at X → here's how to win at X | Hard-won lesson, slides, talking head | Becomes a lecture, kills retention |
| Magician | Layer/modifier — uses visual or verbal stun | Atypical visuals, signature objects, visual pacifiers | Spectacle without payoff |
| Investigator | Don't know → now you know | A genuine "secret" or under-noticed insight | Condescension if secret is well-known |
| Contrarian | Conventional wisdom → opposite belief | A real opposing POV with proof | Becomes a hot-takes account |
Universality principle
"Every single video idea could be manipulated to use any of the six hook formats."
Kallaway demonstrates with the same backpack-review video idea expressed six ways:
- Fortune Teller: "This backpack is going to completely change the way Millennials travel."
- Experimenter: "I just took this backpack 7,000 miles across the world but it has three major design flaws."
- Teacher: "If you're traveling internationally this summer, this is the best way to pack your backpack."
- Magician: Rapid match-cut of multiple backpacks + voiceover "Which of these backpacks is the best for a guy in their 20s?"
- Investigator: "I can't believe this company isn't marketing this one flaw in their bag."
- Contrarian: "Everybody loves this bag, but if you use it every day, it is so overhyped."
Implication: never pick the archetype from the topic. Pick it from the available visual + the specific contrast you can credibly make.
The 4 components of a winning hook
Every hook is four aligned signals:
- Spoken hook — what you say in the voiceover.
- Visual hook — what is on screen.
- Text hook — text overlay on the screen.
- Audio hook — music / SFX in the background.
"The difference between 500 and 500,000 views is unlocking max alignment between those four things."
Misalignment causes comprehension loss, which kills the curiosity loop before it forms.
The visual → audio → visual sandwich
Subconscious heat map of how viewers consume the first 5 seconds:
- Eyes process visual + read text overlay.
- Ears catch up and parse the spoken hook.
- Eyes return to the visual seeking confirmation of what they just heard.
"Our eyes process 10 to 100 times more information per second than our ears — speed of light versus speed of sound."
Therefore: the key visual is the load-bearing element of the hook, not the words. Words exist to set up and confirm the visual.
The 5-step hook-writing process
- Inventory the visual. What b-roll, demo footage, motion graphics, or stock do you actually have? If the visual is weak, decide whether to manufacture one or kill the video idea entirely.
- Find the biggest contrast. Of all the angles available, which produces the largest A → B gap? That contrast picks the archetype.
- Write the spoken hook. Two to four lines following the context-lean → contrast → contrarian snapback pattern.
- Layer the text overlay. Reinforce the visual on the way in and on the visual-confirmation rebound.
- Gut-check max comprehension. Watch silent → watch with audio → ask: when the viewer goes visual → audio → visual, do all four hook components agree? If not, stop and rewrite, or kill the idea.
The winning hook (15M views) — life-size floor plans
- Key visual: people walking on top of full-size projected floor plans in a warehouse.
- Inherent contrast: never been seen before — the visual alone is the contrast.
- Archetype chosen: Fortune Teller (over Investigator) because "future of home design" is more shocking than "secret nobody knows."
- Text overlay: "the future of home design" + "life-size floor plans" with an arrow pointing at the key visual.
- Spoken hook: "Check this out [Magician layer] — these are life-size floor plans, you can literally walk through your exact home design before you build it [context-lean, perfectly aligned with what's on screen]. I think this is the future of how people are going to design their homes [Fortune Teller payoff]."
- Why it worked: every component pointed at the same key visual.
The losing hook (~100K views) — generative world models
- Key visual: rooms appearing magically from a single image.
- Spoken hook: "This is the future of storytelling. It's called a generative world model. It lets you turn a single image into a photorealistic cinematic world."
- Why it failed:
- "Future of storytelling" is too abstract — the visual (rooms) doesn't visually mean "storytelling."
- "Generative world model" and "photorealistic cinematic world" are jargon — viewer can't comprehend in 5 seconds.
- Visual ≠ words → comprehension loss → no curiosity loop.
Anti-patterns
- Picking the archetype from the topic, not the visual.
- Magician without a contrast layer underneath.
- Jargon in the spoken hook.
- Abstract verbal claim vs. literal visual (e.g., "future of storytelling" + footage of rooms).
- Saving the strongest visual for after the hook.
- Making the video when the visual is weak. If you don't have a strong key visual, kill the video.
- Stacking the same archetype across the feed.
- Treating hook-writing as a creativity problem. It is a slot-fill alignment problem.
Notable quotes
"The biggest difference between winning and losing on social media is the hook. This is the 80/20 of video."
"The viewer starts the video believing one thing, and then we introduce an alternative. They believe A, we show them B, and the distance between A and B is contrast."
"The most important part of getting the hook to actually perform well is the visual. It's not what you say. It's the visual."
"If you don't have a good visual, you need to really think: should I make this video?"
How BuildOS uses this
This source informs how the hook-craft-short-form skill selects archetypes for BuildOS founder content:
- Contrarian is the natural shape for "thinking environment, not AI tool" positioning — Contrarian is the anti-AI strategy in hook form.
- Investigator carries the anti-feed and manufactured-virality content (research-driven observations).
- Teacher carries the context-engineering canon (workflow framework content).
- Experimenter carries BuildOS demos (live workflow comparisons).
- Magician is treated as a layer, not a standalone — pair with a recurring visual signature for solo founder content.
- Use sparingly: stacking Contrarian hooks turns the feed into a "hot takes" account; rotate archetypes to avoid burnout.
Related
- Skill:
hook-craft-short-form— uses the archetype catalog as the selection rubric. - Companion source analyses (the 4-video Kallaway hook cluster):
- Irresistible Hooks: Lessons from Kane Kallaway — the three-beat formula.
- Hooks Impossible to Skip: Lessons from Kane Kallaway — the four-mistake diagnostic.
- The 6 Words Every Hook Needs: Lessons from Kane Kallaway — the slot grammar.
- Source channel: Kallaway on YouTube.