Question 1

Why two layers? Isn’t the AI smart enough on its own?

Accepted Answer

Vision models are good at semantic reads (emotion, composition, psychology) but unreliable at measurements. Asking Claude "what’s the contrast ratio of this text?" gets a confident guess. Layer 1 measures the things that should be measured: pixel standard deviation for contrast, Haar-cascade face detection, OCR for text coverage, WCAG luminance ratios for readability, and k-means for dominant colors. Layer 2 then judges what only judgment can judge: does the emotion match the topic, does the composition lead the eye, does the thumbnail stand out against the actual niche feed? The split is the whole point.
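As an illustration of why this kind of number belongs in Layer 1, here is a minimal sketch of the WCAG contrast-ratio formula in plain Python. It follows the relative-luminance definition published in WCAG 2.x; the function names are ours for illustration, and how Thumbnail IQ samples the text and background colors from the image is not shown and is not described in this FAQ.

```python
def _linearize(channel_8bit: int) -> float:
    """Convert one 8-bit sRGB channel to linear light, per the WCAG 2.x definition."""
    c = channel_8bit / 255.0
    return c / 12.92 if c <= 0.03928 else ((c + 0.055) / 1.055) ** 2.4


def relative_luminance(rgb: tuple[int, int, int]) -> float:
    """Relative luminance of an sRGB color (0.0 for black, 1.0 for white)."""
    r, g, b = (_linearize(v) for v in rgb)
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


def contrast_ratio(fg: tuple[int, int, int], bg: tuple[int, int, int]) -> float:
    """WCAG contrast ratio between two colors, from 1.0 (identical) to 21.0."""
    l1, l2 = relative_luminance(fg), relative_luminance(bg)
    lighter, darker = max(l1, l2), min(l1, l2)
    return (lighter + 0.05) / (darker + 0.05)


if __name__ == "__main__":
    # White text on a black background is the maximum possible contrast.
    print(round(contrast_ratio((255, 255, 255), (0, 0, 0)), 2))
```

The result is deterministic for a given pair of colors, which is exactly what a vision model’s estimate is not.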

Question 2

How is the niche benchmark built? Where does the comparison come from?

Accepted Answer

For your keyword + format + size bracket we fetch the top 50 YouTube videos via the official Data API, filter for above-median view velocity, last-12-months only, >10K views, format match (tutorial / listicle / story / comparison / revelation), and channel-size bracket match (nano <10K, micro <100K, mid <1M, macro 1M+). The top 10 by velocity become your benchmark pool. We run Layer 1 on each of their thumbnails and average the metrics. The pool is cached for 30 days and shared across users on the same niche, so your run isn’t paying for someone else’s benchmark build.
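The filtering steps above can be sketched roughly as follows. This is an illustration under stated assumptions, not the production pipeline: the `Video` record, the `view_velocity` definition (views per day since publish), and the choice to take the median over the already-filtered set are all our assumptions, since the FAQ does not spell them out. Only the thresholds and bracket boundaries come from the answer above.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class Video:
    # Hypothetical record; field names are illustrative, not the real schema.
    video_id: str
    views: int
    age_days: int
    fmt: str            # tutorial / listicle / story / comparison / revelation
    subscribers: int


def size_bracket(subscribers: int) -> str:
    """Bracket boundaries as stated in the answer above."""
    if subscribers < 10_000:
        return "nano"
    if subscribers < 100_000:
        return "micro"
    if subscribers < 1_000_000:
        return "mid"
    return "macro"


def view_velocity(video: Video) -> float:
    """Assumed definition: average views per day since publish."""
    return video.views / max(video.age_days, 1)


def build_pool(candidates: list[Video], fmt: str, bracket: str,
               pool_size: int = 10) -> list[Video]:
    """Apply the stated filters, then keep the fastest videos as the pool."""
    eligible = [
        v for v in candidates
        if v.age_days <= 365                      # last 12 months only
        and v.views > 10_000                      # >10K views
        and v.fmt == fmt                          # format match
        and size_bracket(v.subscribers) == bracket
    ]
    if not eligible:
        return []
    # Assumption: "above-median" is measured within the eligible set.
    cutoff = median(view_velocity(v) for v in eligible)
    fast = [v for v in eligible if view_velocity(v) > cutoff]
    return sorted(fast, key=view_velocity, reverse=True)[:pool_size]
```

In this sketch `candidates` stands in for the 50 videos fetched from the Data API; the fetch and the 30-day cache are left out.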

Question 3

What does "channel size bracket" do?

Accepted Answer

Comparing a 5K-sub thumbnail against MrBeast’s feed is useless. The benchmark pool only includes top performers in your size bracket (nano / micro / mid / macro), so the score reflects what actually wins among the channels viewers see alongside yours. A 78 on Thumbnail IQ for a nano channel means the thumbnail beats the average top-performing nano thumbnail in your niche, which is a target you can genuinely hit.

Question 4

How accurate is the face detection? Can it tell emotion?

Accepted Answer

Layer 1 uses OpenCV’s Haar cascade for detection (presence, count, position, coverage percentage). Detection is reliable for forward-facing faces; it misses heavy profile shots and partial faces. Emotion is a Layer 2 read. Claude vision describes the specific emotion ("intense focus", "barely-suppressed laugh") and judges whether it’s readable at 200px. If Layer 1 misses your face but Layer 2 sees it, the vision score still credits you; nothing is double-penalized.
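To make the four Layer 1 face outputs concrete, here is a small sketch of how they could be derived once detection has run. It assumes the boxes arrive as `(x, y, w, h)` pixel rectangles, which is the form OpenCV’s `CascadeClassifier.detectMultiScale` returns; the `face_metrics` helper, its field names, and the choice to report the largest face’s center as "position" are hypothetical, not the product’s actual output schema.

```python
def face_metrics(boxes, image_w: int, image_h: int) -> dict:
    """Summarize detected face boxes as presence, count, position, and coverage.

    boxes: iterable of (x, y, w, h) rectangles in pixels.
    """
    boxes = list(boxes)
    if not boxes:
        return {"present": False, "count": 0,
                "coverage_pct": 0.0, "largest_center": None}

    # Sum of box areas; overlapping boxes would be double-counted here,
    # so the result is capped at 100%.
    face_area = sum(w * h for _, _, w, h in boxes)
    coverage = min(100.0, 100.0 * face_area / (image_w * image_h))

    # Position of the most prominent face, normalized to the 0..1 range.
    x, y, w, h = max(boxes, key=lambda b: b[2] * b[3])
    center = ((x + w / 2) / image_w, (y + h / 2) / image_h)

    return {"present": True, "count": len(boxes),
            "coverage_pct": coverage, "largest_center": center}


if __name__ == "__main__":
    # One face filling the top-left quarter of a 1280x720 thumbnail.
    print(face_metrics([(0, 0, 640, 360)], 1280, 720))
```

An empty list of boxes is a normal outcome for profile shots and partial faces, which is why the emotion read is left to Layer 2.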

Question 5

What if my thumbnail has no text?

Accepted Answer

Text presence scores zero in Layer 1. Layer 2’s text-psychology dimension also scores zero, unless the visual is exceptionally strong, in which case Claude is allowed to flag the missing text as an intentional choice (some niches, like ASMR or cinematic vlogs, win without text). The combined score will still come out reasonable if the rest of the thumbnail compensates. We don’t hand back "ADD TEXT" as the universal fix; the suggestion is contextual to your niche.

Question 6

Can I score a thumbnail before I publish the video?

Accepted Answer

Yes, that’s the primary use case. Upload the image, paste your draft title, pick the keyword you’re targeting. The studio runs both layers, compares against the niche pool, returns the score and the per-dimension fixes. Iterate, re-upload, score again. Every version is tracked in the history panel so you can see exactly which change moved the score, and by how much.

Question 7

Do you compare thumbnails to the same competitors my SEO Studio analyzes?

Accepted Answer

Often, yes. Both surfaces use YouTube’s niche-search results as the source of truth for "who’s winning here". The benchmark pool for thumbnails additionally filters by channel-size bracket and format, so the comparison set is sharper than what SEO Studio uses for title rewrites. If you’ve linked a video idea from competitor research, Thumbnail IQ explicitly references the competitor gap that idea exploits and judges whether your thumbnail can win against those exact channels.

Question 8

How does the percentile work?

Accepted Answer

We look at every Thumbnail IQ analysis run on your same keyword + format + size bracket (across all users, since most niches have multiple creators using the tool) and compare their algorithm scores with yours. Your percentile is the share of those analyses that scored below yours. A 78/100 might be 92nd percentile in some niches and 60th in others, so the percentile is what tells you whether your number is competitive. New niches with no peers yet show 50th percentile by default until enough data accumulates.
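A minimal sketch of that percentile rule, assuming the peer scores are already collected for the niche. The `min_peers` parameter is hypothetical: the FAQ only says the default applies "until enough data accumulates" and does not give the actual threshold.

```python
def percentile(your_score: float, peer_scores: list[float],
               min_peers: int = 1) -> int:
    """Percentage of peer analyses that scored strictly below yours.

    Falls back to 50 when there are fewer than `min_peers` peer analyses
    (the threshold is an assumption; the real cutoff is not documented).
    """
    if len(peer_scores) < min_peers:
        return 50
    below = sum(1 for s in peer_scores if s < your_score)
    return round(100 * below / len(peer_scores))


if __name__ == "__main__":
    # The same 78/100 lands very differently depending on the niche.
    print(percentile(78, [50, 60, 70, 80]))   # weaker niche
    print(percentile(78, [75, 80, 85, 90]))   # stronger niche
    print(percentile(78, []))                 # no peers yet
```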

Question 9

Will Thumbnail IQ work for Shorts thumbnails?

Accepted Answer

Layer 1 works the same, because pixel measurements don’t care about the platform. Layer 2 currently judges against the standard 16:9 long-form benchmark pool, so feed-distinctiveness scoring for vertical Shorts thumbnails is approximate. Shorts get less play from the thumbnail itself (most plays start before the thumbnail loads), so this is intentionally not the top priority right now. If your Shorts thumbnails are critical to your funnel, email support and we’ll prioritize the Shorts pool build.

Question 10

How long does an analysis take, and what does it cost?

Accepted Answer

An analysis takes ~20–35 seconds end-to-end on a fresh niche (Layer 1 on your image, fetch + Layer 1 on the benchmark thumbnails if the pool isn’t cached, then the Layer 2 vision call). Cached niches return in ~10–15 seconds. The free tier gets 1 thumbnail analysis per cycle; paid plans charge one credit per run (Solo 20, Growth 50, Agency 150 pooled). Re-uploading a revised version of the same thumbnail charges a new credit because we re-run both layers from scratch.

Question 11

Are my thumbnails stored? Can other users see them?

Accepted Answer

Your uploaded thumbnail is stored on our infrastructure so the analysis can rehydrate when you reopen it later, and so the version-history panel can compare iterations. It is never shown to other users and never used as benchmark data for other channels. The benchmark pool only ever contains public thumbnails from the YouTube API, that is, from videos that are already published and ranking. You can permanently clear an upload from the analysis history at any time.

Question 12

What does "feed distinctiveness" measure?

Accepted Answer

It’s the highest-impact Layer 2 dimension. We show Claude your thumbnail alongside the actual top 3 benchmark thumbnails (by view velocity) for your exact niche, format, and size bracket, and ask: would this stand out, blend in, or disappear in that feed? The score is anchored to the visual context a real viewer would see your thumbnail in, which is the only honest way to judge "click-worthiness". Generic best-practice advice can’t do this.

The only thumbnail score that compares against your real niche feed.

One number that fuses pixel measurements with niche-aware judgement.

We measure what should be measured. And judge what should be judged.

Compared against the channels you’ll really be next to.

From upload to scored verdict in under 30 seconds.

Seven distinct output blocks. Every one is fixable.

Open-source CV + Sonnet 4.6 vision. Public data only.

How many thumbnail scores you get each month.

The scoring engine, answered honestly.