ExplainerApril 28, 2026·5 min read

The Blind Spot in Website Monitoring: Why Your Site Can Be Down While Appearing Up

CDN cache poisoning creates a monitoring blind spot where websites appear healthy to uptime checkers while users experience complete failures.

Website monitoring services like ours check if your site is up by sending HTTP requests and measuring response times. It's straightforward—we get a 200 status code, we mark you green. But there's a category of failures that slip through this net entirely: CDN cache poisoning attacks and misconfigurations that serve stale or malicious content to real users while your monitoring probes see exactly what they should. The irony is that your site technically responds correctly to our checks, yet your users are experiencing a broken product. This gap between what monitors see and what users experience is widening.

How CDNs Hide the Real Problem from Monitoring

When a CDN like Cloudflare, Akamai, or Fastly caches your content, it sits between your origin server and users. Monitoring probes often hit the CDN edge directly—and if the cache is fresh, they get a fast, valid response. But if your origin has gone down or is serving errors, the CDN may continue serving stale cached versions to users for hours or days. The probe sees HTTP 200. Users see a 404 or a broken checkout flow from last week's cached state. The disconnect happens because most monitoring tools don't distinguish between 'fresh content from origin' and 'stale content from cache.' They just see a successful response time.

The Non-Obvious Problem: Cache Headers as Silent Killers

Here's what most people miss: misconfigured Cache-Control headers can poison your CDN without any attack. Set `Cache-Control: max-age=86400` on an error page by accident, and your monitoring will report green while users see a 500 error for a full day. We've tracked cases where a single misconfigured origin response—often a 502 Bad Gateway—gets cached at the edge with a long TTL because the header wasn't explicit about not caching errors. The CDN dutifully serves it, monitoring probes validate it, and your support team gets flooded with reports of a 'mysterious outage' that doesn't show up in any uptime dashboard. Your monitoring tool will never catch this because it's seeing the cached lie, not the broken truth.

Why Traditional Monitoring Can't See This Gap

Most uptime monitors send a single request from a single geographic location and call it done. They don't check cache headers, they don't validate that the response is actually fresh, and they don't compare what the CDN serves versus what the origin serves. Some advanced tools will probe multiple regions, but even that misses the real issue: they're not checking if the cached content is stale. A truly robust monitoring setup would need to either bypass the CDN entirely (difficult if your DNS points to CDN) or inspect cache metadata and compare origin freshness. Few platforms do this. It's expensive and complex, which is why the gap persists.

What You Should Do Starting Monday

First, audit your Cache-Control headers. Make sure error pages (4xx, 5xx) explicitly include `no-cache` or `max-age=0`. Second, configure your CDN to respect origin cache headers strictly—don't let it override them with defaults. Third, set up a separate health check that deliberately bypasses your CDN (point a probe directly at your origin IP or use a private health check endpoint). Fourth, monitor cache hit ratios and stale-while-revalidate behavior. If your hit ratio suddenly spikes or your stale content is being served longer than expected, that's a signal something's wrong. Finally, when you use a monitoring service like ours, ask them if they check cache headers and origin freshness. If they don't, you're flying blind.

The Real Cost of This Blind Spot

We've seen companies lose hours of revenue because their monitoring reported 99.9% uptime while users experienced complete service degradation. The gap between 'the CDN says it's fine' and 'users are furious' is where trust breaks down. Your status page stays green. Your incident response team stays calm. Your customers leave. The technical fix is straightforward—validate origin freshness, not just CDN response codes—but it requires acknowledging that traditional monitoring has a fundamental gap. The companies that will win at reliability aren't the ones with the most probes; they're the ones that measure what users actually experience, not what caches claim to serve.

Check if a website is down right now

Free real-time server check — results in seconds. No sign-up required.

Or set up automated uptime monitoring →
Check a website
← Older
Why ChatGPT Goes Down So Often (And When It's Coming Back)