A Simplified Deep Dive into How CDNs Work


Ever clicked on a website and it just… appeared? Like, instantly. No loading, no lag, just there. That’s not magic. That’s infrastructure. Specifically, a CDN. You’ve probably never thought about one, and that’s kind of the point. Content Delivery Networks are meant to be invisible. Quietly working in the background so your music plays, your memes load, and your checkout button doesn’t freeze mid-purchase.

But behind that speed is a system that’s surprisingly layered. Almost like a relay team handing off the baton — except the baton is your data, and the runners are servers scattered across the globe.

CDNs are fascinating not just because they work, but because they scale. Whether it’s serving cat videos to a kid in Manila or pushing real-time financial data to a trader in London, CDNs are involved. And while I’m not particularly prone to awe when it comes to tech, there’s something weirdly elegant about a system built mostly to shave off milliseconds and avoid global chaos.

So, let’s talk about how it all fits together.


The CDN Hierarchy: Not All Servers Are Created Equal

You might picture the internet as this endless, flat mesh of connections. In reality, it’s more like a hierarchy. Think: local, regional, then central. A sort of pyramid, but with data instead of pharaohs.

(Figure: Architectural Overview of the CDN)

At the outer edge, there are the edge servers. These are the ones closest to you geographically. If you’re in Berlin, there’s probably one somewhere in the city. Maybe even inside a nondescript gray building you walk past every day. These machines are built to respond fast and keep traffic from bouncing all the way to some central server across an ocean.

Next up are regional servers. They handle a bunch of edge servers and act like middle managers — not glamorous, but critical. If your nearby edge server doesn’t have what you’re looking for, the regional one might.

Then there’s the origin server, the central archive of truth. It’s where the actual data lives — or rather, sleeps. You only bother it when no one else in the chain can help. And trust me, no one wants to wake up the origin server unless they have to. That thing’s busy.

Why bother with all this layering? Simple. It’s about speed, load balancing, and redundancy. Instead of 10,000 people hammering one poor server in San Francisco, the requests get spread out. And if one location crashes or slows down, another nearby one quietly picks up the slack.

It’s like how a decent pizza chain operates. One central kitchen wouldn’t survive Friday night. You need local branches. And if one runs out of mozzarella, another steps in. Same principle. Just less cheese.
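To make the “another one picks up the slack” idea concrete, here’s a minimal sketch in TypeScript. The server names, distances, and the health flag are all invented for illustration; real CDNs feed this decision with live telemetry, but the shape of the logic is roughly this:

```typescript
// A minimal sketch of failover: candidate servers are ranked by distance to
// the user, and the closest healthy one wins. Fields and names are made up.
interface EdgeServer {
  id: string;
  distanceKm: number;   // rough distance from the requesting user
  healthy: boolean;     // e.g. updated by a periodic heartbeat check
}

function pickServer(candidates: EdgeServer[]): EdgeServer | undefined {
  // Sort by proximity, then take the closest server that is still up.
  return [...candidates]
    .sort((a, b) => a.distanceKm - b.distanceKm)
    .find((server) => server.healthy);
}

// If the Berlin edge is down, the request quietly lands in Frankfurt instead.
const chosen = pickServer([
  { id: "edge-berlin", distanceKm: 5, healthy: false },
  { id: "edge-frankfurt", distanceKm: 420, healthy: true },
  { id: "origin-us-west", distanceKm: 9100, healthy: true },
]);
console.log(chosen?.id); // "edge-frankfurt"
```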


DNS Knows Where You Live (Sort of)

Okay, now we know there’s a network of servers. But how does your computer know which one to talk to?

That’s DNS doing its thing. When you type a URL like domain.com, your device doesn’t know where that is. So it asks the Domain Name System, kind of like the internet’s phonebook. But smarter. And nosier.

Here’s the clever part: CDNs have their own DNS that’s geo-aware. So when you ask for a site, it goes “Oh hey, you’re in Buenos Aires? Here’s an edge server just a few kilometers away.” The trick is that it figures this out by looking at your IP address, which gives a decent guess at your location.

(Figure: DNS-Based Request Routing)
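As a toy illustration of that geo-aware lookup, the logic boils down to something like the sketch below. The IP prefixes and edge addresses are invented (the edges use documentation-range IPs), and real resolvers lean on proper GeoIP databases rather than a hard-coded table:

```typescript
// A rough sketch of geo-aware DNS resolution: guess the client's region from
// its IP, then answer with the nearest edge. The tables are hypothetical.
const regionByIpPrefix: Record<string, string> = {
  "181.": "south-america",   // e.g. a Buenos Aires ISP
  "85.": "europe",
  "1.": "asia-pacific",
};

const edgeByRegion: Record<string, string> = {
  "south-america": "203.0.113.10",  // documentation-range IPs, not real edges
  "europe": "203.0.113.20",
  "asia-pacific": "203.0.113.30",
};

function resolve(hostname: string, clientIp: string): string {
  // Take the first octet of the client's IP as a crude location guess.
  const prefix = clientIp.slice(0, clientIp.indexOf(".") + 1);
  const region = regionByIpPrefix[prefix] ?? "europe"; // arbitrary fallback
  return edgeByRegion[region];
}

console.log(resolve("domain.com", "181.44.7.12")); // "203.0.113.10"
```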

It’s not perfect. VPNs, corporate firewalls, or just weird routing decisions can throw it off. But for most people, it works fine.

There’s also this thing called Anycast. Multiple servers share the same IP address, and the internet just routes you to whichever is closest, in a network-y sort of way. It’s kind of like when you shout into a cave and the nearest bat answers.

Netflix, for example, doesn’t want you waiting for episodes to buffer. So they use geo-aware routing with Amazon Route 53 to make sure you’re always connected to the fastest node possible. Not glamorous. Just efficient.


When It Works Like a Charm: The Cache Hit

You click a link, and boom, the page loads. That’s a cache hit. It means the content was already stored on a nearby server, ready to go. No need to hunt it down.

CDNs store stuff like images, JavaScript files, and sometimes even entire HTML pages. Each piece of content has a TTL, or Time-To-Live. After that, it expires. Think of it like yogurt in your fridge. You can eat it after the date, but things get questionable.

(Figure: Cache Hit Scenario)
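The hit-or-miss decision is basically “do I have it, and is it still fresh?” Here’s a minimal sketch of that check; the field names are mine, not any particular CDN’s API:

```typescript
// A minimal sketch of the cache-hit check: an entry is only usable while
// its TTL hasn't expired.
interface CacheEntry {
  body: string;
  storedAt: number;  // epoch milliseconds
  ttlMs: number;     // how long the entry stays fresh
}

const cache = new Map<string, CacheEntry>();

function lookup(url: string): string | null {
  const entry = cache.get(url);
  if (!entry) return null;             // nothing stored: miss
  const age = Date.now() - entry.storedAt;
  if (age > entry.ttlMs) {             // past its date: treat as a miss
    cache.delete(url);
    return null;
  }
  return entry.body;                   // still fresh: hit
}

cache.set("/logo.png", { body: "<bytes>", storedAt: Date.now(), ttlMs: 60_000 });
console.log(lookup("/logo.png") !== null); // true, for the next 60 seconds
```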

The best part of cache hits? Speed. A cached file might take 50 milliseconds to load. The same file from an origin server could take 500. Multiply that by every file on a page, and you’re suddenly dealing with a sluggish, frustrating experience instead of a snappy one.

There’s also a big bandwidth savings here. Serving a 1GB video from an edge server 10,000 times means the origin doesn’t have to move that data again and again. That’s roughly 10TB of traffic the origin never has to push. It’s like asking your neighbor for sugar instead of driving to the grocery store every single time.


When It Doesn’t Work: The Cache Miss

Of course, not everything is cached. Sometimes you ask for something new — like a blog post just published, or a product image that’s never been loaded before.

This kicks off what I call the “fetch chain.” First, the edge server checks its stash. Nothing? It calls the regional server. Still nothing? Only then does it go all the way to the origin.

(Figure: Cache Miss Scenario)
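To make the chain concrete, here’s a minimal sketch. Each tier is reduced to an async lookup function (in reality these are HTTP hops between servers), and in practice each tier that misses usually stores the result on the way back so the next request is a hit:

```typescript
// A sketch of the fetch chain: edge first, then regional, then origin.
type Fetcher = (url: string) => Promise<string | null>;

async function fetchChain(
  url: string,
  edge: Fetcher,
  regional: Fetcher,
  origin: Fetcher
): Promise<string> {
  // Walk the tiers in order and stop at the first one that has the content.
  for (const tier of [edge, regional, origin]) {
    const result = await tier(url);
    if (result !== null) return result;
  }
  // Even the origin doesn't have it: a genuine 404.
  throw new Error(`Not found anywhere: ${url}`);
}
```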

The problem here is latency. That trip to the origin can add hundreds of milliseconds. Not catastrophic, but noticeable. And if a ton of users are all causing cache misses at the same time, that origin server starts sweating.

To avoid this, CDNs sometimes use prefetching. If they predict something will be requested soon — like a homepage during a product launch — they pre-load it. Smart. Some even use cache warming before big events, loading everything up before users arrive like prepping a buffet ahead of guests.
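Cache warming, stripped to its core, is just “request the hot URLs once before anyone else does.” A minimal sketch, where the URL list, the hostname, and the warmEdge helper are all hypothetical:

```typescript
// A sketch of cache warming: hit the hot URLs before launch so the edge
// already has them when real users show up.
async function warmEdge(urls: string[], edgeHost: string): Promise<void> {
  for (const path of urls) {
    // Each request forces the edge to fetch and store the content.
    await fetch(`https://${edgeHost}${path}`);
  }
}

// Run this from a deploy script an hour before the product launch.
await warmEdge(["/", "/launch", "/assets/hero.jpg"], "edge-berlin.example.net");
```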


Keeping Content Fresh Without Breaking the System

There’s a balancing act between speed and freshness. CDNs let you set expiration times (TTL), but what happens when you need something updated now?

That’s where invalidations come in. You can send a command to wipe specific content out of the cache. Like if you fixed a typo or changed a price on your store.

(Figure: Content Expiration and Invalidation)

The challenge is knowing when to invalidate. Too often, and you burn efficiency. Too rarely, and your users might see outdated or wrong content. Some CDNs offer APIs to automate this, which is nice, but it still requires thought. Computers are fast, but they’re not mind readers.

Real-world case: You update the price on a hot-selling item. TTL is set for 24 hours. If you don’t invalidate it manually, customers might keep seeing the old price. Cue confusion. And maybe angry emails.
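What that manual (or automated) invalidation might look like is a single purge call right after the price is saved. The endpoint, token, and paths below are hypothetical; every CDN exposes its own purge API, but the shape is usually close to this:

```typescript
// A sketch of an invalidation call after a price change. The purge endpoint
// and token are made up for illustration.
async function purge(paths: string[]): Promise<void> {
  await fetch("https://cdn.example.net/purge", {
    method: "POST",
    headers: {
      "Authorization": "Bearer <api-token>",
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ paths }),
  });
}

// Call it right after the new price is saved, so nobody sees the stale one
// for the rest of that 24-hour TTL.
await purge(["/products/hot-item", "/api/products/hot-item.json"]);
```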


Where It’s All Headed: CDNs Are Becoming Little Computers

What started as a smart way to move static files around is now turning into something else entirely.

CDNs are evolving into edge platforms. They’re not just delivering content — they’re processing it. Running code. Making decisions. Some let you run serverless functions right at the edge, which means you can customize responses, run authentication checks, or even filter images before they hit the browser.
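What “running code at the edge” looks like varies by platform, but the shape is roughly a small request handler like this sketch. It uses the standard web Request/Response types; the geo header is invented, since each platform exposes location information in its own way:

```typescript
// A sketch of a tiny edge function, in the style of serverless edge runtimes.
async function handleRequest(request: Request): Promise<Response> {
  const url = new URL(request.url);

  // Cheap auth check, done before the request ever reaches the origin.
  if (url.pathname.startsWith("/account") && !request.headers.get("Cookie")) {
    return new Response("Login required", { status: 401 });
  }

  // Customize the response per region without a round trip to the origin.
  const region = request.headers.get("X-Geo-Region") ?? "unknown"; // hypothetical header
  return new Response(`Hello from the edge near ${region}`, {
    headers: { "Content-Type": "text/plain" },
  });
}
```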

And this edge computing thing has bigger implications. Think about IoT devices. Or real-time translation. Or AI inference happening closer to users, not in some far-off cloud center.

In a weird way, CDNs are becoming mini data centers scattered all over the world. Quietly powerful. Still invisible.


So, Why Bother Knowing All This?

Look, you don’t need to understand CDNs to use the internet. They’re designed that way. But once you get a feel for how they work, a lot of things start making more sense. Like why some pages load faster than others. Or why a site might crash under traffic while another one cruises through a viral storm.

CDNs aren’t perfect. Routing decisions can fail. Caches can go stale. But they’ve made the modern internet tolerable. Even delightful, sometimes.

And honestly, that’s impressive. All this infrastructure, just so your favorite video doesn’t stutter. Pretty decent trade-off.

Maybe next time your site feels a bit slow, you’ll remember there’s a lot happening under the hood — and it might be time to give your CDN setup a second look. Or a first one.
