When you send a WhatsApp message at 2am, nobody is awake to forward it. A server handles it automatically — and understanding how changes how you build everything.
What's actually happening here?
Your phone has a battery. It sleeps, it dies, you charge it. A server sits in a climate-controlled room, plugged into redundant power, running 24 hours a day, 365 days a year.
Its whole job is to wait for requests and respond to them. Every app you've ever used — Gmail, Instagram, Zomato — is really just your device talking to someone else's server.
The problem this solves
If Instagram had no servers, every photo ever uploaded would need to live on your phone. That's petabytes of storage, far more than any phone will ever hold.
Servers centralise storage, processing, and logic so your device only needs to show results — not compute them. Your device is the screen; the server is the brain.
How it really works (step by step)
The request-response cycle — the heartbeat of every app (a runnable sketch follows these four steps):

1. Client sends a request — your device packages what it wants (a page, a video, data) and sends it over the internet with your IP address attached.
2. Server receives and processes — it reads the request, queries a database if needed, runs some logic, and prepares a response.
3. Server sends the response — the data travels back to your device, usually in under 100ms for servers in the same country.
4. Client renders the result — your browser or app takes the response and turns it into what you actually see on screen.
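Here is a minimal sketch of all four steps on one machine, using only the Python standard library. The `EchoHandler` name and the `/feed` path are invented for illustration; it runs the server in a background thread and plays the client from the same script:

```python
# All four steps of the cycle on one machine. The server and client
# roles are exactly what Instagram's servers and your phone do, just smaller.
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, HTTPServer

class EchoHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        # Step 2: the server receives the request and runs its logic.
        body = f"You asked for {self.path}".encode()
        # Step 3: the server sends the response back.
        self.send_response(200)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

server = HTTPServer(("127.0.0.1", 8000), EchoHandler)
threading.Thread(target=server.serve_forever, daemon=True).start()

# Step 1: the client packages a request and sends it over the network.
with urllib.request.urlopen("http://127.0.0.1:8000/feed") as response:
    # Step 4: the client takes the response and renders it (here, prints it).
    print(response.read().decode())  # -> You asked for /feed

server.shutdown()
```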
The part most tutorials skip
Production systems typically separate four distinct types of servers, each with one primary job:
| Type | Job | Examples |
|---|---|---|
| Web server | Serves HTML/CSS/JS files to browsers | Nginx, Apache |
| Application server | Runs your business logic | Node.js, Django, Spring |
| Database server | Stores and retrieves data | Postgres, MySQL, MongoDB |
| Cache server | Keeps frequent answers in fast memory | Redis, Memcached |
Separating these four concerns is why big systems can scale — each layer grows independently without touching the others.
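As a hedged sketch of that separation, here is what the layers look like as plain Python functions. The names (`web_layer`, `app_layer`, `db_layer`) and the dict standing in for a real database are invented for illustration; the cache layer is sketched later, under the caching takeaway. The point is that each layer only talks to the layer below it, so each could move to its own machine:

```python
# Each concern as its own function. In production each would be a separate
# process on a separate machine, and these call boundaries would become
# network calls, but the separation is identical.
FAKE_DB = {"user:1": {"name": "Asha", "plan": "free"}}  # the database server's data

def db_layer(key):
    """Database server: the only code allowed to touch stored data."""
    return FAKE_DB[key]

def app_layer(user_id):
    """Application server: business logic; knows nothing about HTTP."""
    user = db_layer(f"user:{user_id}")
    return {"greeting": f"Hello, {user['name']}!", "plan": user["plan"]}

def web_layer(path):
    """Web server: translates an HTTP path into an app call and a response."""
    user_id = int(path.removeprefix("/users/"))
    return str(app_layer(user_id))

print(web_layer("/users/1"))  # -> {'greeting': 'Hello, Asha!', 'plan': 'free'}
```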
Real company doing this right now
YouTube serves 2.49 billion users using MySQL — a database most beginners learn in their first week. The magic isn't the technology; it's how they scale it.
They built a tool called Vitess that splits one MySQL database across thousands of servers, routing every query to the right shard automatically. The application code barely changed — they just added a smarter layer on top.
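Vitess itself is far more sophisticated (it routes through configurable "vindexes"), but the core idea of shard routing can be sketched in a few lines. The hash-modulo scheme and shard names below are simplifications for illustration, not Vitess's real algorithm:

```python
import hashlib

# Invented shard names; a real Vitess cluster has thousands of these.
SHARDS = ["mysql-shard-0", "mysql-shard-1", "mysql-shard-2", "mysql-shard-3"]

def pick_shard(user_id):
    """Route every query for one user to the same MySQL server."""
    digest = hashlib.md5(str(user_id).encode()).hexdigest()
    return SHARDS[int(digest, 16) % len(SHARDS)]

# The application still writes ordinary MySQL queries; the routing layer
# just decides which server actually runs each one.
print(pick_shard(42))  # always the same shard for user 42
print(pick_shard(43))  # probably a different shard
```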

The full cycle: Browser → Web server → App server → Database → response back to browser. Round trip under 100ms for same-country servers.

How a real production stack is split: Browser → Web server → App server → (DB server + Cache server). Each layer scales independently.
What breaks at scale?
A single server has a ceiling: fixed CPU cores, fixed RAM, fixed network bandwidth. Once you hit that ceiling, every extra user makes the whole thing slower — until it stops responding entirely.
This is why production apps use horizontal scaling: instead of one powerful server, run 10 medium ones behind a load balancer. When one crashes, the other 9 keep serving users without interruption.
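Here is a hedged sketch of what the load balancer does on every request: hand it to the next healthy server and skip the crashed one. Real balancers (Nginx, HAProxy, cloud load balancers) learn health from periodic check pings; the `DOWN` set below simulates a crash:

```python
import itertools

SERVERS = [f"app-server-{i}" for i in range(10)]
DOWN = {"app-server-3"}  # simulate one crashed server

def is_healthy(server):
    """Real balancers learn this from periodic health-check pings."""
    return server not in DOWN

rotation = itertools.cycle(SERVERS)

def route():
    """Round-robin: hand the request to the next healthy server."""
    for _ in range(len(SERVERS)):  # try each server at most once
        server = next(rotation)
        if is_healthy(server):
            return server
    raise RuntimeError("no healthy servers left")

for i in range(5):
    print(f"request {i} -> {route()}")  # app-server-3 never appears
```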
The "aha" moment
A server isn't special hardware — it's just a regular computer running software that listens for requests. You could technically run a server on an old laptop in your bedroom (and many great companies started exactly that way).
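You can prove this to yourself right now. This snippet (Python standard library, nothing to install) turns whatever machine runs it into a web server that serves the current folder to any browser on your network:

```python
# A "server" is just software listening on a port. Run this, then open
# http://localhost:8000 in a browser.
from http.server import SimpleHTTPRequestHandler, ThreadingHTTPServer

ThreadingHTTPServer(("0.0.0.0", 8000), SimpleHTTPRequestHandler).serve_forever()
```

The one-line terminal equivalent is `python3 -m http.server 8000`.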
Your practical takeaway
Separate concerns from day one — when you build your first backend, keep your web server, app logic, and database as separate layers. It feels like extra work at first but saves a painful refactor when traffic grows.
Cache before you scale — if a database query is making your app slow, the fix is almost never a faster server. Add Redis in front of that query and response time can drop from 200ms to 2ms.
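The 200ms-to-2ms claim is easy to reproduce with a hedged sketch of the cache-aside pattern: a dict stands in for Redis and `time.sleep` stands in for a slow database query. With real Redis you'd replace the dict with `get`/`set` calls on a `redis.Redis()` client, but the pattern is identical:

```python
import time

cache = {}  # stand-in for Redis: same get/set idea, in-process instead of networked

def slow_db_query(user_id):
    time.sleep(0.2)  # pretend this is a 200 ms database query
    return f"profile-for-user-{user_id}"

def get_profile(user_id):
    """Cache-aside: check the cache first, hit the database only on a miss."""
    if user_id in cache:
        return cache[user_id]  # microseconds, not milliseconds
    result = slow_db_query(user_id)
    cache[user_id] = result  # the next request skips the database
    return result

for attempt in ("cold", "warm"):
    start = time.perf_counter()
    get_profile(7)
    print(f"{attempt}: {(time.perf_counter() - start) * 1000:.1f} ms")
# cold: ~200 ms (database)   warm: well under 1 ms (cache)
```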
Use managed servers for your first project — platforms like Railway, Render, or Fly.io handle the OS and networking so you can focus on app logic. Move to raw servers only when you need that level of control.
