Outlier  ›  learn

Does ChatGPT use a data center? Yes — here's what doesn't.

Quick answer
  • Yes. Every ChatGPT, Claude, and Gemini prompt travels over the network to a building full of accelerator chips, gets answered there, and travels back.
  • The app on your phone or laptop is a thin window. The actual model, the part that thinks, never runs on your device.
  • That building drinks water and pulls power, and the bill lands on a real county. Answering prompts (inference) is roughly 80–90% of AI compute.
  • An on-device model is the exception: it answers on the chip in front of you, no building in the loop, works with Wi-Fi off.

When you type a question into ChatGPT, nothing on your phone answers it. The glowing cursor is a courtesy. Your words leave the device, cross the internet, and arrive at a warehouse (usually one you'll never see, in a county you've probably never visited) where rack after rack of accelerator chips does the actual thinking. Then the answer makes the trip back. So yes: ChatGPT uses a data center. Every single prompt does. The interesting part is what happens when one doesn't.

The short answer

ChatGPT, Claude, Gemini: all of them run in data centers, and the apps on your devices are essentially remote controls. A large language model is too big and too compute-hungry to run on a phone, so the heavy lifting is done on specialized hardware in a facility the provider owns or rents. You send text; their servers send back text. Without a connection to that facility, the chatbot has nothing to say, which is why "is ChatGPT down?" trends the moment a region of one of those buildings hiccups.

The path a single prompt takes

It feels instant, so it's easy to imagine the answer comes from somewhere nearby. It doesn't. Here's the round trip for one message, in plain terms:

  1. Your device. You hit send. The app packages your text and ships it off; the device itself does almost no work.
  2. The network. Your prompt hops across the internet (your router, your ISP, the backbone) to the provider's front door, often hundreds of miles away.
  3. The load balancer. At the data center, a traffic cop decides which server should take your request, steering it toward hardware that's free.
  4. The GPU rack. Here's where the thinking happens. A row of accelerators loads the model's weights and runs the math that turns your words into the next words. This is the expensive step, in money, in electricity, in heat.
  5. The trip back. The generated answer retraces the route, server to network to your screen, and streams in token by token, which is why you watch it type.

Five stops, four of them outside your control, for a sentence that looked like it came from your laptop. Multiply that by the hundreds of millions of prompts these services field daily and you get a sense of why the buildings keep getting bigger.

What that building costs

A data center is not an abstraction. It's a physical thing that sits on land, draws power off a grid, and runs water through cooling systems to keep thousands of chips from cooking themselves. Those costs are real, local, and increasingly contested.

The water is the part people are noticing first. The Guardian reported on June 7, 2026 that data centers had consumed 264 billion gallons of water as drought gripped nearly 63% of the US, and a day later, on June 8, that a majority of the country's new AI data centers are slated to be built on drought-hit land. By June 2, the framing had gone mainstream: one headline read that Americans don't quite know how to fight AI, so they're fighting data centers instead. Erin Brockovich launched a national tracking map for them in late May. This isn't fringe anymore; it's a county-commission agenda item.

The scale behind the worry is documented. UC Riverside's "Making AI Less Thirsty" research estimated that training GPT-3 alone evaporated roughly 700,000 liters of water, and that a stretch of a few dozen queries can run through about a 500ml bottle. Google's own 2024 Environmental Report put the company's 2023 water use north of 5 billion gallons. And it keeps growing: the IEA's "Energy and AI" report projects that data-center electricity demand will more than double by 2030.

Here's the detail most people miss. It's tempting to assume the big cost is training the models — the one-time, headline-grabbing megaproject. But the steady drain is inference: the everyday business of answering prompts. According to MIT Technology Review's 2025 reporting, inference is on the order of 80–90% of AI compute. So that round trip you just read about isn't a rounding error against training — it's where most of the demand actually lives. Every casual question feeds it.

Receipts: Water and drought figures from The Guardian (June 7 and June 8, 2026) and reporting on community pushback (June 2, 2026); Erin Brockovich's national data-center map (May 27, 2026). Water-use scale from UC Riverside's "Making AI Less Thirsty" and Google's 2024 Environmental Report. Electricity projection from the IEA "Energy and AI" report. Inference share (≈80–90% of AI compute) from MIT Technology Review, 2025.

The version that never leaves your desk

Now the contrast, because there is one, and it's not theoretical. A model can also run entirely on the chip already inside your computer. No round trip, no warehouse, no load balancer. You send a prompt and your own hardware answers it, the same way a video editor renders without phoning a server.

That's what Outlier is: a Mac-native app where the model files sit on your disk and inference happens on your Apple Silicon chip. There's no account, no terminal, no Docker, just one signed download. Because nothing leaves the device, it works with Wi-Fi off, on a plane, in a basement. The cost of answering a prompt isn't a county's water table; it's the electricity already running to your wall socket, the same as opening any other app.

The trade-off is honest: a local model on a laptop isn't going to outrun a rack of accelerators on raw speed, and the absolute frontier of reasoning still lives in those big buildings. But for the great majority of daily questions (drafting, summarizing, coding, explaining) the answer that arrives from the chip in front of you is plenty, and it never had to travel anywhere to get to you. A patent-pending paged inference engine even runs models larger than the Mac's RAM, so a 397-billion-parameter model can answer on a 64 GB machine. The point isn't that the local model is smarter. It's that you already own the building it runs in. It's your laptop.

Frequently asked questions

Does ChatGPT work offline?

No. ChatGPT can't answer anything without a connection, because the model that does the answering lives in a data center, not on your phone or laptop. The app on your device is a thin window; turn off the network and it has nothing to talk to. An on-device model is the opposite: it runs locally, so it still works in airplane mode.

Can any AI run without a data center?

Yes. Smaller open-weight models run entirely on your own machine, doing all the computation on your chip with no server round-trip. Outlier is a Mac app built around this: the model files sit on your disk and answer locally, so it works with Wi-Fi off and nothing about your prompt leaves the device. See private AI with no cloud for how that's set up.

Where is my ChatGPT data processed?

In the provider's data centers. Your prompt is sent over the internet to one of their facilities, processed on accelerator hardware there, and the answer is sent back. The provider controls those servers, sees the request in transit, and decides how long anything is retained. A local model processes the prompt on your own hardware, so there's no third-party server in the path at all. More on the cloud route in where ChatGPT conversations go.

Try Outlier free

Free Nano + Lite — local, private, no account. Pro $20/mo or $149/yr adds everything (all 7 model tiers incl. Plus 397B). Lifetime Pro from $99 (Founding 200, first 200 seats) or $200 (Founders 500). Apple Silicon only.

Download for Mac