Intro

Energy/power is important for datacenters, which are important for AI, which is important. I want to learn all about power generation and deployment. Here are a bunch of notes on power, largely from a conversation with Owen.

Power 101

Different sources and consumers of power have different characteristics, including their power factor and inertia.

Real vs. Reactive Power (Power Factor)

Consumers of power have different power factors, defined as the ratio of real to total (real + reactive) power consumed.

Things like fridges have lower power factors, because they consume more reactive power (smth like inductors / capacitors and motors. Things like heaters have higher power factors.

Generation usually has a high power factor.

Changes in reactive power impact voltage stability.

Aside: Physics

Real power $P = V I cos (ϕ)$ while reactive power $Q = V I sin (ϕ)$ . Where $V$ is voltage, $I$ is current, and $ϕ$ is the phase shift between the two.

So, for small phase shifts, which is what I expect, the level of reactive power is roughly linear with the phase shift. And it doesn’t seem to depend on the level of voltage or the level of current.

Grid Inertia

To generate power, coal plants, for example, will rev up one of these absolute behemoths. This is the thing that creates the current when it rotates (with a coil).

So the concept of inertia is pretty intuitive, this giant cylinder literally has inertia—it’s difficult to change the rate at which it spins by very much.

People, on aggregate, usually have a very stable, difficult to change demand: high-inertia.

Coal and natural gas plants’ power generation is also high-inertia, the cylinder is not stopping.

Low-inertia generation are things like solar and wind. Nuclear is stable (high inertia).

The grid wants high inertia demand and high inertia supply, and to minimize the difference between supply and demand.

Changes in grid inertia affect frequency stability.

Why minimize the difference between supply and demand?

If there’s a sudden drop in demand, there is momentarily more power generated than being used, and since the energy must go somewhere, those giant spinners literally end up spinning faster. Then you have to adjust how much mechanical input you’re putting into them, and hope that it stabilizes. And for that moment when they’re spinning faster, the frequency jumps from 60Hz to like 60.05 Hz!

High inertia sources can deal with this well, low inertia sources (usually) cannot. [TODO double-check]

Analogy time

For the sake of the analogy, imagine this buff dude was doing the exercise all wrong, with the ropes in sync.

TODO fill in the rest of the analogy.

Inverter-based resources

Owen showed me this visual:

To which I asked “Why are batteries having to wait to get on the grid? Shouldn’t the grid be wanting more batteries which can help alleviate any differences in supply and demand?”

The answer is partially that batteries are inverter-based.

Inverter based resources include solar, wind, batteries, and anything that doesn’t immediately generate power at 60Hz or consistently spin at 3600 rpm.

These are bad at dispatching reactive power, and dispatch agnostic to grid frequency.

Also, adding storage would likely push the majority of the power generation stack to the right, and can push lower-inertia generation sources off the stack entirely.

Economic Factors

Congestion Pricing

Congestion pricing-esque dynamics make datacenters increase energy prices more than you might naively expect.

The price of power also includes the price of transporting energy to you, so if you are like 400 miles down the line, and a 1GW datacenter opens up 500 miles down the line, then the power plant has to ship that GW to you as well, and you end up paying for part of that burden. This is on top of simple supply vs demand incentives. And part of a more complex network, of course.

Datacenters

Characteristics

Datacenters are also low-inertia demand, see this from Semianalysis. Basically, both within a training run and in general, datacenters may quickly go from requesting nothing to hundreds of MW and vice versa.

Some folks, e.g. Southwest Power Pool (SPP) have started selling lower-tier power, which is essentially the first to get shut out when the grid needs to cut power to prevent a blackout.

Incentives and Dynamics

There might be an incentive for the datacenters to want to be the kind of customers of power that the grid wants to sell to (i.e. stable demand). So, they might not just optimize for max FLOP or token or best training run, they might also optimize for stability of demand.

If the grid goes out dips a little bit, as a datacenter owner I might want to switch to backups until I’m sure we’re good. But this is a huge imbalance for the grid which has just lost a ton of demand without also losing generation.

US - China

Total power generation looks like this:

Two things to consider here:

Generation v. Generation Capacity: each one actually generates a bit under what it needs.

Generation Capacity v. Datacenter generation capacity: each one only allocates some portion of its generation to AI datacenters. So the relevant number to be watching out for is more like “total generation minus population and other non-datacenter consumption”. With any luck, Chinese electricity generation will keep going to the non-AI industrial centers and the Chinese population itself. (Realistically, it seems that the proportion towards industry in general will be higher in China, and become dominated by datacenters.)

.arunim.fyi

Table of Contents

energy