>
section 5 of 224 min read

5. The CMOS Inverter, Closely Examined

The CMOS inverter is the hydrogen atom of digital design. Master it and the rest follows.

5.1 DC transfer characteristic

Sweep VinV_{in} from 0 to VDDV_{DD} and measure VoutV_{out}:

plaintext
   V_out
    ^
 V_DD ─┐
       │\
       │ \
       │  \
       │   \
       │    \
       │     \
       │      \________
  GND ─┘──────────── V_in
       0    V_M    V_DD

The S-shaped curve has five regions:

  1. Vin<VtnV_{in} < V_{tn}: NMOS off, PMOS in linear. VoutVDDV_{out} \approx V_{DD}.
  2. Vtn<Vin<VMV_{tn} < V_{in} < V_M: NMOS in saturation, PMOS in linear. VoutV_{out} dropping but still high.
  3. VinVMV_{in} \approx V_M: both saturated. Output transitions sharply.
  4. VM<Vin<VDDVtpV_M < V_{in} < V_{DD} - |V_{tp}|: NMOS linear, PMOS saturation. VoutV_{out} low.
  5. Vin>VDDVtpV_{in} > V_{DD} - |V_{tp}|: PMOS off, NMOS linear. Vout0V_{out} \approx 0.

5.2 Switching threshold VMV_M

The switching threshold VMV_M is where Vin=VoutV_{in} = V_{out}. Setting NMOS and PMOS currents equal in saturation:

VM=Vtn+r(VDDVtp)1+r,r=μpCox(W/L)pμnCox(W/L)nV_M = \frac{V_{tn} + r(V_{DD} - |V_{tp}|)}{1 + r}, \quad r = \sqrt{\frac{\mu_p C_{ox} (W/L)_p}{\mu_n C_{ox} (W/L)_n}}

Set VM=VDD/2V_M = V_{DD}/2 and you find that you need PMOS to be ~2 to 2.5x wider than NMOS to compensate for the lower hole mobility. This is the classic "2:1 P:N ratio" rule of thumb.

5.3 Noise margins

The high noise margin is how much noise on a 1-input the inverter can absorb without flipping its output:

NMH=VOHVIH,NML=VILVOLNM_H = V_{OH} - V_{IH}, \qquad NM_L = V_{IL} - V_{OL}

where VILV_{IL} and VIHV_{IH} are the input voltages at which the gain of the inverter equals -1. For a balanced CMOS inverter at VDD=1.0V_{DD} = 1.0 V, both noise margins are typically around 0.4 V. Compare this to nMOS, where VOLV_{OL} does not even reach 0; CMOS has clean 0/1 levels and that translates to far better noise immunity.

5.4 Propagation delay

The propagation delay is the time from a 50 percent input transition to a 50 percent output transition. For an inverter driving a load CLC_L:

tpHLCLVDD/2IDS,n,tpLHCLVDD/2IDS,pt_{pHL} \approx \frac{C_L V_{DD}/2}{I_{DS,n}}, \qquad t_{pLH} \approx \frac{C_L V_{DD}/2}{I_{DS,p}}

Average propagation delay tp=(tpHL+tpLH)/2t_p = (t_{pHL} + t_{pLH})/2. The "IDSI_{DS}" here is an effective drive current that designers approximate by integrating the actual saturation/linear behavior, but the scaling intuition is:

  • Larger CLC_L → slower.
  • Larger VDDV_{DD} → faster (but quadratic dynamic power penalty).
  • Larger W/LW/L (stronger drive) → faster, but you also load the previous stage more.

5.5 Dynamic power, derived

When a CMOS gate's output rises from 0 to VDDV_{DD}, energy from the supply moves through the PMOS into the load capacitance CLC_L. The total energy drawn from the supply is Erise=CLVDD2E_{rise} = C_L V_{DD}^2, of which half (12CLVDD2\frac{1}{2}C_L V_{DD}^2) ends up stored on the capacitor and half is dissipated in the PMOS resistance.

When the output falls back to 0, the stored energy 12CLVDD2\frac{1}{2}C_L V_{DD}^2 on the capacitor dissipates through the NMOS. The supply does not sink the energy back; it just sat there during the fall.

So per full cycle (rise + fall) the energy dissipated is CLVDD2C_L V_{DD}^2. If the gate switches at frequency ff, the dynamic power is

Pdyn=αCLVDD2fP_{dyn} = \alpha C_L V_{DD}^2 f

where α\alpha is the switching activity factor, the probability that the gate switches per clock cycle (typical values are 0.1 to 0.3). This is the most important formula in digital power. The quadratic dependence on VDDV_{DD} is why lowering supply voltage is the single biggest lever for low-power design. The linear dependence on ff is why we clock-gate unused parts of a chip.

5.6 Short-circuit power

During the brief transition window when both NMOS and PMOS are partially on, a short-circuit current spikes from VDDV_{DD} to GND. This is typically 5 to 10 percent of dynamic power for normal slew rates; it grows if input transitions are slow. Buffer chains mitigate this by keeping slews fast.

5.7 Leakage power

Pleak=VDDIleakP_{leak} = V_{DD} I_{leak}, with IleakI_{leak} dominated by subthreshold conduction at modern nodes. On a 7 nm chip this can be tens of percent of total power even when "idle."

5.8 Putting it together

Ptotal=αCLVDD2f  +  tscfVDDIpeak  +  VDDIleakP_{total} = \alpha C_L V_{DD}^2 f \;+\; t_{sc}f V_{DD} I_{peak} \;+\; V_{DD} I_{leak}

The total power on a real chip is the sum across millions of gates of these three terms. Most of the action in low-power design is making each term as small as possible without breaking timing.