1.1 SSI, MSI, LSI, VLSI, ULSI
The integration density of an IC is measured in roughly how many gates fit on one chip, and the industry has used a shorthand for the decades:
| Era | Years | Gates per chip | Representative parts |
|---|---|---|---|
| SSI (Small Scale Integration) | 1960s | 1 to 10 | 7400-series TTL, 4000-series CMOS, individual op-amps |
| MSI (Medium Scale Integration) | early 1970s | 10 to 100 | 4-bit counters, decoders, small adders |
| LSI (Large Scale Integration) | mid-1970s to early 1980s | 100 to ~10,000 | 8080, 6502, 8086, early calculator chips |
| VLSI (Very Large Scale Integration) | early 1980s onward | 10,000 to 10 million | 80386, original Pentium, ARM7TDMI |
| ULSI (Ultra Large Scale Integration) | sometimes used post-1990s | over 10 million | Modern CPUs, GPUs, SoCs |
| GSI (Giga Scale Integration) | rarely used | over 1 billion | Apple M-series, NVIDIA GH100 |
The boundaries are arbitrary marketing labels. Engineers today say "VLSI design" to mean any serious chip-design work, regardless of transistor count, just as people still say they "dial" a phone number even though no rotary dial is involved. What matters is that we are talking about chips with so many transistors that nobody can route them by hand, and the entire production machinery, EDA software, photomasks, fabs, packaging, has scaled to match.
1.2 Moore's Law: an empirical observation that became a self-fulfilling roadmap
In 1965 Gordon Moore, then at Fairchild, plotted the number of components per IC on a log scale and noticed it was a clean exponential. He predicted, in a tossed-off magazine article, that the number would double roughly every year. By 1975 he revised this to "every two years," which is the form people remember as Moore's Law.
Moore's Law is not a law of physics. It is an industry roadmap that became self-fulfilling because every fab, every EDA company, every IP supplier, and every customer planned around it. If TSMC said "we will have a node with 2x density in two years," every chip designer planned a product to ship on that node, every tool vendor wrote tools for that node, every memory supplier scheduled silicon to feed those products, and so on. When everyone is rowing in the same direction, the boat moves.
The doubling did not come from any one trick. Each generation came from a careful blend of:
- Smaller transistors (lithography improvements).
- New transistor structures (planar to FinFET to gate-all-around).
- New materials (high-k dielectrics, copper interconnect, strained silicon, low-k dielectrics, cobalt fill).
- More metal layers and better routing.
- Bigger chip area (within reticle limits) and better yield.
Real-world frame. From 1971 (Intel 4004, 2,300 transistors) to 2024 (Apple M4 Max, ~28 billion), transistor count doubled about 24 times. That is a factor of 16 million increase in 53 years. No other engineering discipline has matched this.
1.3 The end of Dennard scaling
Robert Dennard at IBM described in 1974 how to shrink transistors so that everything got better at once. Shrink linear dimensions by a factor , shrink the voltage by the same factor , and:
- Capacitance per transistor falls as .
- Current per transistor stays roughly proportional to , scaling carefully.
- Switching delay falls as , so the chip gets faster.
- Power per transistor falls as , while density rises by , so power per area stays constant.
This was the dream: shrink, get faster, denser, lower power, all at once. It worked for thirty years.
Around 2005 the trick broke. The reason is that the threshold voltage of a MOSFET cannot be scaled arbitrarily low. Subthreshold leakage current depends exponentially on , and at room temperature mV. Push much below ~300 mV and leakage explodes.
With stuck, you cannot drop much further (you need headroom to keep transistors on). With stuck, no longer falls per generation. Power density, the heat per square millimeter, stops falling. And once you cannot get rid of heat, you cannot crank the clock. This is why your CPU has been stuck near 5 GHz since 2005.
The industry's response has been to scale by adding cores, by adding specialized accelerators (GPU, NPU, video, crypto, ISP), by improving cache and interconnect, and by using power gating to keep huge swathes of the chip dark when not in use. The era of "free lunches from process scaling" is over. The era of "architectural cleverness to extract value from shrinking" is here.
1.4 Why this matters for security
Two things follow from Moore's Law breaking that are deeply relevant to hardware security. First, the leakage that broke Dennard scaling is the same leakage that is exploited by static side channels, where simply measuring the standby current of a chip can reveal the state of certain SRAM cells. Second, the move to architectural cleverness means more side-channel surface, more shared resources between cores (caches, branch predictors, ring buses) which became the launching pads for Spectre, Meltdown, RowHammer, MDS, and dozens of other attacks. Hardware did not get less secure on purpose. It got less secure because the easy speedups dried up and engineers started picking up the harder, riskier ones.