INSTITUTIONAL CONCEPT · SPX + NDX LIVE-WEIGHT PIE

Index Weights
The S&P 500 as a single pie — not a logo wall

Most visualizations of the S&P 500 show you the top ten and call it a day. This one draws every one of 503 constituents at its live market-cap weight — then lets you flip to the NASDAQ-100 (101 slices, $40.28T), switch cap-weight to equal-weight, and watch the top-10 share collapse from 42.10% → 2.00%. The chart is the argument: index concentration is a design problem, not a chart-library default.

Institutional Context: Constituent lists scraped from Wikipedia (public), market caps and live price from yfinance, GICS sector taxonomy applied across both indices. What you're looking at is the same information a risk officer sees before they sign off on a concentration-risk memo, a PM sees before they rebalance a pooled fund, and a compliance officer sees when the regulator asks “how much of this index is five stocks?” Designed as a concept deliverable with production-grade data — no forecasting, no screenshots, every slice is a rectangle a designer drew to a number.

SPX constituents NDX constituents yfinance Related: Macro Signal case

01 · Signal numbers

Concentration is a design problem before it is a risk problem

Both indices are cap-weighted — a stock’s share of the pie scales with its market cap. When that cap doubles, its slice doubles. Six numbers summarize what the pie is actually doing.

SPX top-10 share 42.10% Ten of 503 names. NVDA + GOOGL + GOOG + AAPL + MSFT alone: 29.2%.

NDX top-10 share 72.35% Ten of 101 names. If you own the NDX, you own Big Tech by construction.

SPX Herfindahl index 230.6 DOJ threshold for “moderately concentrated” is 1,500. SPX is not there — yet.

NDX Herfindahl index 638.2 2.8× the SPX figure. A single negative NVDA day moves the whole index.

SPX total market cap $69.24T Roughly 2.3× U.S. GDP. The pie is the economy, weighted by listing.

GICS sectors drawn 11 Information Technology alone is 31.30% of SPX — and 49.58% of NDX.

Source · data/indices/spx.json + data/indices/ndx.json, refreshed 2026-04-20 via scripts/refresh-indices.py. See methodology for refresh cadence and known limitations.

02 · The pie

Every constituent, drawn to live weight

Four controls: index (SPX / NDX), weighting (cap / equal), focus (top N shaded, tail dimmed), and search (ticker or name). Hover a slice or row to cross-highlight the other. The readout panel on the right is the same field pattern a terminal user would expect — ticker, name, sector, cap, weight, price — no screenshots, all derived from the live snapshot.

Index

Weighting

Focus top 10 names

Find ticker or name

Loading data/indices/spx.json…

# Name Sector Weight

03 · SPX vs NDX — side by side

Same top five names. Very different pies.

The top five constituents are identical across the two indices — NVDA, GOOGL, GOOG, AAPL, MSFT. What differs is how much air around them there is. In the SPX, 490 smaller names absorb the rest of the chart; in the NDX, only 91 smaller names do, and they do it in a quarter of the area.

S&P 500 503 constituents · $69.24T · HHI 230.6

NASDAQ-100 101 constituents · $40.28T · HHI 638.2

If you are a risk officer, the NDX HHI of 638.2 is the number that changes your position sizing. If you are a product designer, it is the number that tells you the chart has to show the difference at a glance, not in a footnote.

04 · Sector roll-up

Eleven sectors. Two of them own the pie.

GICS splits the investable universe into 11 sectors. In the SPX, Information Technology + Communication Services already add to 48.21%; in the NDX, the same two add to 76.68%. Real Estate in the NDX is 0.04% — a single-pixel slice on a 1000×1000 canvas.

S&P 500 GICS 11

Information Technology31.30%
Communication Services16.91%
Financials11.88%
Consumer Discretionary10.08%
Industrials8.03%
Health Care8.00%
Consumer Staples5.14%
Energy2.98%
Utilities2.11%
Materials1.79%
Real Estate1.78%

NASDAQ-100 GICS 11

Information Technology49.58%
Communication Services27.10%
Consumer Discretionary12.72%
Consumer Staples4.79%
Health Care2.23%
Industrials1.89%
Utilities0.69%
Materials0.57%
Energy0.27%
Financials0.12%
Real Estate0.04%

GICS is the taxonomy S&P Global + MSCI agreed on in 1999. Every asset manager in the world prices, reports, and hedges against these 11 buckets — which is why the colors in this chart are not decorative: they are the vocabulary compliance will use when it asks you to explain a position.

05 · Design decisions

Four choices that separate a demo from a product surface

Most index visualizations default to a treemap and a top-10 table. I made the opposite call for four reasons. Each is a trade-off I would defend to a design director, a CIO, and a compliance lead in the same room.

Pie, not treemap.

Treemaps are spatially accurate but semantically wrong for “what share of the market is this?” A treemap rectangle’s area is two-dimensional — viewers read width and height, not area. A pie slice’s angle is one-dimensional and matches the mental model of “portion of whole.” I took the 10% hit on fine-detail accuracy to gain the 100% hit on semantic fit.

When I need the fine detail — ordering of names 40–80 — I add the leaderboard beside the chart. Two views, one screen, no mode toggle.

All 503 slices. Every one.

The common compromise is “top 10 + ‘Other 1.9%’ lump.” That lump is the point of the chart. The tail is what makes the SPX different from the NDX — 490 small companies absorbing 57.9% of the pie. Collapsing them to one slice erases the distinction that a risk officer most wants to see.

At the tail, slices are smaller than a pixel — fine. They are not meant to be individually readable; they are meant to make the shape of the tail visible.

Color by sector, not by name.

503 distinct colors is a perceptual catastrophe. 11 sectors is the taxonomy the industry already speaks. Compliance memos talk sectors. Model-risk docs talk sectors. SEC 10-Ks are filed by sector. Using GICS as the color primary maps the chart onto a vocabulary the viewer has already paid for.

The eleven hues are deuteranopia-tested and avoid the gold reserved for the portfolio accent. They are tokens, not decisions a designer makes per page.

Cap-weighted vs equal-weighted on one click.

The quietest signal in institutional investing is the gap between SPX and SPXEW — the equal-weight sibling. When cap-weighted outperforms equal-weighted, mega-caps are pulling the market; when equal outperforms cap, the broad economy is. Making that flip a single button is the difference between a “nice chart” and a morning-meeting artifact.

Equal-weight is where the animation earns its keep — watching 503 slices re-snap to 1/503 each is how a viewer feels the concentration, not reads it.

06 · Methodology & references

The pipeline, the caveats, the credits

A chart that can’t explain its own numbers is a screenshot in a fancy frame. Here is exactly where every slice came from, when it was refreshed, and what it will not tell you.

Data pipeline

1
Constituent list — scraped from Wikipedia’s List of S&P 500 companies and NASDAQ-100 articles via pandas.read_html. Both pages are curated by editors against the underlying index methodologies. UA header required to avoid Wikipedia’s 403 on default urllib.
2
Market cap & price — pulled per-ticker from yfinance’s fast_info endpoint (camelCase keys marketCap, lastPrice), with fallback to the .info dict. 0.12s throttle between calls, 2 retries.
3
Sector normalization — NDX Wikipedia page lacks a sector column, so sectors are carried from the SPX overlap first (87 tickers), fall back to yfinance info.sector (14 tickers), then normalized with an alias map (Technology → Information Technology, Healthcare → Health Care, Consumer Cyclical → Consumer Discretionary, etc.) to one canonical GICS 11 taxonomy.
4
Weight & write — each constituent’s weight is 100 × market_cap / ∑market_cap, then sorted descending and written atomically (.tmp → os.replace) to data/indices/spx.json and ndx.json. Missing caps are set to null and excluded from the denominator.

Known limitations

Snapshot, not stream. yfinance’s free tier is intraday delayed. The JSON is dated (as_of: 2026-04-20). For a real desk, swap in an IEX or Polygon feed — the shape of the JSON stays identical, the numbers refresh in seconds.
Class A + Class C siblings. GOOGL and GOOG are both in the pie, as they are in the index itself. Combined their share is 11.90% of SPX — which is the real Alphabet weight if you treat them as one economic entity. I left them split because the index does.
Weights ≠ methodology rules. S&P and Nasdaq apply float-adjusted, capping-rule-constrained weights (Nasdaq reweights quarterly to cap the largest constituents at 24% of NDX). The raw market-cap ratios here are a first-order approximation — typically within a few basis points of the published weights, not guaranteed equal.
No time series (yet). Today’s pie, not history. A second pass could add a “playback” slider across quarterly snapshots to show the tech rotation through 2020–2025. Explicitly out of scope for v1.

References & credits

Index methodologies: S&P U.S. Indices Methodology · Nasdaq-100 Index Methodology
GICS taxonomy: MSCI Global Industry Classification Standard (S&P Global + MSCI, 1999).
Concentration math: U.S. DOJ — Herfindahl-Hirschman Index · “under 1,500” = unconcentrated; “1,500–2,500” = moderately; “above 2,500” = highly.
Data tooling: pandas + yfinance + vanilla SVG arc math. No charting library.
Design precedent: Bloomberg Terminal WEI function (world equity indices overview) · finviz.com/map (treemap precedent — the thing I chose to not copy).

Portfolio threads

Where this case study sits in the larger web

Every problem we solve for clients has multiple valid approaches — different costs, different ROI, different risk profiles. These threads show how the approach on this page compares to others in the portfolio.

Thread

Concentration, Risk & Agents

Portfolio-level math primitives — HHI, beta, VaR, regime — rendered into UI defaults and AI-assisted decision surfaces.

Index Weights · SPX + NDX Concentration math exposed Low eng cost · 503+101 slices · HHI 230.6 / 638.2
Macro Signal Network Regime classifier feeding routing Low eng cost · 4 prints → 14 surfaces
Nova · Institutional Trading Terminal Portfolio risk surfaces on one shell High eng cost · Beta/Duration/Kelly/VaR unified
TradeX Institutional Stress test & position sizing High eng cost · 12 holdings · 3-factor composite
Hedge Fund · 19-Agent Committee Multi-persona investment consensus High eng cost · Agent reliability overhead
AI Cowork · Agent UX AI in the decision loop Med eng cost · Trust fallbacks · hallucination guardrails
Private Banking Advisory Diversification mandate High eng cost · Concentration compliance · FINRA 2111
Intent Canvas · Shared AI Artifact Prose → typed plan for rebalance Low eng cost · Typed artifact · Human approval gate

Thread

Evidence & Verification Discipline

How we prove design claims with data — A/B, pooled-SD, cohort analysis, and the rigor behind every number quoted on this site.

Finlogix · Retail Trader Education A/B tested retention Low eng cost · Cohen's d 2.47 · 90d post-launch
ACY Securities · Regulated Broker System Mixpanel audit High eng cost · 90-day funnel · component adoption
Index Weights · SPX + NDX Concentration math verified Low eng cost · Wikipedia + yfinance pipeline
AI Cowork · Agent UX AI reliability measurement Med eng cost · Hallucination rates · fallback UX
Intent Canvas · Shared AI Artifact Evidence attached to every AI claim Low eng cost · Evidence per row · Confidence bar

Continue the tour

More institutional case studies

← Previous case Macro Signal — FRED Data as Product UNRATE · CPI · DGS10 · VIX · regime bands + product routing Next case → ReactΩ — Design System as Product 50+ components · token architecture · 5 years distilled

View all 14 projects →