SEO

Faceted Navigation

Faceted navigation is the UI pattern that lets users narrow a content list by combining filters and sorts — category, tag, price, sort order, and so on. Common on e-commerce category pages and blog tag or filter pages, it boosts usability but creates one of the hardest technical SEO problems: URL explosion.

Faceted navigation is the UI pattern that lets users narrow a content list by combining filters and sorts — category, tag, price, sort order, and so on. Common on e-commerce category pages and blog tag or filter pages, it boosts usability but creates one of the hardest technical SEO problems: URL explosion.

Why It Matters

Three filters with 5 options each theoretically produce 5³ = 125 URL combinations. Add more filters and you're looking at tens of thousands of URLs. When Googlebot crawls them all, you get:

  • Crawl budget waste: Resources flow to filter combinations instead of real content (the actual posts).
  • Duplicate content: The same post list repeats under dozens of sort and filter URLs.
  • Thin content: Filter combinations with 0–2 results pile up in the index.
  • Diluted link equity: Backlinks land on parameter-laden URLs instead of canonical ones, scattering ranking signals.

Google's Recommended Handling

1. Index only valuable facets: Allow combinations with real search demand (e.g., "Italian restaurants in Seoul") and block ones nobody searches (e.g., "sorted by price descending").

2. Canonical normalization: Point canonicals for semantically equivalent combinations to one representative URL — e.g., /blog?tag=seo&sort=latest → canonical /blog?tag=seo.

3. noindex, follow meta tag: Use <meta name="robots" content="noindex, follow"> on filter pages you don't want indexed. Blocks indexing but still lets Google follow internal links.

4. robots.txt Disallow: For URL patterns you want Googlebot to skip entirely (?sort=, ?view=, ?utm=), block them in robots.txt. Use this carefully — it also blocks ranking signals from external links to those URLs.

5. No more URL Parameters tool: Google retired the URL Parameters tool in Search Console in 2022. Parameter handling now depends on canonical tags, meta robots, and robots.txt only.

6. rel="nofollow" on filter links: Apply nofollow to filter links so crawlers don't chase them. Aggressive but effective.

What to Index vs Block

Allow indexing:

  • Primary category and tag pages representing a distinct topic
  • Filter combinations with proven search demand
  • Landing pages enriched with unique descriptions

Block indexing:

  • Sort-order URLs (?sort=price_asc, ?sort=date_desc)
  • Items-per-page URLs (?per_page=20)
  • Internal search result URLs (?q=...)
  • Pages combining 3+ filters
  • URLs with session or tracking parameters

How to Diagnose

  • Search Console Coverage report: Check URLs marked "Crawled — currently not indexed" for parameter abuse.
  • Ahrefs / Screaming Frog site crawl: Measure how many URLs your site actually exposes. If it's 10x what you expected, you have a facets problem.
  • Log file analysis: See which URL patterns are eating Googlebot requests to diagnose crawl budget waste.

Sources: