Faceted Navigation
Faceted navigation is the UI pattern that lets users narrow a content list by combining filters and sorts — category, tag, price, sort order, and so on. Common on e-commerce category pages and blog tag or filter pages, it boosts usability but creates one of the hardest technical SEO problems: URL explosion.
Faceted navigation is the UI pattern that lets users narrow a content list by combining filters and sorts — category, tag, price, sort order, and so on. Common on e-commerce category pages and blog tag or filter pages, it boosts usability but creates one of the hardest technical SEO problems: URL explosion.
Why It Matters
Three filters with 5 options each theoretically produce 5³ = 125 URL combinations. Add more filters and you're looking at tens of thousands of URLs. When Googlebot crawls them all, you get:
- Crawl budget waste: Resources flow to filter combinations instead of real content (the actual posts).
- Duplicate content: The same post list repeats under dozens of sort and filter URLs.
- Thin content: Filter combinations with 0–2 results pile up in the index.
- Diluted link equity: Backlinks land on parameter-laden URLs instead of canonical ones, scattering ranking signals.
Google's Recommended Handling
1. Index only valuable facets: Allow combinations with real search demand (e.g., "Italian restaurants in Seoul") and block ones nobody searches (e.g., "sorted by price descending").
2. Canonical normalization: Point canonicals for semantically equivalent combinations to one representative URL — e.g., /blog?tag=seo&sort=latest → canonical /blog?tag=seo.
3. noindex, follow meta tag: Use <meta name="robots" content="noindex, follow"> on filter pages you don't want indexed. Blocks indexing but still lets Google follow internal links.
4. robots.txt Disallow: For URL patterns you want Googlebot to skip entirely (?sort=, ?view=, ?utm=), block them in robots.txt. Use this carefully — it also blocks ranking signals from external links to those URLs.
5. No more URL Parameters tool: Google retired the URL Parameters tool in Search Console in 2022. Parameter handling now depends on canonical tags, meta robots, and robots.txt only.
6. rel="nofollow" on filter links: Apply nofollow to filter links so crawlers don't chase them. Aggressive but effective.
What to Index vs Block
Allow indexing:
- Primary category and tag pages representing a distinct topic
- Filter combinations with proven search demand
- Landing pages enriched with unique descriptions
Block indexing:
- Sort-order URLs (
?sort=price_asc,?sort=date_desc) - Items-per-page URLs (
?per_page=20) - Internal search result URLs (
?q=...) - Pages combining 3+ filters
- URLs with session or tracking parameters
How to Diagnose
- Search Console Coverage report: Check URLs marked "Crawled — currently not indexed" for parameter abuse.
- Ahrefs / Screaming Frog site crawl: Measure how many URLs your site actually exposes. If it's 10x what you expected, you have a facets problem.
- Log file analysis: See which URL patterns are eating Googlebot requests to diagnose crawl budget waste.
Sources: