Thin Content Detector
Analyze content quality — word count, readability, text-to-HTML ratio, and more
What Is Thin Content?
Thin content refers to web pages that provide little or no value to users. Google's quality guidelines specifically target thin content as a ranking factor. Pages with thin content often appear in GSC's "Crawled - currently not indexed" report because Google doesn't consider them worthy of indexing.
Key Metrics Explained
Word Count
Pages under 300 words are high-risk for thin content. Aim for 600+ words for informational pages. However, word count alone doesn't define quality — a 200-word FAQ answer can be perfectly sufficient.
Text-to-HTML Ratio
The percentage of visible text versus total HTML code. Below 10% often signals boilerplate-heavy pages (navbars, sidebars, footers) with little actual content. Aim for 25%+.
Readability Score
Flesch Reading Ease score (0-100). Higher scores mean easier to read. Web content should aim for 60-70 (easily understood by 13-15 year olds). Academic writing typically scores 30-50.
Content Quality Score
Our composite score (0-100) based on word count, text-to-HTML ratio, readability, heading usage, and image presence. Pages scoring below 40 should be expanded or consolidated.
How to Fix Thin Content in GSC
- 1. Check GSC "Pages" report → filter by "Crawled - currently not indexed"
- 2. View source of flagged pages and paste the HTML into this tool
- 3. Fix the issues: expand content, improve readability, or consolidate with related pages
- 4. Request re-indexing via GSC URL Inspection tool after improvements