Log File Analyzer
Parse server logs to see what Googlebot actually crawls.
Start here · Why parse log files here?
Server logs show what actually fetched your URLs, which matters when crawl diagnostics conflict with crawl simulations.
This analyzer expects Apache/nginx-style CLF lines: IP, identity, user, timestamp, request verb + path + protocol, status, bytes, referrer, user-agent.
It classifies user-agents into major bots, aggregates status code totals, surfaces the busiest paths, and lists individual error hits for triage.
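To make the expected field order concrete, here is a minimal TypeScript sketch of how a CLF line can be parsed and tallied. The regex, field names, and summarize helper are illustrative assumptions, not the analyzer's actual code.

```typescript
// Minimal sketch, not the analyzer's actual code. Field order mirrors the
// description above: IP, identity, user, [timestamp], "verb path protocol",
// status, bytes, "referrer", "user-agent".

interface LogHit {
  ip: string;
  timestamp: string;
  method: string;
  path: string;
  status: number;
  bytes: number;
  referrer: string;
  userAgent: string;
}

const CLF_PATTERN =
  /^(\S+) (\S+) (\S+) \[([^\]]+)\] "(\S+) (\S+) [^"]*" (\d{3}) (\d+|-) "([^"]*)" "([^"]*)"$/;

function parseLine(line: string): LogHit | null {
  const m = CLF_PATTERN.exec(line.trim());
  if (!m) return null; // unmatched lines are skipped silently, as in the FAQ
  return {
    ip: m[1],
    timestamp: m[4],
    method: m[5],
    path: m[6],
    status: Number(m[7]),
    bytes: m[8] === "-" ? 0 : Number(m[8]),
    referrer: m[9],
    userAgent: m[10],
  };
}

// Aggregate status totals and busiest paths across pasted lines.
function summarize(lines: string[]) {
  const byStatus = new Map<number, number>();
  const byPath = new Map<string, number>();
  for (const line of lines) {
    const hit = parseLine(line);
    if (!hit) continue;
    byStatus.set(hit.status, (byStatus.get(hit.status) ?? 0) + 1);
    byPath.set(hit.path, (byPath.get(hit.path) ?? 0) + 1);
  }
  return { byStatus, byPath };
}

const sample =
  '66.249.66.1 - - [10/Oct/2024:13:55:36 +0000] "GET /sock-guide HTTP/1.1" 404 512 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"';
console.log(parseLine(sample)?.path); // "/sock-guide"
```

Anything the regex cannot match is dropped, which mirrors the silent-skip behavior noted in the FAQ below.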
When to use this tool
- Crawl waste detection
See whether Googlebot repeatedly requests thin faceted paths or 404s before you tune robots.txt or faceting rules.
- Launch monitoring
Paste a slice from launch day to confirm bots see mostly 200 responses.
- Third-party bot noise
AhrefsBot or others may spike; compare bot counts before blaming Google alone.
- Education
Use the bundled sample lines to teach how raw logs differ from UI crawl reports.
Examples
Walk through these with the form above — they are practice scenarios, not live data.
404 cluster
Try this
Include the sample /sock-guide 404 lines, then rerun after fixing the route.
What to look for
The Errors stat should fall, and the Top crawled paths card highlights any recurring bad URLs.
Custom paste
Try this
Paste fifty lines from your CDN log download.
What to look for
If parsing yields zero hits, verify quoting and field order match CLF expectations.
Short tutorial
Follow the steps in order the first time you use the tool; later you can skip straight to the step you need.
- Step 1 — Export logs
Grab plain-text CLF, or translate JSON logs into the classic pattern before pasting (see the converter sketch after this tutorial).
- Step 2 — Paste a representative window
Cover hours or days depending on traffic volume. Huge files may slow the browser, so paste sampled slices instead.
- Step 3 — Read bot and status cards
Confirm Googlebot volume looks sane relative to total hits.
- Step 4 — Inspect top paths
Look for parameter storms, accidental admin paths, or assets incorrectly returning 404.
- Step 5 — Feed findings into fixes
When waste is structural, pair the findings with Crawl Budget Optimizer analysis or redirect tickets.
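For Step 1, a small converter can turn JSON logs back into the classic pattern before pasting. This is a hedged sketch: the JSON field names (remoteAddr, time, uri, and so on) are assumptions about a typical exporter schema, so rename them to match your own logs.

```typescript
// JSON-to-CLF converter sketch. The input field names below are assumptions
// about a typical JSON log schema; adjust them for your exporter.

interface JsonLogEntry {
  remoteAddr: string;   // e.g. "66.249.66.1"
  time: string;         // e.g. "10/Oct/2024:13:55:36 +0000"
  method: string;       // e.g. "GET"
  uri: string;          // e.g. "/sock-guide"
  protocol: string;     // e.g. "HTTP/1.1"
  status: number;
  bytesSent: number;
  referrer?: string;
  userAgent?: string;
}

function toClf(e: JsonLogEntry): string {
  return [
    e.remoteAddr,
    "-", // identity (rarely logged)
    "-", // authenticated user (rarely logged)
    `[${e.time}]`,
    `"${e.method} ${e.uri} ${e.protocol}"`,
    e.status,
    e.bytesSent,
    `"${e.referrer ?? "-"}"`,
    `"${e.userAgent ?? "-"}"`,
  ].join(" ");
}

// Convert a JSON-lines export into pasteable CLF text:
const jsonl =
  '{"remoteAddr":"66.249.66.1","time":"10/Oct/2024:13:55:36 +0000","method":"GET","uri":"/sock-guide","protocol":"HTTP/1.1","status":404,"bytesSent":512,"userAgent":"Googlebot/2.1"}';
const clfText = jsonl
  .split("\n")
  .filter(Boolean)
  .map((line) => toClf(JSON.parse(line) as JsonLogEntry))
  .join("\n");
console.log(clfText);
```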
More detail
New here? Skim Start here first, then run one Examples scenario in the form above.
Log File Analyzer does one job: parse server logs to see what Googlebot actually crawls. It lives under Technical SEO on SEOToolkits, where the core idea is simple: Technical SEO keeps pages crawlable, indexable, fast enough, and understandable to search engines.
FAQ
- Does it support LTSV or JSON logs?
Not yet. Only regex-matched CLF-style lines parse today; convert other formats externally or extend your exporter.
- Can I trust bot names?
Classification is based on user-agent substrings (see the sketch after this FAQ). Bots spoofing a browser user-agent fall under human/other.
- Why zero hits?
- Lines that do not match the parser regex are skipped silently. Check quoting around the request.
- Is data uploaded?
No. Parsing runs entirely in your browser; pasted log lines never leave local memory.
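As a rough illustration of the substring matching mentioned above, the sketch below classifies user-agents against a small bot list. The substrings and labels are assumptions, not the analyzer's exact list.

```typescript
// Sketch of substring-based bot classification. The substrings and labels are
// illustrative; they are not the analyzer's exact list.

const BOT_SUBSTRINGS: Array<[substring: string, label: string]> = [
  ["Googlebot", "Googlebot"],
  ["bingbot", "Bingbot"],
  ["AhrefsBot", "AhrefsBot"],
  ["SemrushBot", "SemrushBot"],
  ["DuckDuckBot", "DuckDuckBot"],
];

function classifyUserAgent(ua: string): string {
  const lower = ua.toLowerCase();
  for (const [substring, label] of BOT_SUBSTRINGS) {
    if (lower.includes(substring.toLowerCase())) return label;
  }
  return "human/other"; // anything unmatched, including spoofed browser UAs
}

console.log(classifyUserAgent("Mozilla/5.0 (compatible; Googlebot/2.1; ...)")); // "Googlebot"
console.log(classifyUserAgent("Mozilla/5.0 (Windows NT 10.0) Chrome/120.0"));   // "human/other"
```

Because the match trusts whatever the user-agent claims, a scraper spoofing a plain browser string lands in human/other, and nothing here verifies that a Googlebot claim is genuine.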
Related tools
Same workflow cluster on SEOToolkits — open another module without leaving context.
Crawl Budget Optimizer
Identify wasted crawl on low-value or duplicate URLs.
Broken Link Checker
Crawl a site for 4xx/5xx links across pages and assets.
Indexability Checker
Determine why specific URLs aren't getting indexed.
Robots.txt Analyzer
Test directives against URLs and user-agents at scale.