You might notice discrepancies in the number of records from one crawl to the next, or you might want to refresh your site’s content between scheduled crawls without causing problems. For more information, see Monitoring crawlers.

Difference in the number of records between crawls

Problem

A noticeable increase or decrease in the number of records between crawls may be due to:
  • Temporary site problems. These may prevent the crawler from finding everything.
  • Changes to the site. If your site changed but the crawler configuration wasn’t updated, the crawl might not fully capture the site content. For example, if your company’s job listings page moves from www.example-company.com/careers to www.work-at-example-company.com, the crawl wouldn’t include the new job listing URLs.
  • Timing issues with site updates and crawls. For example, you update your site’s content at 8 AM every day, but the crawler is scheduled for 9 AM. If the updates aren’t completed before the crawl starts, the crawl may miss the new content, and users might come across outdated information.
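To make "noticeable" concrete, the change between two crawls can be expressed as a percentage of the previous record count. The following is a minimal sketch: `recordChangePercent`, `isNoticeableChange`, and the 10% default threshold are hypothetical names and values chosen for illustration, not part of the crawler.

```typescript
// Hypothetical helpers for illustration only; not part of any crawler API.

// Percentage change in record count from the previous crawl to the current one.
// Negative values mean records were lost, positive values mean records were added.
function recordChangePercent(previous: number, current: number): number {
  if (previous === 0) {
    // No baseline to compare against: treat any new records as an unbounded increase.
    return current === 0 ? 0 : Infinity;
  }
  return ((current - previous) * 100) / previous;
}

// True when the change, in either direction, exceeds the threshold.
// The 10% default is an assumption; pick a value that suits your site.
function isNoticeableChange(
  previous: number,
  current: number,
  thresholdPercent: number = 10
): boolean {
  return Math.abs(recordChangePercent(previous, current)) > thresholdPercent;
}
```

For example, going from 1,000 records to 850 is a 15% decrease, which this sketch would flag with the default threshold, while going from 1,000 to 1,050 would not be flagged.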

Solutions

| Issue | Solution |
| --- | --- |
| Temporary site problems | Perform a manual re-crawl once the site issues are resolved. |
| Changes to the site | Update your crawler configuration as needed and perform a re-crawl. For large changes, consider updating the safety check threshold. |
| Timing issues | Allow ample time for updates to complete before the scheduled crawl starts. |
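For timing issues, one way to check whether a schedule leaves enough room is to compare the expected end of the site update with the crawl start time. This is a rough sketch under stated assumptions: both times fall on the same day and are given as 24-hour "HH:MM" strings, and `minutesOfDay`, `crawlStartsAfterUpdate`, and the 15-minute safety margin are hypothetical, not crawler settings.

```typescript
// Hypothetical helpers for illustration only; not part of any crawler API.

// Convert a 24-hour "HH:MM" string to minutes since midnight.
function minutesOfDay(hhmm: string): number {
  const [hours, minutes] = hhmm.split(":").map(Number);
  return hours * 60 + minutes;
}

// True when the crawl starts after the update is expected to finish,
// plus a safety margin. Assumes both times are on the same day.
function crawlStartsAfterUpdate(
  updateStart: string,
  updateDurationMinutes: number,
  crawlStart: string,
  marginMinutes: number = 15
): boolean {
  const updateEnd = minutesOfDay(updateStart) + updateDurationMinutes;
  return minutesOfDay(crawlStart) >= updateEnd + marginMinutes;
}
```

With an update that starts at 08:00 and a crawl at 09:00, a 30-minute update leaves enough room, but a 75-minute update does not, so the crawl would need to move later.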

Update between scheduled crawls

Problem

Your crawl might be scheduled to run at specific intervals, but there could be occasions when immediate updates are necessary. For example, a news site might need to publish breaking news immediately.

Solution

To add just a few pages, adjust the crawler configuration and then manually re-crawl from the Crawler dashboard.