In my sitemap.xml file, a number of pages that are only visible to admins are being listed.

Google is indexing these links and showing these as crawl errors in the Search Console.

Not sure what other information I need to provide.

Comments

pmagunia created an issue. See original summary.

pmagunia’s picture

I should add that these are webform submissions that are being listed as 403.

pmagunia’s picture

The solution I found was to rebuild and regenerate the sitemap from drush.

I have disabled sitemap rebuilding via cron via the Module interface. This way if I run cron as an authenticated user, there will be no 403s in the sitemap.xml file.

zakiya’s picture

I'm experiencing the same issue. Drupal 8.5.1; xmlsitemap 8.x-1.0-alpha2

Nodes are being restricted using the Permissions By Term (permissions_by_term 8.x-1.44) module. I followed the suggestion above of running drush cron but the node is still in the sitemap. I also tried running drush xmlsitemap-regenerate with the same result.

zakiya’s picture

Fixed by

- First rebuilding the node grants using /admin/reports/status/rebuild
- rebuilding the sitemap
- drush xmlsitemap-rebuild
- drush xmlsitemap-regenerate

Dave Reid’s picture

Category: Bug report » Support request
Status: Active » Fixed

Sounds like the node grants being out of date were the issue.

Status: Fixed » Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.