A few things break in bot_project due to the d.o updates:

- html does not properly get parsed
- http requests get redirected to https anyway, so the hidden bot_project_too_lazy_to_recompile_for_ssl variable doesn't make sense anymore

additional changes i am including in this issue are:
- error reporting when project URLs are not returned
- updating the default value for the node ids since those nowadays are pretty high (from 99999 to 9999999)

Support from Acquia helps fund testing for Drupal Acquia logo

Comments

snufkin’s picture

Status: Active » Needs review
FileSize
5.5 KB

Attaching a patch that fixes the parsing and moves to XPath parser (easy to update, just use chrome, inspect element and copy XPath).

snufkin’s picture

Now it supports non-project pages too!

snufkin’s picture

And now it actually does that correctly.