Hi,

As I already posted yesterday in IRC, I have a MEGA problem. I started an Aegir migration from rc5 to stable. While running the migration I got an error message: https://gist.github.com/anonymous/a4a96c61344f018dc405. After that Aegir was still running, but the hosting queue failed and was still pointing to the old rc5. In the web frontend the complete configuration also still points to rc5: the frontend itself already seems to run under stable, but the configuration shows rc5. I was able to run drush hostmaster commands after changing the path in .drush/hostmaster.alias.drushrc.php.
But I am not able to verify the hostmaster site, and I think the system is really broken.

What can I do to get a running Aegir installation again?

Thank you very much!

Best regards

dob

Comment  File       Size      Author
#4       debug.txt  19.96 KB  dob_
         Log.txt    41.54 KB  dob_

Comments

helmo

Note to others: I've already created #2171075: Fix hostmaster-resume command options after speaking with dob_ in IRC. The patch there might have prevented this situation.

Question now is how to recover.

helmo

Thanks for the full log.

Did you try the command drush --debug @hostmaster hostmaster-resume platform_hostmaster6x20rc5 platform_hostmaster6x20?

Or, as ergonlogic suggested, drush --debug @hostmaster hosting-task <nid>, if you know the node ID of a task.

Or drush --debug @hostmaster provision-verify.

Please provide the full debug log.

dob_

Okay, it seems my hostmaster platform is missing.

I tried the following:
drush --debug @hostmaster hostmaster-resume platform_hostmaster6x20rc5 platform_hostmaster6x20
I attached the debug log to the issue.

Then I ran:
drush --debug @hostmaster provision-verify

Got this error:
PHP Fatal error: Call to a member function get_services() on a non-object in /var/aegir/.drush/provision/Provision/Context.php on line 293

Fatal error: Call to a member function get_services() on a non-object in /var/aegir/.drush/provision/Provision/Context.php on line 293

Then my web interface was gone. The vhost pointed to the rc5 platform and site, which is now empty.

I changed platform_hostmaster.alias.inc to point to the stable platform and ran
drush --debug @hostmaster provision-verify

Then my interface was available again.
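
For anyone repeating the alias edit described above, here is a minimal sketch of one way to do it with a backup. The helper name repoint_alias and the platform directory names in the usage comment are hypothetical; only the .drush alias location comes from this thread. It does a plain text substitution, so check the result before running provision-verify.

```shell
#!/bin/sh
# Hypothetical helper: replace one platform path with another in a drush
# alias file. The original is kept next to it as <file>.bak.
repoint_alias() {
  alias_file=$1   # alias file to edit
  old_path=$2     # path currently in the file (e.g. the rc5 platform)
  new_path=$3     # path it should point to (e.g. the stable platform)

  # Keep a copy so the change can be undone by hand.
  cp "$alias_file" "$alias_file.bak" || return 1

  # '|' is the sed delimiter so slashes in the paths need no escaping.
  # Assumption: neither path contains '|' or '&'.
  sed "s|$old_path|$new_path|g" "$alias_file.bak" > "$alias_file"
}

# Example call (paths are assumptions, adjust to the real platform dirs):
#   repoint_alias /var/aegir/.drush/hostmaster.alias.drushrc.php \
#     /var/aegir/hostmaster-6.x-2.0-rc5 /var/aegir/hostmaster-6.x-2.0
```

If the result looks wrong, copying the .bak file back over the alias file restores the previous state.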

If I click on hostmaster in the platform section, the link only points to the URL
http://aegir01.example.com/hosting/c/
and I get a "page not found" error.

Any ideas?

dob_

Attached: debug.txt (19.96 KB)

dob_

Solved the problem by reinstalling the server. ;-)

Aegir Docker installation

helmo

Status: Active » Fixed

I hope we can make this easier in the future ... at least the root cause is fixed.

dob_

Helmo, I think the fix should be in the current stable version to prevent that failure. Will you release a 2.0.1 as a bugfix release?

Status: Fixed » Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.

xurizaemon

Status: Closed (fixed) » Needs work

Experienced the same issue. Some additional info ...

* In the web UI, links to the platform go to hosting/c, but the replacement platform could be found at hosting/c/platform_hostmaster6x20
* {hosting_context} contained an entry with no name, causing the broken link to the platform
* correcting the above (and removing one of two aliases for that node/%nid path in {url_alias}) permitted the platform and hostmaster site to verify
* I may or may not have manually corrected a few other fields before doing the above
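
To check whether an install has the same nameless {hosting_context} entry, a query along these lines might help. This is only a sketch: it assumes the table is called hosting_context without a database prefix and has nid and name columns, and the function name blank_context_query is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: print a query that lists hosting_context rows whose
# name is missing or empty. Assumes an unprefixed table with nid and name.
blank_context_query() {
  printf '%s\n' "SELECT nid, name FROM hosting_context WHERE name IS NULL OR name = ''"
}

# Example call (not run here, needs a working @hostmaster alias):
#   drush @hostmaster sql-query "$(blank_context_query)"
```

Any nid it returns is a candidate for the broken hosting/c link; look at the node before changing anything.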

Theory: In Provision_Context::get_services(), if the hosting context doesn't have a name or the name is empty, we end up with an empty $this->parent_key, are unable to reference the platform using $this->{$this->parent_key}, and it all goes boom.

Moving back to needs work in case this wants a fix rather than a workaround. Still unsure how we ended up with a blank name in hosting_context, sorry!

helmo

Status: Needs work » Fixed

Marking as fixed as the question appears to be answered.
--
This is a templated response, please re-open or comment if you think it's in error.

Status: Fixed » Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.