How to recover from failed stack updates

Asked by Nilanjan Roy

Hi,

I am trying to update an existing heat stack. In the original stack I have created few vms. Now I want to add more vms to the same stack. I am providing the compute node name where the vm should be launched. At first I put the compute node name wrong and the stack update failed. The newly added VM state is in error in nova list. I again trie to update it with the correct parameters. But it failed. Then I had to delete the stack and create the original one. Then I updated it with the correct template. Then it updated successfully.

Then I simulated the stack failure again and every time once the stack update is failed subsequent updates are also failing. Is it a bug or there is some other way to recover from a stack update? Or else I am missing something? Please help.

Question information

Language:
English Edit question
Status:
Answered
For:
OpenStack Heat Edit question
Assignee:
No assignee Edit question
Last query:
Last reply:
Revision history for this message
Steven Hardy (shardy) said :
#1

Hi,

This is a known problem, which we're working on fixing. See https://bugs.launchpad.net/heat/+bug/1160052 which I've linked.

Also related, see summit etherpad:

https://etherpad.openstack.org/p/icehouse-summit-heat-convergence

And related blueprints:

https://blueprints.launchpad.net/heat/+spec/retry-failed-update
https://blueprints.launchpad.net/heat/+spec/support-retry-with-idempotency

There is a patch posted for retry-failed-update, which when merged, I think may solve your problem:
https://review.openstack.org/#/c/62936/

Can you help with this problem?

Provide an answer of your own, or ask Nilanjan Roy for more information if necessary.

To post a message you must log in.