Discussion:
[ovirt-users] Re: Cluster stuck in maintenance - Hosts not upgrading as expected
Edward Berger
2021-05-23 01:34:34 UTC
Permalink
I'm saw something similar on a test cluster on CentOS 8.3.
You can take it out of global maintenance mode by navigating the engine UI
to edit cluster -> scheduling policy and turn off global maintenance there.

Not sure what else is going on. It wants me to put all three hosts into
maintenance mode which is impossible.
I have a 3-node hyperconverged cluster with Gluster filesystem running on
RHEL 8.3 hosts.
It's been stable on oVirt 4.5.
Today, I just upgraded the Engine to v4.6.
[image: Screenshot from 2021-05-22 20-29-23.png]
I then logged into the oVirt manager, navigated to Compute -> Clusters,
and clicked on Upgrade.
The Status of the cluster does have an exclamation mark that says "Upgrade
Cluster Compatibility Level", but my understanding is, I shouldn't do that
until all 3 of my hosts have been upgraded.
Looking at active tasks, I see that the first host in my cluster is stuck
in "Upgrading".
It's now been like this for 3 hours, without any visible progress.
If I try to launch a VM that is not configured for high availability, the
VM fails to launch, and I get a message about the cluster being in
"Maintenance Mode" and that none of the hosts satisfies current scheduling
restraints.
How do I....
a) Cancel / clear the upgrade and take the cluster out of "maintenance
mode"
b) Properly upgrade the cluster, since clicking the "Upgrade" button
clearly didn't work?
Thanks,
David
Sent with ProtonMail <https://protonmail.com> Secure Email.
_______________________________________________
Privacy Statement: https://www.ovirt.org/privacy-policy.html
https://www.ovirt.org/community/about/community-guidelines/
Ritesh Chikatwar
2021-05-24 05:53:00 UTC
Permalink
Hello,


Can you check at which task the upgrade has struck. Please share the
relevant log at the time.


To check at which tasks it failed ssh to ovirt engine and navigate to
directory /var/log/ovirt-engine/host-deploy/ and also share the vdsm logs
from host as well.
Post by Edward Berger
I'm saw something similar on a test cluster on CentOS 8.3.
You can take it out of global maintenance mode by navigating the engine UI
to edit cluster -> scheduling policy and turn off global maintenance there.
Not sure what else is going on. It wants me to put all three hosts into
maintenance mode which is impossible.
I have a 3-node hyperconverged cluster with Gluster filesystem running on
RHEL 8.3 hosts.
It's been stable on oVirt 4.5.
Today, I just upgraded the Engine to v4.6.
[image: Screenshot from 2021-05-22 20-29-23.png]
I then logged into the oVirt manager, navigated to Compute -> Clusters,
and clicked on Upgrade.
The Status of the cluster does have an exclamation mark that says
"Upgrade Cluster Compatibility Level", but my understanding is, I shouldn't
do that until all 3 of my hosts have been upgraded.
Looking at active tasks, I see that the first host in my cluster is stuck
in "Upgrading".
It's now been like this for 3 hours, without any visible progress.
If I try to launch a VM that is not configured for high availability, the
VM fails to launch, and I get a message about the cluster being in
"Maintenance Mode" and that none of the hosts satisfies current scheduling
restraints.
How do I....
a) Cancel / clear the upgrade and take the cluster out of "maintenance
mode"
b) Properly upgrade the cluster, since clicking the "Upgrade" button
clearly didn't work?
Thanks,
David
Sent with ProtonMail <https://protonmail.com> Secure Email.
_______________________________________________
Privacy Statement: https://www.ovirt.org/privacy-policy.html
https://www.ovirt.org/community/about/community-guidelines/
_______________________________________________
Privacy Statement: https://www.ovirt.org/privacy-policy.html
https://www.ovirt.org/community/about/community-guidelines/
Loading...