Digimer
4c7bb45ab9
Fixed a race condition where configuring the IPMI BMC would appear to fail because the BMC wouldn't report the user list after a cold reset.
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
6cbdc388d4
Fixed a bug where corosync's configuration of a backup ring was broken.
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
04cb116c1b
Updated anvil-parse-fence-agents to validate each fence agent's metadata is valid before adding it to the unified XML.
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
8abb5b46e0
* Added support for setting per-agent log-level and log secure values in amvil.conf.
...
* Moved the check for an agent being disabled into ScanCore->agent_startup()
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
3674a47179
WIP - Working a tool to manually load updated server definition files.
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
aec22bb79c
Added a check in scan-network that finds/removes duplicate network interface names.
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
4800f7181f
* Updated ScanCore to boot a node that is off without a stop reason.
...
* Fixed a bug where anvil-safe-stop was not recording the stop-reason. Also made '--poweroff' an alias for '--power-off'.
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
acaacd9a86
* Created Storage->get_size_of_block_device() that takes a block device path and returns the size of the path, if it's found in the database.
...
* More work on the storage management of anvil-manage-server.
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
606bd8f1f0
Continuing work on anvil-manage-server.
...
Created Storage->get_storage_group_from_path() that takes a block device path and tried to find the Storage Group it belongs to.
Updated Storage->get_storage_group_data() to make it possible to look up a storage group UUID using the SG's name.
Updated DRBD->gather_data() to take a pre-generated XML via the new 'xml' parameter.
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Tsu-ba-me
063840ecb6
fix(tools): correct message_* string keys in striker-manage-vnc-pipes
3 years ago
Tsu-ba-me
8da318c933
fix(tools): patch failure to fix 2nd pipe after server migration
3 years ago
Tsu-ba-me
0f1c3d2435
chore(tools): remove unused function from striker-manage-vnc-pipes
3 years ago
Tsu-ba-me
cdb66019d3
fix(tools): avoid port conflict
3 years ago
Tsu-ba-me
7e447000b4
fix(cgi-bin): use unspecified instead of loopback address in SSH tunnel
3 years ago
Tsu-ba-me
b3b6da8259
chore(cgi-bin): remove debug log level from manage_vnc_pipes and its support scripts
3 years ago
Tsu-ba-me
549758b2f2
build(tools): include support scripts for manager_vnc_pipes endpoint into makefile
3 years ago
Tsu-ba-me
e50bfc7308
fix(tools): correct typo in passing server_uuid to get_vnc_info()
3 years ago
Tsu-ba-me
3a8f4c339b
fix(tools): use VNC port in variables table if available
3 years ago
Tsu-ba-me
e4436be17b
fix(tools): do checks and kills as root
3 years ago
Tsu-ba-me
bb155a5786
fix(tools): update job progress in catch-all case
3 years ago
Tsu-ba-me
ffc1fb096a
fix(tools): correct switch name typo in striker-manage-vnc-pipes
3 years ago
Tsu-ba-me
1fec288ad0
fix(tools): make striker-manage-vnc-pipes executable
3 years ago
Tsu-ba-me
7d9013a60b
fix(tools): allow striker-manage-vnc-pipes to be executed as a job
3 years ago
Tsu-ba-me
0935b9a990
feat(tools): move manage_vnc_pipes endpoint core logic to separate script
3 years ago
Tsu-ba-me
5459e610aa
fix(tools): auto-end tunnel script when connection breaks
3 years ago
Tsu-ba-me
d5724c1457
chore(tools): rename striker-start-ssh-tunnel->striker-open-ssh-tunnel
3 years ago
Tsu-ba-me
23d818cfff
fix(cgi-bin): avoid direct SSH calls
3 years ago
Digimer
e3d65d654c
* Continuing work on anvil-manage-server.
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
3f1c2dd38f
* Couple of small cleanups for fence_delay.
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
8d2e454d69
* Updated fence_delay to set the ownership of the log file to 'hacluster:haclient'. This should address https://github.com/digimer/fence_delay/issues/1
...
* WIP - COntinuing work on anvil-manage-server, far from done yet.
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
bc8b9274cb
WIP; Reworked anvil-manage-server to have a more interactive menu system (for the sections done so far).
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
28865780f8
* Updated Database->get_server_definitions() to take a specific server UUID, allowing just the one definition to be loaded. Also had it clear previous loads.
...
* Updated Server->parse_definition() to call DRBD->get_devices() so that referenced LVs can be loaded properly.
* Continued WIP in anvil-manage-server
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
623dbb0863
WIP; Restarted work on anvil-manage-server.
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
548c52701a
Updates Jobs->update_progress() to take a 'variables' hash reference, and to support logging as well.
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
1e159f548e
Added a couple notes for later dev.
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
39236e9b3f
Switched default graphics for new servers to 'vnc' instead of spice.
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
cebae28716
* WIP - Fixing a bug in scan-network where vnet devices aren't being recorded against their bridge.
...
* Updated scan-server to record the VNC port it is using in the database.
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
7e7b91b286
* Updates anvil-join-anvil to update corosync.conf to use the BCN1 link as the main knet network with the SN1 link as the backup link.
...
* Fixed a bug in Cluster->parse_cib() where the local machine's ready state was being set to the node name.
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
d7d418ee1b
* Fixed a bug in DRBD->gather_data() where the peer node's data was being recorded where the local node's data should have been saved.
...
* Fixed a bug in anvil-delete-server where, if a server was off already, the server would not be removed from pacemaker.
* WIP - continuing on scan-network
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
a697011b08
* Disabled debug logging in anvil-daemon.
...
* WIP - working on new scan-network scan agent.
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
6777104398
* Fixed a bug in anvil-daemon where, when an anvil-manage-power reboot run had triggered a reboot, anvil-daemon didn't set the job_progress to '100', causing constant reboots. Also fixed a bug where the log level was hard-set to '1' instead of '2' needed during debugging.
...
* Updated Jobs->get_job_uuid() to accept the new 'incomplete' parameter that, when set, will look for jobs whose progress is > 1 and < 100.
* Updated ScanCore-agent_startup() to take the new 'no_db_ok' parameter which returns with '0' if no DB is available and that parameter is set to '1'.
* Fixed a logging bug in 'anvil-join-anvil'.
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Fabio M. Di Nitto
7aea5e1b11
Switch to kmod-drdb
...
Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
3 years ago
Digimer
04f7571097
* Fixed a typo causing anvil-manage-power to not compile.
...
* Updated anvil-configure-host to register a reboot job when needed.
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
0c475d2a2e
* Fixed a couple logging bugs.
...
* Updated scan-cluster to get the CIB from pcs instead of reading the CIB from disk.
* Updated anvil-daemon to always call striker-prep-database at log level 2 while trying to find the cause of rare postgres config failures. Also updated striker-prep-database to use the new method of initializing the DB.
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
d3052c0229
* Finished Cluster->check_server_constraints() and added it to scan-cluster. This now makes sure servers don't roll back to their old host after it has been fenced and recovers.
...
* Completely disabled Network->check_network(), it's causing more problems than it solves.
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
e7a06fce72
* Disabling the periodic network health check in anvil-daemon.
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
30f478267a
* Forced anvil-daemon to log-level 2 and to enable secure logging to continue debugging setup issues.
...
* Fixed a undefined variable warning.
* Removed a debugging die from Database->resync_databases().
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
47fa126a3c
* Fixed a typo that blocked anvil-daemon from starting.
...
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
023f43eda9
* In the never-ending attempt to resolve the build consistency issues, this commit enables extra debugging logging and, hopefully, implements a fix in anvil-daemon where a job could be started repeatedly.
...
* Renamed the special job status 'scancore_startup' to 'anvil_startup', given it's handled by anvil-daemon.
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago
Digimer
5a343d6d75
* WIP; Started work on Cluster->check_server_constraints() that will track when a server's location constraint needs to be updated when the old preferred node is lost.
...
* Removed (for now) setting MTU in the ifcfg-X files during anvil-configure-host runs.
Signed-off-by: Digimer <digimer@alteeve.ca>
3 years ago