76 Commits (7925a3f42c5fa56dd87624bafd4866332885b039)

Author SHA1 Message Date
digimer 1152c50f3a Added pcsd config, and -y support. 8 months ago
digimer 3e63b726d3 Added node 2 joining an Anvil! node if not started by node 1. 8 months ago
digimer e00dec7cba Added loading existing corosync/authkey from peer during rebuild. 8 months ago
digimer 7ecd0a4d70 Starting work on rejoining a replacement subnode to an Anvil! node 8 months ago
digimer 60759cd9aa No longer fail is fence method exists already when trying to create it. 9 months ago
digimer cca4a7ea1b Bumped logging. 9 months ago
digimer 2d92f339c2 Fixed a bug related to changing the hostname during a manifest run 9 months ago
digimer c31880c2ec Fixed the ordering holding on hosts and network config. 10 months ago
digimer b8c73fd3f2 Replaced hosts management in anvil-join-anvil with System->update_hosts. 10 months ago
digimer 247cf0a238 Fixed using the wrong words key. 10 months ago
digimer f40d25f2dd Fixed a bug with /etc/hosts generation 10 months ago
digimer 27152845fd Attempts to create an existing fence method no longer fails. 10 months ago
digimer b85e38d20d Added the short and full host names to hosts 11 months ago
digimer d8ceb7fbf4 Updated to add all subnode nets to /etc/hosts before forming cluster 11 months ago
digimer 476b285607 Added a wait_for_access() function to anvil-join-anvil 11 months ago
digimer 023bcf46a4 Fixed a bug with hung cluster startup in some cases 11 months ago
digimer 49f194eac6 Fixed issue #515; anvil-join-anvil updates hostnames properly now 1 year ago
digimer 68521cdab7 Updated striker-get-screenshots to set permissions properly. 1 year ago
digimer 55b1380031 Finished (but need more testing) of Server->locate(). 1 year ago
digimer 30248760b5 Moved the wait_on_subnodes function call to earlier in the script. 1 year ago
digimer 77bae80534 Added default values for MTU and DNS if not set in a manifest. 1 year ago
digimer fcbace6713 Updated anvil-join-anvil to hold if either node is still running anvil-configure-host 1 year ago
digimer e480337239 Fixed wait loop for subnodes 1 year ago
digimer ef042eef25 Cleaned up logging while waiting for subnodes. 1 year ago
digimer 5d5270486e Added a wait loop when forming node clusters. 1 year ago
digimer 8ba613952c Typo fix. 2 years ago
digimer 83a527f4fa * Removed enabling anvil-safe-start out of the RPM and into anvil-join-anvil. 2 years ago
digimer efebd135eb * Removed more references to 'dr1_host_uuid' from the old way of linking DR hosts to Anvil! nodes. 2 years ago
digimer fea10e5bb1 * Prefixed all 'virsh' calls with 'setsid --wait' to help prevent future hangs if the call happens without a shell. 2 years ago
digimer a3988cc3e5 * Added System->configure_logind() to ensure that nodes are configured to ignore ACPI power button events so that IPMI-based fences work immediately. 2 years ago
Digimer c23c79cdf0 Added 'system::all::configured' to anvil-join-anvil to mark an explicit end of config. 2 years ago
Digimer 596855405f * Added variables to record when pacemaker and DRBD are configured. 2 years ago
Digimer e37f487704 Fixed a bug in System->check_ssh_keys where the 'admin' user's RSA keys were owned by root. 3 years ago
Digimer 4c7bb45ab9 Fixed a race condition where configuring the IPMI BMC would appear to fail because the BMC wouldn't report the user list after a cold reset. 3 years ago
Digimer 6cbdc388d4 Fixed a bug where corosync's configuration of a backup ring was broken. 3 years ago
Digimer 04cb116c1b Updated anvil-parse-fence-agents to validate each fence agent's metadata is valid before adding it to the unified XML. 3 years ago
Digimer cebae28716 * WIP - Fixing a bug in scan-network where vnet devices aren't being recorded against their bridge. 3 years ago
Digimer 7e7b91b286 * Updates anvil-join-anvil to update corosync.conf to use the BCN1 link as the main knet network with the SN1 link as the backup link. 3 years ago
Digimer 6777104398 * Fixed a bug in anvil-daemon where, when an anvil-manage-power reboot run had triggered a reboot, anvil-daemon didn't set the job_progress to '100', causing constant reboots. Also fixed a bug where the log level was hard-set to '1' instead of '2' needed during debugging. 4 years ago
Digimer 0f43961568 * This commit lowers the logging levels of some debug log entries. It's to help diagnose occassional function test failures with an unknown source. 4 years ago
Digimer 5b4bfa747c * Reworked the anvil-join-anvil job parsing to help diagnose occassional faults. Also changed a fatal parse error to one that allows the run to be retried. 4 years ago
Digimer 6abe06f125 The theme of these commits is improving DB responsiveness. 4 years ago
Digimer 49a700d68f * Fixed a bug in anvil-join-anvil where the desired DNS servers were not matching existing list of used DNS servers, even when they are the same already. 4 years ago
Digimer e036515df3 * Got anvil-safe-start to the point where is starts the cluster stack. Need to create the 'anvil-boot-server' and 'anvil-shutdown-server' before it can be completed, so those files have been added. 4 years ago
Digimer fb0836f912 * THe get_cpu endpoint was completed. 4 years ago
Digimer 5e9e7e4dde * Removed debug logging from tools. 4 years ago
Digimer 54496cbeb0 * Added a check to Database->get_ip_addresses() to check is a hash is set before using it, to help avoid unitialized variable messages. 4 years ago
Digimer 5db09f565d * Updated anvil-join-anvil to actively call a cluster start once per minute while waiting for initial startup. 4 years ago
Digimer 3733220b50 * Updated Log->entry() to prefix log lines with the short 'job-uuid', when the log entry is coming from a program running as a job. This is meant to make it easier to break up what log lines belong to what jobs, if multiple jobs are running at the same time (ie: when initializing multiple nodes / dr hosts in parallel). 4 years ago
Digimer 1a520b03d5 * Cleaned up a lot of logging in anvil-daemon and tools it calls. 4 years ago