622 Commits (eafd6d5090030027fc295c56dd4448cc3f554b62)

Author SHA1 Message Date
digimer c9e11fbbfc * Added checks to anvil-provision-server to fail out if either of the SN IPs are not found when generating a DRBD resource config. 2 years ago
digimer 156a0ca201 Updated anvil-daemon's new job launching logic to allow the restart of a running job that failed out early. 2 years ago
digimer 47f7a35df3 The main purpose of this commit is to add serial execution of similar jobs to help reduce race conditions for scripted jobs, like multiple server creation. 2 years ago
digimer b6a249d5e7 * Updated Cluster->add_server() to set the preferred host based first on if the server is running on a node, and if not, on the primary node (where before it defaulted to node 1). 2 years ago
digimer b7abc481e6 Updated scan-cluster to check to see that migrate_to and migrate_from are given a timeout of 600s and an on-fail of "block". Updated Cluster->add_server() to set migrate_from to timeout=600s and on-fail=block as well. 2 years ago
digimer c82bd9d73a * Created the new anvil-watch-power tool that shows the status of UPSes known on the system, including their "on battery" state, charge percentage, estimated hold up time, etc. 2 years ago
digimer 0e57836c8f This commit addresses (hopefully) issue #329. 2 years ago
digimer 110dceb55e * Added a check to make sure files were ready before provisioning a server. 2 years ago
digimer c50a1936c0 * This adds the new 'file_locations' -> 'file_location_ready' column and associated methods. This is set to TRUE/1 when the file referenced is found on disk and it is the expected size and md5sum. This is meant to allow programs to wait/watch or a file to be ready if they need to use it. Files are now checked periodically via anvil-daemon. 2 years ago
digimer 895f1ec262 This fixes a race condition when multiple servers are provisioned at (nearly) the same time. 2 years ago
digimer 0874ad571a Updated anvil-safe-start to not give up on starting corosync/pacemaker if it fails on the first try. 2 years ago
digimer 83a527f4fa * Removed enabling anvil-safe-start out of the RPM and into anvil-join-anvil. 2 years ago
digimer 89eae7098e NOTE: This updates the reserved RAM to 8 GiB from 4 GiB! 2 years ago
digimer f9689a7106 Updated ocf:alteeve:server to look for /tmp/<resource>.fail' and, if that file exists, exits with rc:1. This is done to allow for testing. 2 years ago
digimer cf73d8ed36 * Updated System->configure_ipmi() to auto-configure DR hosts once they've been assigned a BCN IP address. 2 years ago
digimer efebd135eb * Removed more references to 'dr1_host_uuid' from the old way of linking DR hosts to Anvil! nodes. 2 years ago
Fabio M. Di Nitto 856809c723 Fix typo in log message 2 years ago
Fabio M. Di Nitto a6f2c2271e Fix typo in log message 2 years ago
digimer b144976853 This resolves Issue #310. 2 years ago
digimer 645f54ab89 This commit has more changes than I would normally like, but it's all linked to changing file uploads to rsync serially. 2 years ago
digimer 7773e5f9b8 * Updated logging in DRBD->get_devices(). 2 years ago
digimer e012d6016c Tha major point of this commit is to add the new 'anvil-manage-storage-groups' program that, well, manages storage groups. 2 years ago
digimer f8743a7435 * Further work on anvil-manage-dr. Now properly sanity checks that a valid server is passed. 2 years ago
digimer 1a217d21cf * Updated anvil-manage-dr to provide the ability to link anvil nodes to dr hosts. Also began work on making it work with the new DR links system. 2 years ago
digimer 17863404e3 * Updated Database->_age_out_data() to only run once per day, unless explicitely called with --age-out-database. 2 years ago
digimer ff69916a85 * Applied typo fixed from PR #286 (thanks, Deezzir!). Also moved all the raw prints into words.xml. 2 years ago
digimer 9d2f9c4d88 * Fixed a string key name typo. 2 years ago
digimer b8b4352117 * Added support for Migration Network configs in old striker and anvil-configure-host 2 years ago
digimer b27a43eaf7 * Updated striker to only require 6 interfaces when configuring a node. 2 years ago
digimer 0fa6ddebc5 Updated scan-network to see an interface state of 'activated' as up (used to check specifically for 'active'). 2 years ago
digimer a3988cc3e5 * Added System->configure_logind() to ensure that nodes are configured to ignore ACPI power button events so that IPMI-based fences work immediately. 2 years ago
digimer dfa93a1837 * Added 'setsid' to all 'virsh' calls as nested calls (ie: crm_resource -> ocf:alteeve:server -> virsh) would fail because virsh couldn't connect to a terminal. See: 2 years ago
digimer b666caec64 * Updated anvil-provision-server to handle startup when the peer doesn't create/connect it's DRBD resource (ie: node is offline). 2 years ago
digimer a5cee52153 * Fixed a bug in DRBD->get_devices() where old test host UUIDs were left hard-coded. 2 years ago
Digimer 6d59399c73 * Updated the short OS list. 2 years ago
Digimer f9ca6fb170 * This adds the new anvil-version-change tool which anvil-daemon will call on startup to handle checks for changes made over releases/updates. 2 years ago
Digimer 02e371ac56 Updated virsh OS list. 2 years ago
Digimer f6cbe7d1d2 * Fixed a bug in System->collect_ipmi_data() where double-quoted passwords were preventing reading of the sensor data. 2 years ago
Digimer 4ba1982183 This is the start of a set of changes needed to rework how we handle DRBD fence requests, so that they create location constraints instead of triggering a full stonith fence. 2 years ago
Digimer 6eb99a2168 * FInished the anvil-manage-alerts tool. It can now send test alerts at a user-requested alert level. 2 years ago
Digimer 8b7a44cf75 * Finished cleaning up the output of Machines. 2 years ago
Digimer 3e53c87a6b Formatted the output of anvil-manage-alerts data (not yet machines) to be more presentable. 2 years ago
Digimer 622fb84652 * Renamed the 'notifications' table to 'alert-override', better reflecting what it does. 2 years ago
Digimer 586ce6e5b9 * Got recipints working in anvil-manage-alerts(). 2 years ago
Digimer 35cf0c37fb * Updated System->check_ram_use() to set the maximum RAM based on the host type, and set those values in _set_default() so that the user can override if they want. 2 years ago
Digimer a6cd5c6604 * Starting work in the new anvil-manage-alerts, which will (when done), allow for management of mail servers, alert recipients, notification over-rides and to trigger test alerts. 2 years ago
Digimer bde0b2e7ec * Fixed a bug where deleting ports from a fence device in an Install Manifest would not cause the fence methods to be removed from the associated cluster. 2 years ago
Digimer 93427a7a38 * Updated Get->switches() to always support job-uuid. 2 years ago
Digimer c23c79cdf0 Added 'system::all::configured' to anvil-join-anvil to mark an explicit end of config. 2 years ago
Digimer 596855405f * Added variables to record when pacemaker and DRBD are configured. 2 years ago