900 Commits (eafd6d5090030027fc295c56dd4448cc3f554b62)

Author SHA1 Message Date
digimer 0aa72498db * This adds the new tool 'striker-check-machines' which simply walks through all known physical machines and checks to see if they're accessible and powered on. 2 years ago
Tsu-ba-me b3f2644d07 fix: allow parameter to overwrite cgi input in Account->login 2 years ago
Tsu-ba-me 226c423af0 fix: allow param override in generate_manifest in Striker.pm 2 years ago
digimer 156a0ca201 Updated anvil-daemon's new job launching logic to allow the restart of a running job that failed out early. 2 years ago
digimer cc15eca6fb * Added anvil-watch-power to git. 2 years ago
digimer 47f7a35df3 The main purpose of this commit is to add serial execution of similar jobs to help reduce race conditions for scripted jobs, like multiple server creation. 2 years ago
digimer dda0fbd7d5 * Updated DRBD->allow_two_primaries() to be more careful at evaluating peer-node-id. 2 years ago
digimer b6a249d5e7 * Updated Cluster->add_server() to set the preferred host based first on if the server is running on a node, and if not, on the primary node (where before it defaulted to node 1). 2 years ago
digimer b03587967b * Updated Cluster->add_server() to batch the creation of the server and the location constraints in one commit to the CIB. 2 years ago
digimer b7abc481e6 Updated scan-cluster to check to see that migrate_to and migrate_from are given a timeout of 600s and an on-fail of "block". Updated Cluster->add_server() to set migrate_from to timeout=600s and on-fail=block as well. 2 years ago
digimer c82bd9d73a * Created the new anvil-watch-power tool that shows the status of UPSes known on the system, including their "on battery" state, charge percentage, estimated hold up time, etc. 2 years ago
digimer bc3d04ad2e * Updated Cluster->add_server() to wait up to 15 seconds for a server to appear to ensure that the pcs call to add the server with the right requested running state. 2 years ago
digimer 0e57836c8f This commit addresses (hopefully) issue #329. 2 years ago
digimer c50a1936c0 * This adds the new 'file_locations' -> 'file_location_ready' column and associated methods. This is set to TRUE/1 when the file referenced is found on disk and it is the expected size and md5sum. This is meant to allow programs to wait/watch or a file to be ready if they need to use it. Files are now checked periodically via anvil-daemon. 2 years ago
digimer 26fa3c7e32 Fixed a bug where Get->available_resources() was missing LVM/storage group data in some cases. 2 years ago
digimer 510db70253 Another attempt to resolve the stoage group race condition. This moves the check for auto-assembly to scan-lvm. It only works for the first assemble, after that the user can/should use anvil-manage-storage-groups. 2 years ago
digimer e483840ceb Second attempt to fix the storage group race condition. This time, we only let node 1 assemble storage groups. 2 years ago
digimer d64044c7d1 Test fix for storage group race condition. 2 years ago
digimer 9a58f4d1ff * This is a small commit to increase logging while chasing down a race condition issue with assembling storage groups. 2 years ago
digimer 895f1ec262 This fixes a race condition when multiple servers are provisioned at (nearly) the same time. 2 years ago
digimer e7537b0ca3 * Fixed a bug where, when DRBD->gather_data() calls 'drbdadm dump-xml' and the output includes usage data, it breaks XML parsing. 2 years ago
digimer dc7b909bfc More logging to debug storage group race condition 2 years ago
digimer bd575c6a7d Bumped logging for storage group management. 2 years ago
digimer 89eae7098e NOTE: This updates the reserved RAM to 8 GiB from 4 GiB! 2 years ago
digimer 025c2a6f54 * Updated Email->get_next_server() to ignore DELETED mail servers, and it now loads mail servers if not yet in memory. 2 years ago
digimer 1afa7ce09e * Created Cluster->recover_server() that uses crm_resource to try to recover a server that has entered a FAILED state. 2 years ago
digimer f9689a7106 Updated ocf:alteeve:server to look for /tmp/<resource>.fail' and, if that file exists, exits with rc:1. This is done to allow for testing. 2 years ago
digimer cf73d8ed36 * Updated System->configure_ipmi() to auto-configure DR hosts once they've been assigned a BCN IP address. 2 years ago
digimer 1c274ba58d * Fixed a bug in anvil-delete-server that was preventing the complete deletion of a server if the DRBD resource had already been removed. 2 years ago
Deezzir 109aa1ba3d docs: added annotation for the new arg 2 years ago
Deezzir 7d5f18b20d fix: introduced optional arg for clean_spaces 2 years ago
Deezzir 9241b5ef6a docs: added annotation for the new arg 2 years ago
Deezzir deac1fc6a8 fix: introduced optional arg for clean_spaces 2 years ago
digimer ddc6965b60 * Fixed a bug where references to files on Anvil! nodes was broken in anvil-provision-server and anvil-manage-files. 2 years ago
digimer efebd135eb * Removed more references to 'dr1_host_uuid' from the old way of linking DR hosts to Anvil! nodes. 2 years ago
digimer 41fb8baeda * Fixed a bug in Database->get_storage_group_data() that was deleting DR host storage group members. 2 years ago
digimer 8ff40ec42c * Fixed a SQL query bug in Database->get_drbd_data(). 2 years ago
digimer 040bc02e26 * This adds the new Database->get_drbd_data() that, like ->get_lvm_data, collates the DRBD data collected by scan-drbd into more readibly parsable data structure. 2 years ago
digimer 8e0e51544c * Continued work on anvil-manage-server-storage. 2 years ago
digimer b144976853 This resolves Issue #310. 2 years ago
digimer fea10e5bb1 * Prefixed all 'virsh' calls with 'setsid --wait' to help prevent future hangs if the call happens without a shell. 2 years ago
digimer 7891c9b2b1 * Fixed a bug in Network->load_ips() where interfaces were being marked as type 'bridge' or 'bond'. 2 years ago
digimer 5dbdd20d7e * Fixed a bug in Network->load_ips() where the IP address on a bridge or bond was having the device name recorded incorrectly. 2 years ago
digimer ab3e8afe6e Fixed a bug in Storage->push_file() where file path wasn't updated from incoming to files, preventing the push to other hosts from working. Also fixed a minor issue where the file size was sometimes 0, making transfer calculations useless. 2 years ago
digimer 254f7ef4e2 This should fix the tracking of what files belong where, using the new DR links system. It also should finish (though testing is still needed) the serial rsync issue. 2 years ago
digimer 645f54ab89 This commit has more changes than I would normally like, but it's all linked to changing file uploads to rsync serially. 2 years ago
digimer 7710d9d109 * Created the new anvil-manage-server-storage tool which will specifically handle managing a server's disks. 2 years ago
digimer 9751c883cb * Updated Cluster->assemble_storage_groups() to remove refrences to anvil_dr1_host_uuid. Also added the logic for auto-adding DR host's VGs to a storage group. Commented it out though as, for now, this might be a bad idea. Needs more thought. 2 years ago
digimer 7773e5f9b8 * Updated logging in DRBD->get_devices(). 2 years ago
digimer d88fde7733 Updated DRBD->delete_resource() to use '--force' instead of 'echo Yes' (which no longer works). 2 years ago