765 Commits (78b75c649a46898f438759d2c91944aba66730f2)

Author SHA1 Message Date
Tsu-ba-me 962bf52225 fix: add signal INT, TERM hooks to Tools.pm 1 year ago
digimer 458cb267da * Fixed a bug in Cluster->get_primary_host_uuid() where servers were not loaded before trying to calculate RAM use. 1 year ago
digimer 4dc1b0e117 * Added a check to Network->get_company_from_mac() to manually set the company to KVM/qemu if the prefix is 52:54:00. 1 year ago
digimer 02c3d204ea * Updated anvil-update-system to set 'job_data' to track reboots, and striker-update-cluster to read it. 1 year ago
digimer 3016fb875b * Reworded striker-update-cluster to use anvil-update-system for on-system OS updates. 1 year ago
Tsu-ba-me a7751da153 fix: rename, relocate function to find qemu-kvm processes 1 year ago
Tsu-ba-me c3c69733d9 fix: correct base port check, server info extract, vnc alive assign in Server.pm 1 year ago
Tsu-ba-me 3cce3c39b8 fix: add Server subroutine to extract server VM info from qemu-kvm process(es) 1 year ago
digimer d56b7f9a84 * Created (but not finished!) the new striker-update-cluster tool. 1 year ago
digimer a7ebe45f76 This adds the new 'striker-collect-debug' tool that collects all potentially useful debug info into a single tarball. 1 year ago
digimer 1b8b0bc493 * Created the new 'anvil-manage-server-storage' with the first role of reloading a DRBD resource. 1 year ago
digimer 7fbed10864 * Updated Remote->call() to take the new 'background' parameter. 1 year ago
digimer ae55ca9187 * Applied the fix for properly aging out reserved TCP ports to DRBD->get_next_resource(). 1 year ago
digimer ea95d26cc5 * Fixed a bug in DRBD->get_next_resource() where reserved minor numbers were not being released. Also added a new parameter, "minor_only", that returns the next minor number but doesn't bother processing TCP ports. 1 year ago
digimer 65af56d5bd * Updated Database->insert_or_update_jobs() to not look for jobs that are complete when no job_uuid is passed. 1 year ago
digimer e0316da88b * Got anvil-manage-server-storage working enough to grow existing disk's hard drive sizes, and to insert/eject optical disks. 1 year ago
digimer 1d12fb32b4 * Completed the new anvil-watch-drbd which replaces watch_drbd. 1 year ago
digimer 0aa72498db * This adds the new tool 'striker-check-machines' which simply walks through all known physical machines and checks to see if they're accessible and powered on. 1 year ago
Tsu-ba-me b3f2644d07 fix: allow parameter to overwrite cgi input in Account->login 1 year ago
Tsu-ba-me 226c423af0 fix: allow param override in generate_manifest in Striker.pm 1 year ago
digimer 156a0ca201 Updated anvil-daemon's new job launching logic to allow the restart of a running job that failed out early. 1 year ago
digimer cc15eca6fb * Added anvil-watch-power to git. 1 year ago
digimer 47f7a35df3 The main purpose of this commit is to add serial execution of similar jobs to help reduce race conditions for scripted jobs, like multiple server creation. 1 year ago
digimer dda0fbd7d5 * Updated DRBD->allow_two_primaries() to be more careful at evaluating peer-node-id. 1 year ago
digimer b6a249d5e7 * Updated Cluster->add_server() to set the preferred host based first on if the server is running on a node, and if not, on the primary node (where before it defaulted to node 1). 1 year ago
digimer b03587967b * Updated Cluster->add_server() to batch the creation of the server and the location constraints in one commit to the CIB. 1 year ago
digimer b7abc481e6 Updated scan-cluster to check to see that migrate_to and migrate_from are given a timeout of 600s and an on-fail of "block". Updated Cluster->add_server() to set migrate_from to timeout=600s and on-fail=block as well. 1 year ago
digimer c82bd9d73a * Created the new anvil-watch-power tool that shows the status of UPSes known on the system, including their "on battery" state, charge percentage, estimated hold up time, etc. 1 year ago
digimer bc3d04ad2e * Updated Cluster->add_server() to wait up to 15 seconds for a server to appear to ensure that the pcs call to add the server is made with the right requested running state. 1 year ago
digimer 0e57836c8f This commit addresses (hopefully) issue #329. 1 year ago
digimer c50a1936c0 * This adds the new 'file_locations' -> 'file_location_ready' column and associated methods. This is set to TRUE/1 when the file referenced is found on disk and it is the expected size and md5sum. This is meant to allow programs to wait/watch for a file to be ready if they need to use it. Files are now checked periodically via anvil-daemon. 2 years ago
digimer 26fa3c7e32 Fixed a bug where Get->available_resources() was missing LVM/storage group data in some cases. 2 years ago
digimer 510db70253 Another attempt to resolve the storage group race condition. This moves the check for auto-assembly to scan-lvm. It only works for the first assemble, after that the user can/should use anvil-manage-storage-groups. 2 years ago
digimer e483840ceb Second attempt to fix the storage group race condition. This time, we only let node 1 assemble storage groups. 2 years ago
digimer d64044c7d1 Test fix for storage group race condition. 2 years ago
digimer 9a58f4d1ff * This is a small commit to increase logging while chasing down a race condition issue with assembling storage groups. 2 years ago
digimer 895f1ec262 This fixes a race condition when multiple servers are provisioned at (nearly) the same time. 2 years ago
digimer e7537b0ca3 * Fixed a bug where, when DRBD->gather_data() calls 'drbdadm dump-xml' and the output includes usage data, it breaks XML parsing. 2 years ago
digimer dc7b909bfc More logging to debug storage group race condition 2 years ago
digimer bd575c6a7d Bumped logging for storage group management. 2 years ago
digimer 89eae7098e NOTE: This updates the reserved RAM to 8 GiB from 4 GiB! 2 years ago
digimer 025c2a6f54 * Updated Email->get_next_server() to ignore DELETED mail servers, and it now loads mail servers if not yet in memory. 2 years ago
digimer 1afa7ce09e * Created Cluster->recover_server() that uses crm_resource to try to recover a server that has entered a FAILED state. 2 years ago
digimer f9689a7106 Updated ocf:alteeve:server to look for '/tmp/<resource>.fail' and, if that file exists, exit with rc:1. This is done to allow for testing. 2 years ago
digimer cf73d8ed36 * Updated System->configure_ipmi() to auto-configure DR hosts once they've been assigned a BCN IP address. 2 years ago
digimer 1c274ba58d * Fixed a bug in anvil-delete-server that was preventing the complete deletion of a server if the DRBD resource had already been removed. 2 years ago
Deezzir 109aa1ba3d docs: added annotation for the new arg 2 years ago
Deezzir 7d5f18b20d fix: introduced optional arg for clean_spaces 2 years ago
Deezzir 9241b5ef6a docs: added annotation for the new arg 2 years ago
Deezzir deac1fc6a8 fix: introduced optional arg for clean_spaces 2 years ago