705 Commits (7bf7c3747cdd7ec2764b6e074400754200f46ed5)

Author SHA1 Message Date
digimer 79ff96cee5 * Fixed a bug in DRBD->manage_resource() that prevented new resources from being created. 1 year ago
digimer 3ee30e6e24 * Updated DRBD->allow_two_primaries() to gracefully fail if the peer isn't connected. 1 year ago
digimer 55aaf7876e Starting work on checking if the peer is connected before managing allow-two-primaries. 1 year ago
digimer 59ade94124 * Added PID logging as an option, and enabled it in ocf:alteeve:server 1 year ago
Fabio M. Di Nitto fc75bda6ef ocf:alteeve:server: add support for log levels and bump timeouts 1 year ago
digimer a0ff080741 * Deleted some old unused code from Cluster->assemble_storage_groups(). 1 year ago
digimer ed480cf1cb * Fixed a double-$ bug in Remote->_check_known_hosts_for_target() 1 year ago
digimer be290bf561 This commit fixes a bug where the drbd kernel module build was being killed mid-compile, leaving DBRD unusable. 1 year ago
digimer d68adb5b4e * Updated anvil-manage-power to not reboot if anvil-version-changes is running (which, if it's taking time, is generating new kmods). 1 year ago
digimer 556e91238d * Updated Network->find_access() to clear the data from previous scans, which fixes a bug where checking multiple hosts could return stale data for the previous host. 1 year ago
digimer 66c82e5e22 * Fixed a bug in anvil-update-system where updating a single package with --reboot wouldn't request a reboot. Finished reworking it so that a check is made to see if the kernel or DRBD kmod will be updated and, if so, removes the kmod-drbd RPMs prior to doing the update (as opposed to the sloppier check-on-error method). 1 year ago
Tsu-ba-me c46ff969f3 fix: add UUID to server process during find in Server.pm 1 year ago
Tsu-ba-me 4bdd206e0c fix: replace ps|grep with pgrep to reduce run time 1 year ago
Tsu-ba-me 54c98f89ab fix: allow extend remote call with openssh options 1 year ago
digimer 7bd76c10dc Major thing in this commit is reworking striker-update-cluster to work without expecting anvil-daemon to be running on target machines. Similarly, they had to be able to work when the Striker DBs were not available. This is to account for cases where the Striker dashboards have updated, and the schema has changed, preventing the not-yet-updated DR hosts and subnodes from being able to use the DB. To do this, anvil-safe-stop, anvil-update-system, and anvil-shutdown-server had to be updated to use the new --no-db switch, which tells then to run without the database being available. 1 year ago
Tsu-ba-me 92a4027f9f fix: add UUID to server process during find in Server.pm 1 year ago
Tsu-ba-me 9aa2937929 fix: replace ps|grep with pgrep to reduce run time 1 year ago
Tsu-ba-me e01c773eee fix: allow extend remote call with openssh options 1 year ago
digimer 458cb267da * Fixed a bug in Cluster->get_primary_host_uuid() where servers were not loaded before trying to calculate RAM use. 1 year ago
digimer 4dc1b0e117 * Added a check to Network->get_company_from_mac() to manually set the company to KVM/qemu if the prefix is 52:54:00. 1 year ago
digimer 02c3d204ea * Updated anvil-update-system to set 'job_data' to track reboots, and striker-update-cluster to read it. 1 year ago
digimer 3016fb875b * Reworded striker-update-cluster to use anvil-update-system for on-system OS updates. 1 year ago
Tsu-ba-me a7751da153 fix: rename, relocate function to find qemu-kvm processes 1 year ago
Tsu-ba-me c3c69733d9 fix: correct base port check, server info extract, vnc alive assign in Server.pm 1 year ago
Tsu-ba-me 3cce3c39b8 fix: add Server subroutine to extract server VM info from qemu-kvm process(es) 1 year ago
digimer d56b7f9a84 * Created (but not finished!) the new striker-update-cluster tool. 1 year ago
digimer a7ebe45f76 This adds the new 'striker-collect-debug' tool that collects all potentially useful debug info into a single tarball. 1 year ago
digimer 1b8b0bc493 * Created the new 'anvil-manage-server-storage' with the first role of reload a DRBD resource. 2 years ago
digimer 7fbed10864 * Updated Remote->call() to take the new 'background' parameter. 2 years ago
digimer ae55ca9187 * Applied the fix for TCP ports aging out reserved TCP ports properly to DRBD->get_next_resource(). 2 years ago
digimer ea95d26cc5 * Fixed a bug in DRBD->get_next_resource() where reserved minor numbers were not being released. Also added a new parameter, "minor_only", that returns the next minor number but doesn't bother processing TCP ports. 2 years ago
digimer 65af56d5bd * Updated Database->insert_or_update_jobs() to not look for jobs that are complete when no job_uuid is passed. 2 years ago
digimer e0316da88b * Got anvil-manage-server-storage working enough to grow existing disk's hard drive sizes, and to insert/eject optical disks. 2 years ago
digimer 1d12fb32b4 * Completed the new anvil-watch-drbd which replaces watch_drbd. 2 years ago
digimer 0aa72498db * This adds the new tool 'striker-check-machines' which simply walks through all known physical machines and checks to see if they're accessible and powered on. 2 years ago
Tsu-ba-me b3f2644d07 fix: allow parameter to overwrite cgi input in Account->login 2 years ago
Tsu-ba-me 226c423af0 fix: allow param override in generate_manifest in Striker.pm 2 years ago
digimer cc15eca6fb * Added anvil-watch-power to git. 2 years ago
digimer 47f7a35df3 The main purpose of this commit is to add serial execution of similar jobs to help reduce race conditions for scripted jobs, like multiple server creation. 2 years ago
digimer dda0fbd7d5 * Updated DRBD->allow_two_primaries() to be more careful at evaluating peer-node-id. 2 years ago
digimer b6a249d5e7 * Updated Cluster->add_server() to set the preferred host based first on if the server is running on a node, and if not, on the primary node (where before it defaulted to node 1). 2 years ago
digimer b03587967b * Updated Cluster->add_server() to batch the creation of the server and the location constraints in one commit to the CIB. 2 years ago
digimer b7abc481e6 Updated scan-cluster to check to see that migrate_to and migrate_from are given a timeout of 600s and an on-fail of "block". Updated Cluster->add_server() to set migrate_from to timeout=600s and on-fail=block as well. 2 years ago
digimer c82bd9d73a * Created the new anvil-watch-power tool that shows the status of UPSes known on the system, including their "on battery" state, charge percentage, estimated hold up time, etc. 2 years ago
digimer bc3d04ad2e * Updated Cluster->add_server() to wait up to 15 seconds for a server to appear to ensure that the pcs call to add the server with the right requested running state. 2 years ago
digimer 0e57836c8f This commit addresses (hopefully) issue #329. 2 years ago
digimer c50a1936c0 * This adds the new 'file_locations' -> 'file_location_ready' column and associated methods. This is set to TRUE/1 when the file referenced is found on disk and it is the expected size and md5sum. This is meant to allow programs to wait/watch or a file to be ready if they need to use it. Files are now checked periodically via anvil-daemon. 2 years ago
digimer 26fa3c7e32 Fixed a bug where Get->available_resources() was missing LVM/storage group data in some cases. 2 years ago
digimer 510db70253 Another attempt to resolve the stoage group race condition. This moves the check for auto-assembly to scan-lvm. It only works for the first assemble, after that the user can/should use anvil-manage-storage-groups. 2 years ago
digimer e483840ceb Second attempt to fix the storage group race condition. This time, we only let node 1 assemble storage groups. 2 years ago