200 Commits (fd3596e4ab1c2d8355fbabb1107d68d6783c7cfd)

Author SHA1 Message Date
digimer be290bf561 This commit fixes a bug where the drbd kernel module build was being killed mid-compile, leaving DBRD unusable. 1 year ago
digimer f57ab1a78c * Updated anvil-daemon to not hold jobs at startup is the host isn't configured yet. 1 year ago
digimer 66c82e5e22 * Fixed a bug in anvil-update-system where updating a single package with --reboot wouldn't request a reboot. Finished reworking it so that a check is made to see if the kernel or DRBD kmod will be updated and, if so, removes the kmod-drbd RPMs prior to doing the update (as opposed to the sloppier check-on-error method). 1 year ago
digimer e278de4b5a The main change in this commit deals with anvil-daemon startup. During OS updates, it would pick up the queued update job and run it while the other --no-db one was still running. This could become an issue for other tasks in the future, so updated anvil-daemon to not run any jobs for the first minute after startup. Also updated it to see if an OS update is underway (given how it can start mid-RPM update, before packages like kmod-drbd are ready to build). While doing this, implemented caching of daily tasks (like agine out data, archiving data, network scans, etc) to only run once per day, period. As it was before, they would always run on anvil-daemon startup, then wait 24 hours. 1 year ago
digimer d741f4aa6f * Updated anvil-daemon to not exit on high RAM use is any job is running. 1 year ago
digimer 751687129a * Updated anvil-daemon to not exit on RAM use if anvil-update-system is running. 1 year ago
Tsu-ba-me 4f46bb43eb fix(tools): remove server screenshot fetching in anvil-daemon 1 year ago
Tsu-ba-me d95eb699f9 chore: disable web VNC, screenshot pieces to avoid libvirt deadlock 1 year ago
Tsu-ba-me d98df4b2a4 fix(tools): isolate non-striker tasks in anvil-daemon 1 year ago
Tsu-ba-me 560d60c7e8 fix(tools): get server screenshots every minute and punt to strikers WIP 1 year ago
digimer 1d12fb32b4 * Completed the new anvil-watch-drbd which replaces watch_drbd. 1 year ago
digimer c9e11fbbfc * Added checks to anvil-provision-server to fail out if either of the SN IPs are not found when generating a DRBD resource config. 1 year ago
digimer 156a0ca201 Updated anvil-daemon's new job launching logic to allow the restart of a running job that failed out early. 1 year ago
digimer 47f7a35df3 The main purpose of this commit is to add serial execution of similar jobs to help reduce race conditions for scripted jobs, like multiple server creation. 1 year ago
digimer 8f375c58a9 * Fixed a typo in anvil-daemon that prevented compiling. 2 years ago
digimer c50a1936c0 * This adds the new 'file_locations' -> 'file_location_ready' column and associated methods. This is set to TRUE/1 when the file referenced is found on disk and it is the expected size and md5sum. This is meant to allow programs to wait/watch or a file to be ready if they need to use it. Files are now checked periodically via anvil-daemon. 2 years ago
digimer 895f1ec262 This fixes a race condition when multiple servers are provisioned at (nearly) the same time. 2 years ago
digimer 7710d9d109 * Created the new anvil-manage-server-storage tool which will specifically handle managing a server's disks. 2 years ago
digimer a3988cc3e5 * Added System->configure_logind() to ensure that nodes are configured to ignore ACPI power button events so that IPMI-based fences work immediately. 2 years ago
Digimer 6d59399c73 * Updated the short OS list. 2 years ago
Digimer 9194eb3d09 * Updated System->check_if_configured() to record that a host is configured in /etc/anvil to make the system auto-mark as configured if the host is removed from the DB (or, more specifically, variables -> system::configured is lost). 2 years ago
Digimer f9ca6fb170 * This adds the new anvil-version-change tool which anvil-daemon will call on startup to handle checks for changes made over releases/updates. 2 years ago
Digimer 622fb84652 * Renamed the 'notifications' table to 'alert-override', better reflecting what it does. 2 years ago
Digimer a4ef93404c * Fixed a bug in DRBD->gather_data() to remove trailing commas for existing TCP ports. 2 years ago
Digimer ef3ac86162 * Fixed a bug where setting the db_in_use flag without a valid $ENV{_}. 2 years ago
Digimer 21738ab0d4 Added a bit more logging to the Database->mark_active method. 2 years ago
Digimer a81478f2bc * Updated 'db_in_use' state to add the caller's name to the state name. This is pulled out when logging stale locks that are being reaped, to help debug where stale locks are coming from. 2 years ago
Digimer e7cf8ac789 * Got more work done on anvil-manage-files. It now picks up new files on nodes/dr hosts in an Anvil! and downloads them if needed. 2 years ago
Digimer 5fea8ff46a * Adds the anvil-boot-server man page. 2 years ago
Digimer b3b185a43c * Added the alteeve-repo-setup man page and updated it to show that when called with '-h'. 2 years ago
Digimer d9910fc951 Finished the man page for anvil-daemon. 2 years ago
Digimer be612ff878 * Updated Get->switches() to take 'list' and 'man' parameters. With list, the passed in switches can be checked to ensure they're valid. With 'man', if set to the name of a man page (usually $THIS_FILE) will be displayed if --help, -h or -? are used. 2 years ago
Digimer cd220e97dc Disabled striker-prep-databas and set Database->configure_pgsql() calls to use debug => 2. 2 years ago
Digimer 7fd6185445 * Disabled firewalling for now. There appears to be an issue starting up with DRBD. 2 years ago
Digimer bce9e2caaf This is the first attempt at enabling firewalld completely. There is a decent chance that problems exist, so it won't be a surprise if a few more commits are needed to this branch before things work. 2 years ago
Digimer f2d06fa9b1 * Updated striker-parse-oui to only run if/when the system has been running for at least one hour. 2 years ago
Digimer ab9b00a2f7 * Updated anvil-daemon, in its daily checks, to disable ksm and ksmtuned daemons. 2 years ago
Digimer 911f7cfb6a This is another big commit with a lot of DB work. Getting closer to sorting out the frequent resyncs. 2 years ago
Digimer e6dcff1cf1 * Added a missing modified_date to ip_addresses in Database->get_ip_addresses(). 2 years ago
Digimer 1b70b49cf8 * Updated Network->find_matches() to try to populate the first and second parameters if they're not passed in. 2 years ago
Digimer 142be7674e * Fixed a bug in striker-scan-network where the scan wasn't running properly when no network was specifically given. 3 years ago
Digimer 0b41029db2 Reworked Database->_find_behind_databases to loop through tables, then databases when evaluating for resync. This is still racy but should be less racy as the time between counts of columns for a given table should be a lot shorter. Also re-enabled triggering resyncs based on the age of the most recent record. 3 years ago
Digimer 7212ea1c2f Fixed a bug where reaping db_in_use states wasn't restricted to the caller's host_uuid. 3 years ago
Digimer 74b7719cf5 * Created the new anvil-manage-host that can check/set if a host is configured. On Strikers, it can age out data, resync data, and check/set if the local database is active. 3 years ago
Digimer edf51adaec * Changed 'anvil-manage-power' to no longer set the job progress to 50 prior to calling a reboot. It now sets to 100 immediately. Also reduced the uptime timer to five minutes from ten. 3 years ago
Digimer 7b090e1623 * Updated Database->shutdown() to disconnect, stop the postgresql daemon, then reconnect. 3 years ago
Digimer 3fd0db15bf * This rather heavily reworks how database shutdowns works. It adds much more intelligent shutdown, tracking who is using the database, being able to mark a database as "offline" and waiting for users of the database to disconnect before it shuts down. 3 years ago
Digimer b234b79544 Updated anvil-daemon to check if anvil-sync-shared is running if the reported RAM use is too high. If so, it doesn't exit. This fixes an issue where anvil-sync-shared would loop forever as it would constantly be killed when downloading large files. 3 years ago
Digimer 68b1d12545 Updated anvil-daemon to not shutdown a striker DB until the striker host has been running for at least an hour. 3 years ago
Digimer f77f486775 Fixed a typo in scan-network 3 years ago