273 Commits (bac8680b36afd8cac28edcddf48bb398e779840e)

Author SHA1 Message Date
Digimer 8fbf594002 Updated striker-prep-database to stop -> start postgres post-configure, and to connect -> disconnect to run the schema load logic. 3 years ago
Digimer 422d248cbe * Updated Database->insert_or_update_states() to not actually record unless the state_host_uuid exists in all available databases. 3 years ago
Digimer 7b090e1623 * Updated Database->shutdown() to disconnect, stop the postgresql daemon, then reconnect. 3 years ago
Digimer 513ce3b74e Created 'striker-db-status' that reports the status of the databases to external tools. It's basic, but it works. 3 years ago
Digimer 3fd0db15bf * This rather heavily reworks how database shutdowns work. It adds much more intelligent shutdown, tracking who is using the database, being able to mark a database as "offline" and waiting for users of the database to disconnect before it shuts down. 3 years ago
Digimer b234b79544 Updated anvil-daemon to check if anvil-sync-shared is running if the reported RAM use is too high. If so, it doesn't exit. This fixes an issue where anvil-sync-shared would loop forever as it would constantly be killed when downloading large files. 3 years ago
Digimer ec3b3d2ac9 Fixed a bug in Database->_age_out_data() where checking if a table existed was hard coded to one table. 3 years ago
Digimer d70b9a4956 Updated scancore and anvil-daemon to check their RAM use at the end of each loop and, if it's using more than 1 GiB of RAM, it sends an alert and exits. 3 years ago
Digimer a886653af1 * Updated scan-network to purge duplicate bridges and bonds. 3 years ago
Digimer 6229a0f6b6 Added '#!no_value!#' to be handled properly by Convert->format_mmddyy_to_yymmdd. 3 years ago
Digimer 892a475881 * Fixed a bug in Convert->format_mmddyy_to_yymmdd() where being passed '--' didn't return the same. 3 years ago
Digimer 032f20a267 * Fixed a bug in Database->_age_out_data() where, when all records in the history schema would be purged, the most recent record would not be preserved. The result is that nothing was purged, allowing tables to grow dramatically. The 'variables' table was also added to this age-out list. 3 years ago
Digimer 72038e8358 * Fixed a bug where ethtool's Media type contained tab characters that broke JSON when configuring the network interfaces. 3 years ago
Digimer 9cfd7b9b94 Created the new (and still in development) striker-file-manager to manage files from a Striker dashboard's command line. So far, it will only add files. 3 years ago
Digimer 65dfc22a38 Added an eval{} call around Database->query()'s ->prepare() DBI call to better handle a lost database handle. 3 years ago
Digimer 9eec6c4977 * Created ScanCore->check_temperature_direct() based around the start logic from ScanCore->post_scan_analysis_striker() temperature check, and updated the latter to use the former. 3 years ago
Digimer 75a4c8d709 * Moved the logic to add the local database to a Striker's anvil.conf from striker-prep-database to Database->_add_to_local_config(). 3 years ago
Digimer b517117bc1 * Did more work on trying to figure out why initial setup of the database was failing. I believe it was because, in anvil-daemon, after calling 'prep_database' we called ->connect() _without_ 'check_if_configured' set. Next round of function testing should help confirm if this was the case. 3 years ago
Digimer 090c59a873 Updated striker-prep-database to enable extra logging to help diagnose a function test build failure problem. 3 years ago
Digimer 32f29861a4 * Fixed a bug (maybe) that was causing users to get immediately logged out of the WebUI 3 years ago
Digimer 257a998743 * Updated Database->configure_pgsql() to use 'postgresql-setup --initdb --unit postgresql' instead of the deprecated 'initdb' switch. 3 years ago
Digimer 63c45430bb * Updated scan-network to clear duplicate IP addresses. 3 years ago
Digimer a034583213 * Updated DRBD->gather_data() to record TCP/IP data between connections of two hosts. 3 years ago
Digimer 5c07179aa6 * Resolved a words.xml conflict. 3 years ago
Digimer e60a1b46b3 Fixed bugs related to automatic database startup and conditional backup loading. 3 years ago
Digimer 4e9882812d * Fixed a bug where the periodic database dumps on the primary database Striker were not sync'ing to peers. Also fixed a bug where these periodic dumps weren't running at all. 3 years ago
Digimer 72b17ff1f9 * Reworked how databases are stopped, now being handled in anvil-daemon. This way, initial starts will still do traditional resyncs, then shut down. This should allow the best of both worlds, where data is not lost on striker start/stop loss/recovery, but operate normally otherwise without delays. 3 years ago
Madison Kelly 922899ea78 * WIP: Working on a new method of failing over between which Striker is the active database, instead of running N-number of databases all the time. 3 years ago
Digimer 76eb09393f Fixed a bug found in the last commit causing newly-added VGs to storage group membership to use the wrong UUID as the internal VG UUID. 3 years ago
Digimer 9edf698c37 Updated Database->get_storage_group_data() to determine when a node or DR host needs to be removed from a Storage group, or when a member of an Anvil! needs to be added to a storage group. 3 years ago
Digimer 2f8b1fb72e Updated anvil-provision-server so that when the OS type is 'win7', set the disk to sata and the NIC to e1000e. Also updated it to store the virt-install call in the 'variables' table and write it out to /mnt/shared/provision. 3 years ago
Digimer 213babaaf2 Trying to fix a bug where vnet devices keep reporting as having returned. 3 years ago
Digimer e40d0e2444 Fixed a bug where if a database is pingable but the pgsql database is down, and it's the first database tested (or local), then the DB handle used to read / quote fails. 3 years ago
Digimer 4c7bb45ab9 Fixed a race condition where configuring the IPMI BMC would appear to fail because the BMC wouldn't report the user list after a cold reset. 3 years ago
Digimer 15d8309095 This commit adds scan agent DB connection info caching to help minimize the number of unnecessary DB resync checks that happen. 3 years ago
Digimer 606bd8f1f0 Continuing work on anvil-manage-server. 3 years ago
Digimer 28865780f8 * Updated Database->get_server_definitions() to take a specific server UUID, allowing just the one definition to be loaded. Also had it clear previous loads. 3 years ago
Digimer 6db16ca313 * Fixed a bug in Database->insert_or_update_network_interfaces() where the passed-in network_interface_uuid parameter was not being set properly. 3 years ago
Digimer fd5d3c0434 * Finished (though testing still needed) scan-network. 4 years ago
Digimer 30f478267a * Forced anvil-daemon to log-level 2 and to enable secure logging to continue debugging setup issues. 4 years ago
Digimer 023f43eda9 * In the never-ending attempt to resolve the build consistency issues, this commit enables extra debugging logging and, hopefully, implements a fix in anvil-daemon where a job could be started repeatedly. 4 years ago
Digimer 11b1900e1b Note: Continuing to resolve the build issues with network startup. Expect breakage. 4 years ago
Digimer 80bdac8e34 * Updated the pacemaker server config to drop the stop timeout to 5 minutes and the migration timeout to 10 minutes. This will avoid blocking the entire cluster when a stop or migrate operation times out. Will update scan-server to clean these up when they happen. 4 years ago
Digimer daca6c887b * This contains a fairly major change to how time stamps are handled. All INSERT and UPDATE calls now generate a new timestamp via Database->refresh_timestamp, instead of using 'sys::database::timestamp'. This was done in response to finding a bug where tables in a database differed in both counts of public and private schemas (ip_addresses table, specifically) that failed to resync because the timestamps were re-used too often. 4 years ago
Digimer 16c20ae69c * Updated Tools->catch_sig() to use return code 0 instead of 255 so that systemd doesn't think our daemons failed on stop. 4 years ago
Digimer 24ec17f8f7 * Added a new parameter called 'sensitive' to Database->connect() that returns after connections before any ancillary checks are done, minimizing connect time. 4 years ago
Digimer 73267a8ea9 * WIP - Slowly working on anvil-manage-server 4 years ago
Digimer 4dcd505753 * Biggest change in this commit; scan-apc-pdu and scan-apc-ups now only run on Striker dashboards! This was because we found that if two machines ran their agents at the same time, the response time from SNMP read requests grew a lot. This meant it was likely a third, fourth and so on machine would also then have their scan agent runs while the existing runs were still trying to process, causing the SNMP reads to get slower still until timeouts popped. 4 years ago
Digimer 8807915bb7 The theme of this commit is database cleanup and fixes. 4 years ago
Digimer 6abe06f125 The theme of these commits is improving DB responsiveness. 4 years ago