966 Commits (753358a13bc2f0a1a4b78e25c65b74fca23d820c)

Author SHA1 Message Date
Digimer 0b2407e78b * Added a really simple DRBD monitoring tool to the repo, will likely remove later. 4 years ago
Digimer 4e843baa17 * Started work on parsing the gathered DRBD data. 4 years ago
Digimer 4d5ec72026 * Started work on the scan-drbd scan agent. Got it to the point that it is gathering needed data. 4 years ago
Digimer cda51e562d * Finished porting scan-hpacucli, the last M2 scan agent! 4 years ago
Digimer 1c00060d6e * Finished porting scan-storcli! This was the largest scan agent to migrate to M3. 4 years ago
Digimer 51de6c721f * Created scan-ipmitool (needs more testing but seems to work now). Logical straight port from M2. 4 years ago
Digimer 9c92b6bbb8 * Created Database->get_tables_from_schema() that returns an ordered list of tables in a database schema file, removing the need for scan agents to manually provide a list for agent start-up / DB purge. 4 years ago
Digimer 7e6e345513 * Updated tools/striker-manage-install-target to now check to see if the striker is a RHEL host. If not, the packages in the RHEL High Availability add-on are merged into the main package list. If it is RHEL though, a search is made for nodes that it can access and once one is found that is a matching RHEL version / arch, and has Internet access, it is used as a proxy to download the packages in the HA add on and then pulls those packages to the local repo. 4 years ago
Digimer 713f77bc78 * Finally finished scan-apc-ups! Proved way harder than anticipated... (over a solid week of work!) In M3, this agent is no longer host-bound, and the UPSes to scan based on entries in 'upses' using this scan agent. 4 years ago
Digimer d677d19ca0 * Moved Database->check_condition_age to Alert. 4 years ago
Digimer 2f4a06f2e0 * Updated System->call() to take the 'timeout' parameter which, when set, prepends the call with 'timeout X <shell_call>' to make it easier to deal with calls that could potentially hang. 4 years ago
Digimer 33101f969a * Fixed several bugs related to tracking server boots, migrations and shut downs in the anvil database. The 'ocf:alteeve:server' now has (mostly?) safe integration with the Anvil! database. This was mostly done by updating Servers->boot_virsh(), ->shutdown_virsh() and ->migrate_server(). 4 years ago
Digimer 46f1a05789 * Got the code in scan-server to the point where it _should_ now gracefully and automatically detect changes to a server's definition originatin from the database (via Striker), directly editing the on-disk definition file, or editing via libvirt tools (like virt-manager). Still needs to be tested though. 4 years ago
Digimer 1a1fa7ce88 * Created Cluster->get_anvil_uuid() that returns the 'anvil_uuid' of a given 'host_uuid'. 4 years ago
Digimer e6e4c7d530 * Moved Server->_parse_definition() to -> parse_definition() to make it a publid method. 4 years ago
Digimer e240a32a19 * Created Cluster->parse_crm_mon and updated Cluster->parse_cib() to determine what state a server is in and which host has a server. 4 years ago
Digimer 4dfe0cb5a0 * Created Cluster->boot_server, ->shutdown_server and ->migrate_server methods that handle booting, migrating and shutting down servers. Also created the private method ->_set_server_constraint which is used by migrate and boot to set resource constraints to control where a server boots or migrates to. 4 years ago
Digimer 0f7267eae1 * Moved the '_host_name', '_short_host_name', and '_domain_name' private methods in Tools.pm over to Get.pm (removing the leading '_' in the method names). 4 years ago
Digimer 44dc4f4b47 * Fixed a bug in Words->string() where not having the parameter 'file' set caused the default 'words.xml' to be specified, preventing strings from scan agents from being used. 4 years ago
Digimer 4be943ebf3 * Finished (initial) testing of scan-hardware. The first M3 scan agent is done! 4 years ago
Digimer 0a1dc809a2 * Created the ScanCore.pm module with the first 'agent_startup' method which generalized scan agent start up. 4 years ago
Digimer 925664762a * Created Database->check_for_schema() (not finished) that will check/add a schema for a scan agent. 4 years ago
Digimer dc5ec9c264 * Added checking the email server config to anvil-daemon. Email works now! 4 years ago
Digimer fe7cdb18fb * Updated all methods to add (or fix) logging the method entry. 4 years ago
Digimer 911523dfce * Got a lot of work done in generating emails. Doesn't work yet, but the code to generate emails for recipients using their preferred language and alert level is done (though limited testing so far). 4 years ago
Digimer 4f39272d9a * Fixed a big in Jobs->get_job_details() where jobs weren't being found via 'switches::job-uuid'. 4 years ago
Digimer 82acb4e104 * Fixed a resync bug where bridges needed to sync before bonds 4 years ago
Digimer 49682a01d7 * Fixed a bug in Database->disconnect() where the database idenitification number wasn't being removed, so connecting again triggered the duplicate DB connection check. 4 years ago
Digimer aad68b8ed0 * Got email reconfiguration when a mail queue is over ten minutes old (needs testing). 4 years ago
Digimer 767148b538 * Updated Database->get_mail_servers() to clear old stored data, and to pull out the list of when a mail server was last used. 4 years ago
Digimer b2c7fd95fb * Renamed the ScanCore unit file to scancore. 4 years ago
Digimer 1498e1b53c * Got server migration working using ocf:alteeve:server in a test environment! 4 years ago
Madison Kelly 30f2b3fa8e * Switched all hash 'local' keys to be the host's short user name. Untested, likely bugs to be fixed in the next commit. 4 years ago
Digimer 47203490a9 * Working on getting live migration to work with ocf:anvil:striker using the environment variables that pacemaker sets. Incomplete, but getting close. 4 years ago
Digimer e35800c413 * Fixed up (though more testing/work needed) to ocf:alteeve:server to get it working with DRBD resources referenced using '/dev/drbd/by-res/...'. 4 years ago
Digimer 39b4a912af * Remember in the last commit how I said that DRBD->update_global_common() was done? Well that was cute, 'cause it was quite broken. Now it's working. 4 years ago
Digimer d647014ad1 * Created (finished but not yet tested) DRBD->update_global_common() to update DRBD's global_common.conf file. 4 years ago
Digimer ef208fd3fb * Finished the logic for adding stonith devices and levels to pacemaker! More testing is needed though, bugs expected, but it adds them. 4 years ago
Digimer c27cc7507f * Renamed striker-parse-fence-agents to anvil-parse-fence-agents and changed anvil-daemon to run it on all machines. 4 years ago
Digimer 61f4dcc41f * Updated Cluster->parse_cib() to pull out fencing (stonith) devices and levels. 4 years ago
Digimer 3c2f25a860 * Added 'fence_delay' fence agent to handle the corner cases where an IPMI BMC had crashed until a power cycle, and PDU fencing was effected, but failed to report as such. 4 years ago
Digimer d2d5d7b460 * Fixed a bug in Striker->load_manifest() where fences were parsed twice, the second time missing a hash reference. 4 years ago
Digimer 1bf71f8428 * Updated Database->get_hosts() to run host_ipmi the Log->is_secure if the string contains 'passw'. 4 years ago
Digimer ef70f90ba4 * Updated Log->entry() to set the log file handle to UTF-8 when opened. 4 years ago
Digimer dcfdf1127c * Got more work done on System->configure_ipmi(). It should now configure the IP address, subnet mask and default gateway using information from the manifest and anvil-join-anvil data. 4 years ago
Digimer 1fa63d2ea3 * Added 'anvil_uuid' as a set parameter in Database->get_hosts(). 4 years ago
Digimer 99afd2e936 * Fixed a bug in Database->manage_anvil_conf() where initializing a host set the DB information with the wrong DB port and password. 4 years ago
Digimer 345d2e33d4 * Updated Cluster->parse_cib() to pre-fill some hashes to avoid undefined errors. 4 years ago
Digimer 01974d7efe * Finished (though testing is needed) the updated ocf:alteeve:server resource agent. It now handles starting and stopping libvirtd and drbd daemons on-demand. 4 years ago
Digimer dcd1fd1492 * Created Cluster->check_node_status() that checks the status of a node (in pacemaker). 4 years ago