digimer
25a0454dce
Better handling of lost DB connections.
...
* Added a sync call to Tools->nice_exit() to ensure logs are flushed.
* Updated Database->quote() to be in an eval block to better handle
cases where the DB handle is lost.
* Added an hourly check to anvil-daemon and moved the memory in use
check to run only once per hour.
Signed-off-by: digimer <mkelly@alteeve.ca>
7 months ago
digimer
b86493fff4
More logging to debug apparent hang
...
* Added an explicit 'sync' call when writing to logs. TO BE REMOVED!
* Disabled anvil-monitor-daemons and anvil-monitor-performance in case
this is somehow trigging program exits.
* Converted prints to Log->entry calls in anvil-change-password
* Added PID state info logging for running jobs.
Signed-off-by: digimer <mkelly@alteeve.ca>
7 months ago
digimer
4766ceff70
Added logging to debug network config issue.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
7 months ago
digimer
8dc3a8262f
Updated pod on requiring 'new' for manifest_uuid when creating new
...
manifests.
Signed-off-by: digimer <mkelly@alteeve.ca>
7 months ago
digimer
566887462e
Fixed parameter names being sent to Striker->generate_manifest().
...
Signed-off-by: digimer <mkelly@alteeve.ca>
7 months ago
digimer
a3ac5cf7f8
Fixed a bug that prevented install manifests from being saved.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
7 months ago
digimer
368673eac2
Added a flag for when NM is changed and, if set, NM is restarted.
...
* Also bumped nmcli sleeps to 5s.
Signed-off-by: digimer <mkelly@alteeve.ca>
7 months ago
digimer
acf30229ef
Added code to restart NetworkManager if needed
...
Signed-off-by: digimer <mkelly@alteeve.ca>
7 months ago
digimer
ab33c716cb
Created a specific check that there's a hosts entry for each DB
...
* This is meant to deal with a case where, when a DB is added to
anvil.conf but that new entry is not yet in hosts, the program crashes
because of a duplicate key when calling insert_or_update_hosts for all
DBs.
Signed-off-by: digimer <mkelly@alteeve.ca>
7 months ago
digimer
3d50f45984
Added a 1 second delay to nmcli calls
...
* Also fixed a bug Database->get_storage_group_data() to add a missing
column to adding members.
Signed-off-by: digimer <mkelly@alteeve.ca>
7 months ago
digimer
033052f449
Shortened the time to reboot when no DBs come back after net reconfig
...
* Also updated to directly call a reboot.
Signed-off-by: digimer <mkelly@alteeve.ca>
7 months ago
digimer
8e53993f67
Shortened the anvil-daemon job start up delay.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
7 months ago
digimer
6d121dc0c0
Mapped each interface name in match.interface-name to a UUID lookup.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
8 months ago
digimer
7925a3f42c
* Added more man pages.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
8 months ago
digimer
1152c50f3a
Added pcsd config, and -y support.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
8 months ago
digimer
3e63b726d3
Added node 2 joining an Anvil! node if not started by node 1.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
8 months ago
digimer
e00dec7cba
Added loading existing corosync/authkey from peer during rebuild.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
8 months ago
digimer
bd2e4c46ae
Updated Network->load_ips() to use the device_name when available.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
8 months ago
digimer
7ecd0a4d70
Starting work on rejoining a replacement subnode to an Anvil! node
...
Signed-off-by: digimer <mkelly@alteeve.ca>
8 months ago
digimer
44aa0fb8d9
Bumped logging to debug periodic strike init resync failure
...
Signed-off-by: digimer <mkelly@alteeve.ca>
9 months ago
digimer
76e66e6fa6
Added anvil.conf to log collection.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
9 months ago
digimer
60759cd9aa
No longer fail is fence method exists already when trying to create it.
...
* Also fixed a string insertion variable typo
Signed-off-by: digimer <mkelly@alteeve.ca>
9 months ago
Tsu-ba-me
5d086f5e79
fix(tools): log websockify output
9 months ago
Tsu-ba-me
468a6d18ee
fix(tools): get all logs in audit/
9 months ago
Tsu-ba-me
11fd4098c4
fix(tools): collect /var/log/audit/audit.log for debug
9 months ago
digimer
937d8ed2ef
Added collecting corosync to log collection
...
Signed-off-by: digimer <mkelly@alteeve.ca>
9 months ago
digimer
cca4a7ea1b
Bumped logging.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
9 months ago
digimer
a28bb083fa
Commented out a stray debug print
...
Signed-off-by: digimer <mkelly@alteeve.ca>
9 months ago
Fabio M. Di Nitto
f1584c770d
striker-collect-debug: disable screenshot collection by default
...
Closes : #620
Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
9 months ago
Fabio M. Di Nitto
510af7e4b1
striker-collect-debug: remove tempdir when done
...
Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
9 months ago
Fabio M. Di Nitto
b64cf7052a
striker-collect-debug: align output format to other parts of the tool
...
Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
9 months ago
Fabio M. Di Nitto
3b018e56c6
striker-collect-debug: create output dir only when necessary
...
Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
9 months ago
Fabio M. Di Nitto
da67156ee1
striker-collect-debug: cleaner messages
...
Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
9 months ago
Fabio M. Di Nitto
635f38b489
anvil-safe-stop: don´t use locked version of pcs
...
Add pcs_direct tool path and use it for anvil-safe-stop
Closes : #623
Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
9 months ago
digimer
371aaae6a8
Updated to set location constraints to minimize migrations
...
Signed-off-by: digimer <mkelly@alteeve.ca>
9 months ago
digimer
21c8084b2f
Updated to support Sys::Virt::Domain generating PNG screenshots
...
* This should work with older versions still generating PPM screenshots.
Signed-off-by: digimer <mkelly@alteeve.ca>
9 months ago
digimer
15a79d7bac
Added '--all' to anvil-manage-daemons to enable optional daemons.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
9 months ago
digimer
b74900c2fc
Beginning to repurpose anvil-manage-server for server resync
...
Signed-off-by: digimer <mkelly@alteeve.ca>
9 months ago
Fabio M. Di Nitto
50ad874909
striker-collect-debug: fix collection of cib.xml
...
Closes : #534
Signed-off-by: Fabio M. Di Nitto <fabbione@fabbione.net>
9 months ago
Tsu-ba-me
f506ec4ac8
fix(tools): allow operations (currently set) on hash children in execute mode of access module
9 months ago
Tsu-ba-me
b5264131c4
fix(tools): allow reference to children of in execute operation of access module
9 months ago
digimer
2d92f339c2
Fixed a bug related to changing the hostname during a manifest run
...
* The original hostname would be used to form the cluster, even though
the hostname was updated.
Signed-off-by: digimer <mkelly@alteeve.ca>
9 months ago
digimer
870c990632
Added support for multiple IP's per interface
...
* Created Database->get_mac_to_ip()
* Updated Database->insert_or_update_mac_to_ip() to find an entry using
both the IP and MAC address.
* Updated Network->get_ips() to store only the first IP it finds on an
interface as the main IP (for use in /etc/hosts, etc) and to store it
and any other IPs in a new hash.
* Updated scan-network to use the new hash above to record them in the
'mac_to_ip' table. Similarly, before marking an IP as removed, it
checks to see if it's an alternate IP.
Signed-off-by: digimer <mkelly@alteeve.ca>
10 months ago
digimer
ab0b1a262b
Reworked Network->wait_for_bonds() to be ->wait_for_networks()
...
* Renamed the old ->wait_for_networks() to be ->wait_for_nm_online().
* The new ->wait_for_networks() waits for all interfaces we manage to be
'activated' before returning.
Signed-off-by: digimer <mkelly@alteeve.ca>
10 months ago
digimer
0f1ff02e78
Added alarms around remote calls to better handle dropped networks.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
10 months ago
digimer
2f5fb32769
Quieted logging
...
Signed-off-by: digimer <mkelly@alteeve.ca>
10 months ago
digimer
c31880c2ec
Fixed the ordering holding on hosts and network config.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
10 months ago
digimer
b8c73fd3f2
Replaced hosts management in anvil-join-anvil with System->update_hosts.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
10 months ago
digimer
495cb90ca6
Created Network->wait_for_network to hold startup for NM to be up.
...
Added the call to Network->wait_for_network to pause scancore and
anvil-daemon startups until NetworkManager says it's up and running.
Signed-off-by: digimer <mkelly@alteeve.ca>
10 months ago
digimer
5cf0bbc6be
Added Want=NetworkManager to anvil-daemon and scancore unit files.
...
Signed-off-by: digimer <mkelly@alteeve.ca>
10 months ago