Net config #660

Open
wants to merge 28 commits into
base: main

Member

digimer commented May 8, 2024

No description provided.

Signed-off-by: digimer <mkelly@alteeve.ca>
* Added the 'up' parameter to Network->collect_data(), which brings up
  any interface we configured that is currently down.
* Updated scan-network to call Network->collect_data() with 'up' set if
  the uptime is less than ten minutes.

Signed-off-by: digimer <mkelly@alteeve.ca>
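
The uptime gate described in this commit can be sketched as follows. This is a minimal illustration in Python, not the project's Perl code: the function names, the 600-second constant, and the use of /proc/uptime-style input are all assumptions made for the example.

```python
# Hypothetical sketch of the "bring interfaces up only shortly after boot" rule.
# None of these names come from the Anvil! code base.

UPTIME_THRESHOLD_SECONDS = 600  # "ten minutes", per the commit message


def parse_uptime(proc_uptime_text: str) -> float:
    """Return the uptime in seconds from text in /proc/uptime format.

    The first whitespace-separated field of /proc/uptime is the number of
    seconds since boot.
    """
    return float(proc_uptime_text.split()[0])


def should_bring_up(uptime_seconds: float) -> bool:
    """Return True only while the host has been up for less than the threshold."""
    return uptime_seconds < UPTIME_THRESHOLD_SECONDS
```

A caller would pass the result as the 'up' flag, so that interfaces are only forced up during the first minutes after boot and are left alone afterwards.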
Member Author

digimer commented May 8, 2024

retest this please

Member Author

digimer commented May 10, 2024

retest this please

digimer added 17 commits May 15, 2024 23:00
Signed-off-by: digimer <mkelly@alteeve.ca>
* Also updated to directly call a reboot.

Signed-off-by: digimer <mkelly@alteeve.ca>
* Also fixed a bug in Database->get_storage_group_data() by adding a
  column that was missing when adding members.

Signed-off-by: digimer <mkelly@alteeve.ca>
* This is meant to deal with a case where, when a DB is added to
  anvil.conf but that new entry is not yet in hosts, the program crashes
  because of a duplicate key when calling insert_or_update_hosts for all
  DBs.

Signed-off-by: digimer <mkelly@alteeve.ca>
Signed-off-by: digimer <mkelly@alteeve.ca>
Signed-off-by: digimer <mkelly@alteeve.ca>
* Also bumped nmcli sleeps to 5s.

Signed-off-by: digimer <mkelly@alteeve.ca>
Signed-off-by: digimer <mkelly@alteeve.ca>
hours.

Signed-off-by: digimer <mkelly@alteeve.ca>
Signed-off-by: digimer <mkelly@alteeve.ca>
Signed-off-by: digimer <mkelly@alteeve.ca>
manifests.

Signed-off-by: digimer <mkelly@alteeve.ca>
Signed-off-by: digimer <mkelly@alteeve.ca>
* Added an explicit 'sync' call when writing to logs. TO BE REMOVED!
* Disabled anvil-monitor-daemons and anvil-monitor-performance in case
  they are somehow triggering program exits.
* Converted prints to Log->entry calls in anvil-change-password.
* Added PID state info logging for running jobs.

Signed-off-by: digimer <mkelly@alteeve.ca>
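
For readers unfamiliar with why a sync on log writes helps when a program exits unexpectedly, here is a minimal Python sketch. It is an assumption-laden illustration: the commit does not say how the 'sync' is issued, and this example uses a per-file flush plus `os.fsync()` rather than a system-wide sync. The function name is hypothetical.

```python
import os


def write_log_entry(path: str, message: str) -> None:
    """Append one line to the log and force it to disk before returning."""
    with open(path, "a", encoding="utf-8") as handle:
        handle.write(message + "\n")
        handle.flush()             # push Python's userspace buffer to the OS
        os.fsync(handle.fileno())  # ask the OS to commit the file to storage
```

The cost is one forced disk write per entry, which is the usual reason such a call is treated as a temporary debugging aid.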
* Added a sync call to Tools->nice_exit() to ensure logs are flushed.
* Updated Database->quote() to be in an eval block to better handle
  cases where the DB handle is lost.
* Added an hourly check to anvil-daemon and moved the memory-in-use
  check to run only once per hour.

Signed-off-by: digimer <mkelly@alteeve.ca>
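
The idea of wrapping the quote call so that a lost DB handle does not kill the caller can be sketched as below. This is a Python analogue of a Perl eval block, not the actual Database->quote() code. The classes and the `safe_quote` name are hypothetical stand-ins, and returning `None` on failure is an assumption about what a caller might want.

```python
class LiveHandle:
    """Stand-in for a working database handle."""

    def quote(self, value) -> str:
        # Simple SQL-style quoting: double any embedded single quotes.
        return "'" + str(value).replace("'", "''") + "'"


class LostHandle:
    """Stand-in for a handle whose connection has dropped."""

    def quote(self, value) -> str:
        raise ConnectionError("database handle is gone")


def safe_quote(handle, value):
    """Return the quoted value, or None if the handle could not quote it."""
    try:
        return handle.quote(value)
    except Exception:
        # The caller decides whether to reconnect or give up.
        return None
```

With this shape, a dropped connection surfaces as a value the caller can test for instead of an uncaught exception.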
* Updated the fence_ipmilan timeouts to 30 seconds to help debug fence
  config failures.

Signed-off-by: digimer <mkelly@alteeve.ca>
Member Author

digimer commented May 30, 2024

retest this please

Member

fabbione commented Jun 2, 2024

retest this please

* Created Database->reconnect() to clean up reconnecting to the DBs.

Signed-off-by: Madison Kelly <mkelly@alteeve.com>
Member Author

digimer commented Jun 3, 2024

retest this please

Member Author

digimer commented Jun 3, 2024

retest this please

Member

fabbione commented Jun 4, 2024

retest this please

* Before, it would try to reconnect to just the lost DB, which could
  trigger an error.

Signed-off-by: Madison Kelly <mkelly@alteeve.com>
* Updated Database->query() to track when a specific DB to read from is
  passed. If so, and that is lost, return an error. If not, and another
  DB is available, switch to it.
* Updated Database->write() to skip trying to write to a lost DB.

Signed-off-by: Madison Kelly <mkelly@alteeve.com>
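
The read and write rules in this commit can be sketched as a small decision function. This is an illustrative Python model only, not the Database->query() or Database->write() implementation: the function names, the use of sets of UUID strings, and the "pick the first remaining DB in sorted order" fallback are all assumptions.

```python
def pick_read_target(available, requested_uuid=None, default_uuid=None):
    """Choose which DB to read from, or return None to signal an error.

    available:      set of UUIDs for DBs that are currently connected.
    requested_uuid: a DB the caller explicitly asked to read from, if any.
    default_uuid:   the DB normally used for reads when none is requested.
    """
    if requested_uuid is not None:
        # A specific DB was requested: never silently substitute another one.
        return requested_uuid if requested_uuid in available else None
    if default_uuid in available:
        return default_uuid
    # No specific request and the default is lost: switch to any other DB.
    return next(iter(sorted(available)), None)


def write_targets(all_uuids, lost):
    """Return the DBs to write to, skipping any that are marked lost."""
    return [uuid for uuid in all_uuids if uuid not in lost]
```

The key design point is that an explicit target is treated as a hard requirement, while an implicit one may fail over.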
Signed-off-by: Madison Kelly <mkelly@alteeve.com>
Signed-off-by: Madison Kelly <mkelly@alteeve.com>
Signed-off-by: Madison Kelly <mkelly@alteeve.com>
* Updated Database->reconnect() to take 'lost_uuid' and, if passed,
  delete the cached file handle before calling ->disconnect().
* Updated Database->query() to return an empty hash reference instead of
  '!!error!!', as almost always, callers do an array count, which
  triggered errors as it's not a hash reference. Updated docs to reflect
  this.

Signed-off-by: Madison Kelly <mkelly@alteeve.com>
Member

fabbione commented Jun 6, 2024

retest this please

3 participants