-
Notifications
You must be signed in to change notification settings - Fork 594
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
AWX Operator Fails to Install AWX Containers/Instance #284
Comments
Blew the entire thing away and restarted fresh. Service pods are stuck 0/4 pending. It's been an additional 45 minutes now. This is the exact task that continuously fails over and over again with no real output/log.
Output {
"level":"error",
"ts":1620229749.932002,
"logger":"controller-runtime.controller",
"msg":"Reconciler error",
"controller":"awx-controller",
"request":"default/awx",
"error":"event runner on failed",
"stacktrace":
"github.com/go-logr/zapr.(*zapLogger).Error
pkg/mod/github.com/go-logr/zapr@v0.1.1/zapr.go:128
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).reconcileHandler
pkg/mod/sigs.k8s.io/controller-runtime@v0.6.0/pkg/internal/controller/controller.go:258
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).processNextWorkItem
pkg/mod/sigs.k8s.io/controller-runtime@v0.6.0/pkg/internal/controller/controller.go:232
sigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).worker
pkg/mod/sigs.k8s.io/controller-runtime@v0.6.0/pkg/internal/controller/controller.go:211
k8s.io/apimachinery/pkg/util/wait.BackoffUntil.func1
pkg/mod/k8s.io/apimachinery@v0.18.2/pkg/util/wait/wait.go:155
k8s.io/apimachinery/pkg/util/wait.BackoffUntil
pkg/mod/k8s.io/apimachinery@v0.18.2/pkg/util/wait/wait.go:156
k8s.io/apimachinery/pkg/util/wait.JitterUntil
pkg/mod/k8s.io/apimachinery@v0.18.2/pkg/util/wait/wait.go:133
k8s.io/apimachinery/pkg/util/wait.Until
pkg/mod/k8s.io/apimachinery@v0.18.2/pkg/util/wait/wait.go:90"
} Logs
|
@bandwiches we will need more information to understand what is going on. kubectl get awx -o yaml awx
kubectl describe deployment awx
kubectl describe statefulset awx-postgres
kubectl get pods
kubectl get events Thanks! |
I'm seeing the same thing in minikube. here is the output from mine. Saw the same error in kubernetes on centos7 as well. Fresh install with all latest binaries. describe_awx.txt |
For the sake of clarity, I feel I should state that I'm using minikube since it is recommended by the AWX install guide. get_awx.txt (Edit) I see a CPU warning (insufficient CPU) for the AWX pod. I have to say, this is a dedicated VM w/2 CPU and 2GB RAM. This VM has had no issues running AWX v15 and v17. New install method introduced in v19 all of a sudden complains about resources? Understandable that this could change from version to version, but it would be nice to know minimal system requirements now that it's an issue. |
Here is the snippet of the error i'm seeing which i believe is exactly like @bandwiches error. {"level":"error","ts":1620310325.2259731,"logger":"controller-runtime.controller","msg":"Reconciler error","controller":"awx-controller","request":"default/awx","error":"event runner on failed","stacktrace":"github.com/go-logr/zapr.(*zapLogger).Error\n\tpkg/mod/github.com/go-logr/zapr@v0.1.1/zapr.go:128\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).reconcileHandler\n\tpkg/mod/sigs.k8s.io/controller-runtime@v0.6.0/pkg/internal/controller/controller.go:258\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).processNextWorkItem\n\tpkg/mod/sigs.k8s.io/controller-runtime@v0.6.0/pkg/internal/controller/controller.go:232\nsigs.k8s.io/controller-runtime/pkg/internal/controller.(*Controller).worker\n\tpkg/mod/sigs.k8s.io/controller-runtime@v0.6.0/pkg/internal/controller/controller.go:211\nk8s.io/apimachinery/pkg/util/wait.BackoffUntil.func1\n\tpkg/mod/k8s.io/apimachinery@v0.18.2/pkg/util/wait/wait.go:155\nk8s.io/apimachinery/pkg/util/wait.BackoffUntil\n\tpkg/mod/k8s.io/apimachinery@v0.18.2/pkg/util/wait/wait.go:156\nk8s.io/apimachinery/pkg/util/wait.JitterUntil\n\tpkg/mod/k8s.io/apimachinery@v0.18.2/pkg/util/wait/wait.go:133\nk8s.io/apimachinery/pkg/util/wait.Until\n\tpkg/mod/k8s.io/apimachinery@v0.18.2/pkg/util/wait/wait.go:90"} |
Spot on. |
I have the exact same error, but on a bare-metal kubernetes cluster: |
For your case, it looks the issue is related with the CPU (like you mentioned) NAME READY STATUS RESTARTS AGE
awx-5b58db49c-9gslf 0/4 Pending 0 7m3s
awx-operator-5595d6fc57-92txg 1/1 Running 0 10m
awx-postgres-0 1/1 Running 0 7m14s
LAST SEEN TYPE REASON OBJECT MESSAGE
87s Warning FailedScheduling pod/awx-5b58db49c-9gslf 0/1 nodes are available: 1 Insufficient cpu. Looking at your deployment, we can see it's using the default awx-web:
Image: quay.io/ansible/awx:19.1.0
Port: 8052/TCP
Host Port: 0/TCP
Requests:
cpu: 1
memory: 2Gi
....
awx-task:
Image: quay.io/ansible/awx:19.1.0
Port: <none>
Host Port: <none>
Args:
/usr/bin/launch_awx_task.sh
Requests:
cpu: 500m
memory: 1Gi
.... Please note the suggested values (memory and cpu) are still the same (see https://github.com/ansible/awx-operator/pull/93/files) and you can override it to fulfill your needs. That should the job for you. Please let us know. |
Same thing here @exodusprime1337 LAST SEEN TYPE REASON OBJECT MESSAGE
2s Warning FailedScheduling pod/awx-5b58db49c-bfwnt 0/1 nodes are available: 1 Insufficient memory.
21m Normal SuccessfulCreate replicaset/awx-5b58db49c Created pod: awx-5b58db49c-bfwnt
awx-web:
Image: quay.io/ansible/awx:19.1.0
Port: 8052/TCP
Host Port: 0/TCP
Requests:
cpu: 1
memory: 2Gi
Requests:
cpu: 500m
memory: 1Gi If you run allocatable:
cpu: 7800m
ephemeral-storage: "222240964241"
hugepages-1Gi: "0"
hugepages-2Mi: "0"
memory: 31547268Ki
pods: "250"
capacity:
cpu: "8"
ephemeral-storage: 235495Mi
hugepages-1Gi: "0"
hugepages-2Mi: "0"
memory: 32173956Ki
pods: "250"
> kubectl top nodes
NAME CPU(cores) CPU% MEMORY(bytes) MEMORY%
p70 763m 9% 12685Mi 41% |
@tchellomello thanks for the update there. I'm following you, but I have a serious concern about the AWX install tutorial since it gives a bare minimum config and that leads to this result. Perhaps there should be more cross-communication between the two packages to ensure that the minimal config is actually the bare minimum? These settings are never mentioned in install doc. Edit - One issue is that AWX INSTALL.md doesn't have any mention of minimal requirements thus making the transition from v17 to v19 even harder since what worked before, may no longer work as "default". While I understand requirements may change, it would also be nice to know that the minimal requirements/default have changed. |
@bandwiches I hear you, I agree that the documentation has lots of room to improve, and please if you see any place that could use some enhancement, do not hesitate to submit a PR. In regards to the https://github.com/ansible/awx-operator/blob/devel/deploy/crds/awx_v1beta1_molecule.yaml, that is used on the |
I would love to, except I think the In regards to your response about system settings - understood and that's fair, no qualms about that. I was running into another issue once I was able to resolve the resources issue and I feel it's actually still appropriate here. The
Second issue - minikube service IP. I don't see anywhere that this is configurable, however I'll admit that I may be overlooking it given how many different repo's I've had to visit today. This actually presents 2 issues (1) now we're required to route to the host first for the underlying subnet access and (2) there's no consideration for organizational overlap if that subnet is already in use. I believe the default underlying network is |
Hi bandwiches |
I am having an issue installing AWX under K3S on both CentOS 8.5 and Rocky 8.5. I am trying to follow the steps from the below blog post: https://computingforgeeks.com/install-and-configure-ansible-awx-on-centos/ I have repeated the build multiple times stepping up my VM CPU/RAM resources. Initially tried 2c/4g then 4g/8g, then 8c/16gb. The VM has a single 50GB disk with automatic partitioning from the OS installer. The Minimal software selection was used. I have disabled both SELinux and firewalld. K3s installs and I can deploy the AWX Operator. When I try installing AWX with the command "kubectl apply -f awx-instance-deployment.yml -n awx", it appears to successfully deploy the aws-postgresql container but never starts to deploy the 4 pod AWX container. The output of the following commands is included here: kubectl get awx -o yaml awx debug.get-awx.txt Thanks, |
And then I searched the current issues here and found this: Seems to be my issue -- chanign nodeport to clusterip allowed it to bring up the AWX pods. [root@d-1-cfg-awx-c8 awx-operator]# kubectl get pods -n awx Now I need to work on the ClusterIP access as it does not seem to be mapping to a high port for access: [root@d-1-cfg-awx-c8 awx-operator]# kubectl get service -n awx |
Don't mean to necrobump this, but I just installed it following https://computingforgeeks.com/how-to-install-ansible-awx-on-ubuntu-linux and the issue I had with Redis not launching was that the CPU needed to be set to host in Proxmox, not KVM or QEMU |
ISSUE TYPE
AWX Operator fails to perform installation.
SUMMARY
Had an instance of 17.0.1 running, don't care if the data persists either.
Performed data migration following Data Migration instructions.
Performed install of AWX Operator following INSTALL.md
ENVIRONMENT
STEPS TO REPRODUCE
Follow INSTALL.md
EXPECTED RESULTS
Expected to see pods/AWX instance
ACTUAL RESULTS
minikube kubectl apply -- -f myawx.yml
After 30 minutes only the orchestrator is running, tailing the logs shows a looping error.
ADDITIONAL INFORMATION
AWX-OPERATOR LOGS
The text was updated successfully, but these errors were encountered: