-
Notifications
You must be signed in to change notification settings - Fork 213
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ReadyApp framework does not detect failed deployment #4639
Comments
We use the ready app framework and the WebLogic server handle if all the registered application is ready. I assumed you have set up the WebLogic deployment descriptor to register to the framework. wls:ready-registrationtrue</wls:ready-registration> For this to work, it must be successfully registered and initialized before you can call the ready() method to tell the server that this app is ready, if your application failed to initialize before then the framework will not be able to tell. Is it something that you are trying to simulate? |
Hi @jshum2479! Yes, I've tried different scenarios, but in all of them, the <ready-registration>true</wls:ready-registration> is defined at weblogic-application.xml (if not, anyway the ReadyLifecycleManager.getInstance().ready() crashes with an IllegalStateException, as documented) However, even if it is defined, whenever the webapp crashes at deployment time, the ReadyApp framework /weblogic/ready endpoint changes from HTTP 503 (server not ready) to HTTP 200 (server ready) - while it just faced a webapp deployment crash. I could understand this behavior, because, in a way, a failed deployment does not exactly mean the server is not "ready", but it this also means we cannot rely on this framework to detect managed server with improper state (= with expected webapp that failed to start), which means such pods could be used to serve traffic, while the webapp is down. |
@lastnico You are going to need to file a Support case with WebLogic Server support. This has nothing to do with the operator. |
Thanks. I indeed wondered where is the boundary between:
I can (and will) open a Support ticket to the Weblogic support, but I'm just afraid they'll redirect me here, asking for the K8S weblogic operator to offer ways to identify when the Weblogic server is in an incorrect state (from deployment perspective) Thanks again! |
@lastnico Feel free to give the Oracle Support people my name and mention that I said it has nothing to do with WKO. They know me and should reach out to me if they have issues. If you are not satisfied with the ReadyApp framework behavior, you can always use an endpoint for your app to handle readiness yourself... |
Thanks for the advice. About your last sentence, according to K8S operator doc:
And the 5 properties under "readinessProbe"
I don't find how we could assign another endpoint for readinessProbe. This is actually the reason why I started to investigate on the ReadyApp framework, which, in the end, does not fit detection of failed deployments. |
We do not allow configuration of a different probe path just the tuning parameters. |
My mistake. @rjeberhard The domain spec looks like this should be supported so why is it being prevented.
|
It has never been supported to override the liveness or readiness probes generated for the container running the WebLogic instance. The schema above is for the sidecar containers, if any, added to the pods. This is why there is validation to ensure that this functionality isn't used to try and add a sidecar container with the same name as the main container ("weblogic-server"). The only supported customizations are to timing-related fields, as described here: https://oracle.github.io/weblogic-kubernetes-operator/managing-domains/domain-lifecycle/liveness-readiness-probe-customization/#readiness-probe-customization We could certainly look at an enhancement to support overriding more fields of the probes. @lastnico, was your idea that you would provide some other application endpoint that consulted the MBeans? |
Or maybe something more basic such as exposing an endpoint in the app to ensure that the app is properly deployed. This is what WebLogic customers did prior to 12.2.1 being released (release that introduced ReadyApp). |
@robertpatrick I initially thought of this, yes: A new httpGet property under readinessProbe, to allow overriding /weblogic/ready URL polling Though, when multiple webapps are deployed on the same server, it'd mean multiple endpoints to poll, so, in this case, relying on the ReadyApp API calls on each webapp would probably be simpler. However, returning HTTP 503 for failed deployed webapps at ReadyApp means it'll require to change how it behaves currently, so there's maybe lower chances this gets supported. |
@lastnico let us know once you file a Support ticket with a reproducer and I will take a look. Note that your test case should not use Kubernetes… |
@robertpatrick Sorry for asking, do you mean a Support case to WebLogic Server support regarding ReadyApp framework, or also/both regarding the K8S weblogic operator, to possibly get a new httpGet property to override readinessProbe? |
Yes |
@robertpatrick It took a while, but the issue was finally submitted to Oracle Weblogic Support under Thanks! |
Following issue #2069, that was closed suggesting usage of ReadyApp framework would solve the case, it unfortunately does not, after testing it.
When using this API (declaring true and calling weblogic.application.ready.ReadyLifecycleManager.getInstance().ready() in some Web Listener for instance), the startup does as following:
This behavior probably comes from the fact the "applicationId" is unregistered from the ReadyApp framework, but, still, it means this API cannot be used to detect failed deployments.
I read that K8S startup probe is not used because the Weblogic K8S operator makes usage of the ReadyApp framework for fine control on deployment startup, but I'm afraid there are no proper ways for this operator to detect (and trigger creation of a new pod) of Weblogic managed server with failed deployment(s).
The text was updated successfully, but these errors were encountered: