Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve Ray serve functionality #44

Open
venkatajagannath opened this issue Aug 19, 2024 · 0 comments
Open

Improve Ray serve functionality #44

venkatajagannath opened this issue Aug 19, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@venkatajagannath
Copy link
Contributor

Currently, when we submit a ray serve deployment, the job status will likely be in running state until it is taken down.

One way to get around that would be to set wait_for_completion=False, which would return control to Airflow to run the next task. But, there may be a scenario where the serve deployment is currently not ready but the following task needs to access it.

For example, If I want to deploy an AI model and then call it using a spark streaming application in the next task, the model might not be ready.

Things to check --

  • What is the exact behavior of Ray Serve deployments when submitted through the SubmitRayJob operator?
  • Should we introduce a new trigger (specifically for ray serve apps) which is called instead if the job is serve deployment?
  • How can we make sure the UX remains consistent?
@venkatajagannath venkatajagannath added the enhancement New feature or request label Aug 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant