Yatai Client

A Yatai RPC Server is a stateful service that provides a complete BentoML model management and model serving/deployment workflow.

Two sets of APIs are provided:

  • BentoRepositoryAPIClient (via YataiClient.repository) manages saved BentoService bundles, making them available for serving in production environments.

  • DeploymentAPIClient (via YataiClient.deployment) deploys BentoServices to a variety of cloud platforms, tracks deployment status, and sets up logging and monitoring for your model serving workload.
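
The two accessors named above can be illustrated with a minimal sketch. It relies only on the repository and deployment attributes described in this section. The helper name split_api_clients and the stand-in object are hypothetical, used so the example runs without BentoML or a running Yatai server.

```python
from types import SimpleNamespace


def split_api_clients(yatai_client):
    """Return the two API clients exposed by a YataiClient-like object.

    Hypothetical helper for illustration; it only reads the two
    attributes named in this section.
    """
    repository_api = yatai_client.repository  # BentoRepositoryAPIClient in BentoML
    deployment_api = yatai_client.deployment  # DeploymentAPIClient in BentoML
    return repository_api, deployment_api


# Stand-in object (an assumption, not a real YataiClient) so the sketch
# runs without BentoML installed.
fake_client = SimpleNamespace(
    repository="repository-api",
    deployment="deployment-api",
)

repository_api, deployment_api = split_api_clients(fake_client)
print(repository_api, deployment_api)
```

With BentoML installed, a real YataiClient instance would presumably be passed in place of the stand-in. How that instance is constructed is not covered in this section, so it is left out here.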

Note

We plan to provide better documentation on using DeploymentAPIClient programmatically. For now, refer to deployment_api.py or use the bentoml deployment CLI commands.