Unified AI Application Framework
BentoML is a framework for building reliable, scalable and cost-efficient AI applications. It comes with everything you need for model serving, application packaging, and production deployment.
Start your BentoML journey
The BentoML documentation provides detailed guidance on the project with hands-on tutorials and examples. If you are a first-time user of BentoML, we recommend that you read the following documents in order:
Gain a basic understanding of the BentoML open-source framework, its workflow, and the BentoML ecosystem.
Hands-on tutorials that help you quickly get started with BentoML by deploying AI applications with common machine learning (ML) models.
A step-by-step tour of BentoML’s components that introduces you to its philosophy. After reading, you will understand what drives BentoML’s design and what Bentos and Runners stand for.
Best practices and usage examples, organized by the ML framework used to build your model.
Dive into BentoML’s advanced features, internals, and architecture, including GPU support, inference graph, monitoring, and performance optimization.
Learn how BentoML works together with other tools and products in the Data/ML ecosystem.
Join us in our Slack community where thousands of AI application developers are contributing to the project and helping each other.
The BentoML team uses the following channels to announce important updates, such as major product releases, and to share tutorials, case studies, and community news.