Go Back

OpenLLM by BentoML

github.com
Summary

OpenLLM by BentoML is an open-source project that enables developers to self-host and run various open-source Large Language Models (LLMs) as OpenAI-compatible API endpoints. It provides a built-in chat UI, supports custom model integration, and offers simplified deployment to cloud platforms like BentoCloud for enterprise use cases.

Features
5/13
See all

Must Have

4 of 5

Conversational AI

API Access

Fine-Tuning & Custom Models

Enterprise Solutions

Safety & Alignment Framework

Other

1 of 8

Code Generation

Image Generation

Multimodal AI

Research & Publications

Security & Red Teaming

Synthetic Media Provenance

Threat Intelligence Reporting

Global Affairs & Policy

Pricing
Free
See all

Free

$0.00 monthly
  • Unlimited public/private repositories
  • Dependabot security and version updates
  • 2,000 CI/CD minutes/month (Free for public repositories)
  • 500MB of Packages storage (Free for public repositories)
  • Issues & Projects
  • Community support

Team

$4.00 per user
Popular
  • Everything included in Free, plus...
  • Access to GitHub Codespaces
  • Protected branches
  • Multiple reviewers in pull requests
  • Draft pull requests
  • Code owners
  • Required reviewers
  • Pages and Wikis
  • Environment deployment branches and secrets
  • 3,000 CI/CD minutes/month (Free for public repositories)
  • 2GB of Packages storage (Free for public repositories)
  • Web-based support

Enterprise

$21.00 per user
  • Everything included in Team, plus...
  • Data residency
  • Enterprise Managed Users
  • User provisioning through SCIM
  • Enterprise Account to centrally manage multiple organizations
  • Environment protection rules
  • Repository rules
  • Audit Log API
  • SOC1, SOC2, type 2 reports annually
  • FedRAMP Tailored Authority to Operate (ATO)
  • SAML single sign-on
  • Advanced auditing
  • GitHub Connect
  • 50,000 CI/CD minutes/month (Free for public repositories)
  • 50GB of Packages storage (Free for public repositories)
Rationale

OpenLLM by BentoML is a strong match as it explicitly states its purpose is to 'Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.' This directly aligns with the 'API Access' and 'Conversational AI' features, as it provides an OpenAI-compatible API for interacting with LLMs and a built-in chat UI. The platform also supports 'Fine-Tuning & Custom Models' by allowing users to run custom models and add them to a model repository. While it doesn't explicitly detail a 'Safety & Alignment Framework' or 'Multimodal AI' beyond text, its focus on providing an API for LLMs and enterprise deployment options makes it a very close fit. The 'Enterprise Solutions' feature is supported by its integration with BentoCloud for enterprise-grade cloud deployment with features like autoscaling and model orchestration.