Helix Documentation

Introduction to Helix

πŸ‘‹ Hello! Welcome to the Helix documentation! Browse this site using the navigation links on the left (desktop) or navigation menu (mobile). You can also use the links at the bottom of each page to browse to the next logical location in the documentation.

What is Helix?

Helix is a generative AI stack that can be deployed on your own infrastructure. Run locally, deploy to your private cloud account, on-premise, or test it out quickly on our SaaS, Helix Cloud. To find out more about the features of helix please read through the rest of this documentation.

How Helix Works

Helix comes packaged with a distributed GPU scheduler capable of bin-packing AI models into GPU memory to optimise for latency and utilization.

On top of that, Helix exposes easy to use industry-standard APIs to interact with enterprise-ready open source AI models. Helix also adds simple abstractions to help users of Helix quickly build generative AI applications.

Helix’s Runner architecture means you can deploy a single control plane and then connect GPUs to it – from your enterprise, a cloud provider or a specialist provider like RunPod or Lambda Labs, and they’ll all be brought together into an easy to use environment.

It integrates with Keycloak for authentication so can be integrated into any enterprise ActiveDirectory/LDAP/OAuth environment.

Last updated on