Learn how to deploy GPU instances and serverless endpoints, and build AI applications with Float16 Cloud.
Get up and running in 5 minutes
Complete API documentation
Access the Float16 dashboard
Learn the basics of Float16 Cloud platform
Learn what Float16 Cloud is and how it can help you deploy AI workloads
Get up and running with Float16 Cloud in under 5 minutes
Learn how to sign in to Float16 Cloud
Organize your resources with workspaces
Navigate the Float16 console and understand key features
In-depth guides for using Float16 Cloud features
Create and manage dedicated GPU instances on Float16 Cloud
Run AI workloads on demand without managing infrastructure
Deploy vLLM models instantly with pre-configured settings
Manage persistent storage for your GPU instances
Use pre-configured templates to deploy common AI workloads
AI and LLM services for inference and deployment
Explore Float16's AI capabilities including LLM deployment, vLLM Playground, and OCR
Deploy and serve large language models on Float16 Cloud
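Once a model is deployed, it can typically be queried like any OpenAI-compatible endpoint (the convention vLLM servers expose). The sketch below assumes that; the base URL, model name, and `FLOAT16_API_KEY` environment variable are placeholders, not documented Float16 values.

```python
# Minimal sketch: querying a deployed model, assuming an OpenAI-compatible endpoint.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://<your-endpoint>.float16.cloud/v1",  # hypothetical endpoint URL
    api_key=os.environ["FLOAT16_API_KEY"],                # hypothetical env var name
)

response = client.chat.completions.create(
    model="your-deployed-model",  # use the model name shown for your deployment
    messages=[{"role": "user", "content": "Summarize what Float16 Cloud does."}],
    max_tokens=256,
)
print(response.choices[0].message.content)
```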
Interactive environment for testing deployed vLLM models
Extract text from Thai and English documents using vision-language OCR
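As a rough illustration of how a document might be submitted for OCR, the snippet below posts a base64-encoded image over HTTP. The endpoint path, payload fields, and response shape are assumptions for illustration only; refer to the OCR page for the actual request format.

```python
# Hypothetical OCR request sketch; URL, auth header, and payload are assumptions.
import base64
import os
import requests

with open("invoice_th.png", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode()

resp = requests.post(
    "https://api.float16.cloud/ocr",  # hypothetical URL, see the OCR docs
    headers={"Authorization": f"Bearer {os.environ['FLOAT16_API_KEY']}"},  # hypothetical auth
    json={"image": image_b64},        # hypothetical payload field
    timeout=60,
)
resp.raise_for_status()
print(resp.json())  # extracted Thai/English text; shape depends on the actual API
```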
Enable LLMs to interact with external tools and functions
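Function calling follows the standard OpenAI tools convention when the deployed model supports tool use, which is what the sketch below assumes. The endpoint, model name, and the `get_weather` tool are placeholders introduced for illustration.

```python
# Sketch of function calling against an OpenAI-compatible endpoint (assumed).
import json
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://<your-endpoint>.float16.cloud/v1",  # hypothetical
    api_key=os.environ["FLOAT16_API_KEY"],                # hypothetical
)

tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # illustrative tool, not a Float16 built-in
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

response = client.chat.completions.create(
    model="your-deployed-model",
    messages=[{"role": "user", "content": "What's the weather in Bangkok?"}],
    tools=tools,
    tool_choice="auto",
)

# When the model decides a tool is needed, it returns a structured tool call
# instead of free text; your application runs it and sends the result back.
message = response.choices[0].message
if message.tool_calls:
    call = message.tool_calls[0]
    print(call.function.name, json.loads(call.function.arguments))
else:
    print(message.content)
```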
Generate responses in specific formats using JSON Schema, Regex, or Choice constraints
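The JSON Schema, Regex, and Choice constraints match vLLM's guided-decoding parameters, so a constrained request can be sketched through the OpenAI client's `extra_body` field, assuming the deployment is a vLLM OpenAI-compatible server. URLs and model names below are placeholders.

```python
# Sketch of constrained generation via vLLM guided decoding (assumed backend).
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://<your-endpoint>.float16.cloud/v1",  # hypothetical
    api_key=os.environ["FLOAT16_API_KEY"],                # hypothetical
)

# Choice constraint: the model must answer with exactly one of the options.
response = client.chat.completions.create(
    model="your-deployed-model",
    messages=[{"role": "user", "content": "Is this review positive or negative? 'Great GPU pricing!'"}],
    extra_body={"guided_choice": ["positive", "negative"]},
)
print(response.choices[0].message.content)  # -> "positive" or "negative"

# JSON Schema constraint: the output must validate against the schema.
schema = {
    "type": "object",
    "properties": {"name": {"type": "string"}, "gpu_count": {"type": "integer"}},
    "required": ["name", "gpu_count"],
}
response = client.chat.completions.create(
    model="your-deployed-model",
    messages=[{"role": "user", "content": "Describe a 2-GPU instance named demo as JSON."}],
    extra_body={"guided_json": schema},
)
print(response.choices[0].message.content)
```

A regex constraint works the same way, passing `{"guided_regex": "<pattern>"}` in `extra_body`.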
Complete API documentation for Float16 Cloud
Credits, payments, and billing management