🌟 New service Serverless GPU → Read more

Discover our latest articles and insights

Float16 x Typhoon (SCB 10X)
1 min read

Float16 x Typhoon (SCB 10X)

We are pleased to announce our latest partnership between Float16 and the Typhoon team (SCB 10X). Our collaboration Regarding our partnership, we both agree to increase AI adoption in Thailand through developer activities such as bootcamps, developer materials, and advanced use cases. Float16 will take part in supporting and empowering the community through resource accessibility and further AI development. Float16 will contribute to accelerated computing use cases like vector search,search

Serverless GPU, GPU Grants and Much More 🤯
3 min read

Serverless GPU, GPU Grants and Much More 🤯

Hi Everyone. Mati is Here 👋👋 It have been a while since latest update. (One-click deployment) Today, I have a very big update about Float16.cloud. Serverless GPU Firstly, we are proud to announce our "Serverless GPU" service, powered by H100. Key Features * Zero code changes required—say goodbye to Docker images 👋 * The world's fastest cold start, under 100ms * Deployment mode for AI inference (Please see the examples) * Spot mode for AI training The Main Differentiator Between Ou

1st Thailand LLM Bootcamp
2 min read

1st Thailand LLM Bootcamp

Float16.cloud has been a part of hosting the Bootcamp with OpenACC, NVIDIA, Siam.ai, and AIEI. The Thailand LLM Bootcamp will take place over 2 days. Day 1 (February 28) will be held online, and Day 2 will be held in person at Central World, Thailand. Registration link (Close 18 Feb 2025) https://www.openhackathons.org/s/siteevent/a0CUP00001Yalpp2AB/se000400 Agenda This bootcamp will provide hands-on experience with an end-to-end LLM project. It is divided into 3 parts 1.Continue Fine-T

How to
Launching Float16.cloud App Console v0.3
5 min read

Launching Float16.cloud App Console v0.3

It has been almost 2 months (46 days) since the last update (v0.2.x) and the v0.3.x update. This update is a major one, adding platform capability and a new service. V0.3 * One-click deployment * Quantized inference speed leaderboard One-Click Deployment In this version, we are introducing the One-Click Deployment service. This service allows you to deploy LLMs from the Huggingface repository. We focus on ensuring that the endpoint is ready for production use, not just for a POC or exper

ACL2024 BKK Guidebook
4 min read

ACL2024 BKK Guidebook

Hello everyone, welcome to ACL 2024 and welcome to Bangkok! I'm Mati, the founder of Float16.cloud. I'm Thai and based in Bangkok. The upcoming ACL 2024 will be held from August 11th to 16th, 2024, at Centara Grand and Bangkok Convention Centre. I want to provide an update on how to 'deal' with Bangkok in several aspects such as: * Location and Weather * Maps * Internet SIM cards * Transportation * Food * Coffee * Medicine * People * Recommended activities Location and Weather Cen

Launching Float16.cloud Console v0.2
3 min read

Launching Float16.cloud Console v0.2

Hello, Everyone. I am happy to announce that the Float16 platform has been updated to version 0.2.1. Version 0.2.x is going to be a core interface supporting upcoming main features like one-click deployment, serverless GPU, and one-click training. We have also added features to improve the capability of managing your projects, such as a workspace and a usage dashboard. Version 0.2.1 includes the following features: * Authentication with GitHub and Google accounts * Workspace * Usage dashb

5 min read

Press Conference: Opening AI startup Alliance and Launch AI Developer Group

5 AI Startups launch AI Startup Alliance and AI Developer Group. 5 AI Startups included AThenaAI, Perceptra, Gowajee, Eidy and Float16 Launch AI Startup Alliance and AI Developer Group. The AI Startup Alliance is a collaboration between 5 startups, formed in response to the current landscape of the AI industry in Thailand. This initiative recognizes that the AI industry structure in Thailand is less mature and experienced compared to leading countries, particularly in terms of the AI industr

Float16.cloud Seed Round
3 min read

Float16.cloud Seed Round

Hi everyone, I’m Mati, founder of Float16.cloud. I built Float16.cloud, starting 9 months ago in October 2023 to develop the platform, and launched it for the first time in January 2024. After launching the first version of Float16, I received several pieces of feedback and I want to say thank you for your support and valuable input. I am also excited to announce that we have received funding from our investor. Today, I am very proud to announce the first version of our core values and roa

2 min read

Float16 at SuperAI 2024 @ SG

Float16 have selected to present at SuperAI 2024 event @ Marina Bay Sands, June 5-6 2024. for startup Genesis. Join Us at SuperAI Event for an Insightful Pitch Session with Float16! We are excited to announce that Float16 will be participating in the upcoming SuperAI event. The pitch session scheduled for Wednesday, June 5th, from 3:00 PM to 5:00 PM (SGT) at the Motiff Stage. Additionally, should we progress to the next stage, we will compete in the final round on Thursday, June 6th, from

[Preview] Southeast Asia LLM Function call benchmark.
2 min read

[Preview] Southeast Asia LLM Function call benchmark.

What is function call Function call capability is crucial for creating agent-based systems. Function calls allow LLMs to use 'the tools' when 'the tools' are functions (methods) or APIs. This capability increases the reliability of LLM applications when LLM applications MUST interact with non-LLM systems. Method I used a part of Gorilla-eval (https://github.com/ShishirPatil/gorilla/tree/main/eval) to evaluate all models. This evaluation method measures two metrics: functionality score and h

Benchmark
SQLCoder-7b-2 is live.
3 min read

SQLCoder-7b-2 is live.

Faster, Smaller, The best Text-to-SQL LLM model. Overview SQLCoder is a Text-to-SQL LLM family from Defog.ai (YC W23): a human-level AI analyst for every enterprise user. Text-to-SQL is a crucial task. This task can convert text (natural language) into SQL statements based on the table schema. Text-to-SQL enables end users to retrieve data from a database instead of having to write SQL statements. Benchmark According to the Defog.ai team's benchmark, SQLCoder-7b-2 and SQLCoder-70b-alpha

How to
The First 70B Thai LLM.
3 min read

The First 70B Thai LLM.

OpenThaiGPT is a volunteer group that has created an open-source model named OpenThaiGPT (OTG). They have continued training a large language model (LLM) from an open-source model, and the resulting model has scored nearly as high as Claude Sonnet on certain benchmarks. Launch date On April 8, 2024, they published an open-source (Apache-2.0) large language model (LLM) for the Thai language. It includes versions with 7 billion, 13 billion, and 70 billion parameters, based on the Llama-2 model

Float16.Cloud Update [02/2024]
2 min read

Float16.Cloud Update [02/2024]

Float16.Cloud has released a Generative AI service for Southeast Asian languages that is better than ChatGPT (GPT-3.5). Float16.Cloud supports three new models: one for SEA languages and two for the Thai language. The pricing is 95% cheaper than ChatGPT (GPT-3.5). The models are named: 1. SeaLLM-7b-v2 (Alibaba) 2. Typhoon-7b (SCB10X) 3. OpenThaiGPT-13b (OpenThaiGPT). The highlight is SeaLLM-7b-v2, developed by DAMO (Alibaba), which supports a total of 10 languages in SEA, including Engli

Thai language RAG with Llamaindex + Weaviate + SeaLLM
2 min read

Thai language RAG with Llamaindex + Weaviate + SeaLLM

Sa-Wad-Dee, Hello from Thailand. Introduce RAG is like this cool AI tool designed for developers and teams who want to bring LLMs into new features. But gotta admit, there's a small hiccup - we don't really have a ton of tutorial projects and this leads to a couple of big issues. 1. Everything's pretty much in English. So for those who aren't native English speakers, they have to tweak their solutions and spend that extra time getting acquainted with stuff like Tokenizers or Text splitter

LlamaindexUse case