Interested in a hands-on learning experience for developing LLM applications?
Join our LLM Bootcamp today!

Weaviate Setup and Configuration

Weaviate

Weaviate is an open-source, cloud-native vector database that stores both objects and vectors, enabling semantic search at scale. It combines vector similarity search with keyword filtering, retrieval-augmented generation (RAG), and reranking — all within a single query interface.

Typical use cases include RAG systems, semantic and image search, recommendation engines, chatbots, and content classification.

Weaviate supports two primary approaches for handling vectors:

For production workloads, Weaviate offers features like multi-tenancy, replication, RBAC authorization, and high availability, making it suitable for enterprise-grade deployments.

Weaviate Cloud

Weaviate Cloud (WCD) is a fully managed service that allows you to deploy and scale Weaviate clusters in the cloud without managing any infrastructure.

Weaviate Cloud automates provisioning, scaling, security, and maintenance so you can focus entirely on building AI-driven applications rather than maintaining servers. It’s designed to help teams quickly develop and deploy enterprise-ready vector search and retrieval systems.

Weaviate Cloud and Weaviate Database

Weaviate is more than a database — it’s a flexible, extensible platform for vector-based AI applications.
The core open-source Weaviate engine supports vector, keyword, and hybrid search, and integrates seamlessly with machine learning and embedding models through its pluggable architecture.

Weaviate Cloud is built directly on top of this same Weaviate Database engine. Both share identical technology and capabilities, but Weaviate Cloud removes the operational burden of hosting, monitoring, and scaling.

This means:

  • No manual cluster setup or infrastructure management.

  • Automatic scaling, version upgrades, and reliability.

  • Easy integration with model providers like OpenAI and Cohere.

In short, Weaviate Cloud lets you run production-grade Weaviate instances with minimal configuration and zero infrastructure overhead.

For further details about Weaviate Database, APIs, and modules, refer to the Weaviate documentation.

Cluster Setup Overview

This guide walks you through the setup process for a Weaviate cluster, which will be used in the upcoming hands-on exercises.
Setting up your cluster ahead of time ensures a smooth experience during the live sessions and labs.

What is a Cluster?

A cluster is a group of connected Weaviate nodes (servers) that work together to store and process your data.
Instead of relying on a single machine, a cluster distributes data and queries across multiple nodes, ensuring:

  • Scalability – handle larger datasets and higher query volumes

  • High availability – your system remains online even if one node fails

  • Performance – faster data retrieval and parallel processing

Follow the steps below to set up your cluster.

1. Sign Up for Weaviate Cloud

 

  • Sign up on Weaviate Cloud and verify your email.

 

2. Create a Sandbox Cluster

  • Next, you will see this page. Click on “Create cluster”.

 

 

  • To create a sandbox cluster, follow these steps: 
      1. Give your cluster a name. Weaviate Cloud adds a random suffix to sandbox cluster names to ensure uniqueness.
      2. Select a cloud region from the dropdown menu. You may choose any available option.

 

Weaviate Cluster Guide 4

 

           3. Scroll to the bottom and click “Create” to create the cluster.

  • It takes a minute or two to create the new cluster.

 

 

  • When the cluster is ready, Weaviate Cloud displays a check mark (✔️) next to the cluster name.

 

3. Use Sandbox Cluster

  • Once your sandbox is ready, click on the sandbox name to open the cluster details page.
  • In the cluster details, locate and click the “Connect” button to proceed.

 

 

  • A pop-up window titled “Connect to cluster” will appear. Click on the outlined box to copy both the “Weaviate_URL” and the “Weaviate_API_Key” format. Paste this in a notepad file for records.

 

  • Close the window and scroll down until you see “Create API Key”. Click on it.

 

  • Name your key as you wish, and make sure to select “admin” for Roles(s) as outlined in the image.

 

  • Copy or download your API key as it will not be visible again. Paste it in your notepad file for hands-on exercises. 

 

  • After pasting the details in a notepad to keep them saved as you will need the “Weaviate_URL” and the “Weaviate_API_Key” during the hands-on exercises with Weaviate, make sure to add “https://” before your URL as shown in the image. Paste it as it is where the URL is required.