Elasticsearch: Set up vector search
Overview
Introduction to Elasticsearch
As you ramp up on Elastic, you'll use the Elasticsearch Relevance Engine™ (ESRE), designed to power AI search applications. With ESRE, you can take advantage of a suite of developer tools including Elastic's textual search, vector database, and our proprietary transformer model for semantic search.
Elastic offers a variety of search techniques, starting with BM25, the industry standard for textual search. It provides precise matching for specific searches, matching exact keywords, and it improves with tuning.
As you get started on vector search, keep in mind there are two forms of vector search: "dense" (aka, kNN vector search) and "sparse".
In addition, Elastic also supports kNN vectors to implement similarity search on unstructured data beyond text, such as videos, images, and audio.
In this guide, we'll demonstrate how to get started with leveraging Elasticsearch as a vector database, which enables vector search use cases.
Let's get started
Create an Elastic Cloud project
Get started with a 14-day trial. Once you go to cloud.elastic.co and create an account, follow the steps below to learn how to launch your first Elasticsearch Serverless project.
In this example, we will onboard a live website, the Elasticsearch Labs.
Head over Create Project. Let’s create a Serverless project that is Optimized for Vectors, give it a name and create it.
data:image/s3,"s3://crabby-images/109a3/109a32f2aa20326b5aad98acfa13130da8f2de79" alt="SGJ-create-project.png"
Let’s create our first Elasticsearch index, we can name it vector-search. Hit Create my index.
data:image/s3,"s3://crabby-images/a46ca/a46caa32491db6df70ca13d77e7dae69cc9273da" alt="SGJ-create-index.png"
Get started with vector search
Now that you’ve created your serverless project and your first index, it’s time to get data into Elasticsearch. We’ve made it easy for you to try it out, at Data > Index Management Data, click Run in Console.
data:image/s3,"s3://crabby-images/27c67/27c6712c97f8117ac0613f4059590dcacabbcef7" alt="SGJ-vector-search-run.png"
A flyout will appear with the console. You’ll see the code on the left. When you click the Send request button ▶, you’ll see the executed code on the right.
The first command will create the mappings for the index, with a vector field called vector and a text field called text.
data:image/s3,"s3://crabby-images/f7bdf/f7bdf336a3bb510df0f7e9427a9fec36f6a126e4" alt="SGJ-vector-search-mappings.png"
Now run the second command to index 3 documents using a _bulk request. This is the preferred choice when indexing large quantities of documents in Elasticsearch.
data:image/s3,"s3://crabby-images/66ace/66ace28cadf4a3e68d8eaf1479b8dc7b25c06b68" alt="SGJ-vector-search-bulk.png"
Working with Elasticsearch
Build your vector search query
Great, now we can issue a knn query to get the closest documents that match a given vector, we can try with a vector such as [2,6,0]. Copy and paste the following request:
GET vector-search/_search
{
"knn": {
"field": "vector",
"k" 10,
"num_candidates": 100,
"query_vector": [2,6,0]
}
}
Done, “Yosemite National Park” with vector [2.654, 6.36, 0.476] is indeed the closest match.
data:image/s3,"s3://crabby-images/0599e/0599e9310b8535256c8f0970875e77096a38057e" alt="SGJ-vector-search-knn.png"
Next steps
Thanks for taking the time to set up vector search for your data with Elastic Cloud. As you begin your journey with Elastic, understand some operational, security, and data components you should manage as a user when you deploy across your environment.
Ready to get started? Spin up a free 14-day trial on Elastic Cloud or try out these 15 minute hands-on learning on Search AI 101.
Additional resources
- Get started with Elasticsearch
- Build an AI-powered search experience
- Search across business systems and software
- Build an application on top of Elasticsearch
- Connect to the Elasticsearch API