Glossary

Vector Search

Discover how vector search revolutionizes AI by enabling semantic similarity in data retrieval for NLP, visual search, recommendation systems, and more!

Train YOLO models simply
with Ultralytics HUB

Learn more

Vector search is a powerful technique used in artificial intelligence (AI) and machine learning (ML) to find data items based on their conceptual similarity rather than exact keyword matches. Instead of searching for text containing specific words, vector search allows systems to retrieve information—like images, documents, or sounds—that is semantically related or contextually similar to a query. This method is fundamental to many modern AI applications, enabling more intuitive and human-like information retrieval.

Understanding Vector Embeddings

The core concept behind vector search is vector embeddings. Data, whether it's text, images, audio, or other types, is transformed into numerical representations called vectors using embedding models. These vectors exist in a high-dimensional space where items with similar meanings or characteristics are positioned close to each other. For example, in natural language processing (NLP), models like BERT convert sentences into vectors, ensuring that sentences conveying similar ideas have nearby vector representations. Similarly, in computer vision (CV), models such as Ultralytics YOLO can generate feature vectors for images, enabling visual similarity comparisons. Once data is embedded, vector search algorithms use distance metrics, such as Cosine Similarity or Euclidean Distance, to quantify the "closeness" between vectors. A query (also converted into a vector) is compared against the vectors in the database, and the items corresponding to the nearest vectors are returned as the search results.

How Vector Search Works

Implementing vector search typically involves these steps:

  1. Embedding Generation: Convert your dataset (images, text, etc.) into vector embeddings using a pre-trained or custom-trained model.
  2. Indexing: Store these vectors in a specialized vector database or index designed for efficient similarity search. Examples include Pinecone and Milvus.
  3. Querying: Convert the search query (e.g., an image or text phrase) into a vector using the same embedding model.
  4. Searching: Use an algorithm like Approximate Nearest Neighbor (ANN) search to efficiently find the vectors in the index that are closest to the query vector based on the chosen distance metric.
  5. Retrieval: Return the original data items corresponding to the nearest vectors found.
Read all