LambdaDB - Docs by LangChain

LambdaDB is a serverless AI database for building scalable RAG and agent applications.

This notebook covers how to get started with the LambdaDB vector store in LangChain.

设置

要访问 LambdaDB 向量存储，您需要create a LambdaDB account, get your project credentials, and install the langchain-lambdadb integration package.

凭证

LambdaDB uses project-based authentication with a project URL and API key:

import getpass
import os

if "LAMBDADB_PROJECT_URL" not in os.environ:
    os.environ["LAMBDADB_PROJECT_URL"] = getpass.getpass("Enter your LambdaDB project URL: ")

if "LAMBDADB_API_KEY" not in os.environ:
    os.environ["LAMBDADB_API_KEY"] = getpass.getpass("Enter your LambdaDB API key: ")

To enable automated tracing of your model calls, set your LangSmith API key:

os.environ["LANGSMITH_API_KEY"] = getpass.getpass("Enter your LangSmith API key: ")
os.environ["LANGSMITH_TRACING"] = "true"

安装

The LangChain LambdaDB integration lives in the langchain-lambdadb package:

pip install -U langchain-lambdadb

You’ll also need to install an embedding model. For example, to use OpenAI embeddings:

pip install -U langchain-openai

实例化

LambdaDBVectorStore works with existing collections. You must create the collection beforehand with proper vector and text indexes configured.

from langchain_lambdadb.vectorstores import LambdaDBVectorStore
from langchain_openai import OpenAIEmbeddings
from lambdadb import LambdaDB
import os

# Initialize the LambdaDB client
client = LambdaDB(
    server_url=os.environ["LAMBDADB_SERVER_URL"],
    project_api_key=os.environ["LAMBDADB_API_KEY"]
)

# Initialize embeddings
embeddings = OpenAIEmbeddings()

# Connect to an existing collection
vector_store = LambdaDBVectorStore(
    client=client,
    collection_name="my_collection",  # Must exist beforehand
    embedding=embeddings,
)

Key parameters

client: LambdaDB client instance (required)
collection_name: Name of an existing collection in LambdaDB (required)
embedding: Embedding function to use (required)
text_field: Name of the text field in documents (default: “text”)
vector_field: Name of the vector field in documents (default: “vector”)
validate_collection: Whether to validate that the collection exists and is active (default: True)
default_consistent_read: Use consistent reads by default for immediate consistency, or eventual consistency for better performance (default: False)

管理向量存储

Add items

from langchain_core.documents import Document

document_1 = Document(page_content="LambdaDB is a serverless vector database", metadata={"source": "docs"})
document_2 = Document(page_content="It supports fast similarity search", metadata={"source": "docs"})
document_3 = Document(page_content="Perfect for RAG applications", metadata={"category": "features"})

documents = [document_1, document_2, document_3]
ids = vector_store.add_documents(documents=documents, ids=["1", "2", "3"])
print(f"Added documents with IDs: {ids}")

Documents have a maximum size of 50KB. The integration automatically batches documents into groups of up to 100 to stay within LambdaDB’s 6MB request limit.

Delete items

vector_store.delete(ids=["3"])

Get items by ID

documents = vector_store.get_by_ids(["1", "2"])
for doc in documents:
    print(f"* {doc.page_content} [{doc.metadata}]")

查询向量存储

Once your vector store has been created and the relevant documents have been added, you will most likely wish to query it during the running of your chain or agent.

相似度搜索

Performing a simple similarity search:

results = vector_store.similarity_search(
    query="What is LambdaDB?",
    k=2
)
for doc in results:
    print(f"* {doc.page_content} [{doc.metadata}]")

Similarity search with scores

If you want to execute a similarity search and receive the corresponding scores:

results = vector_store.similarity_search_with_score(
    query="vector database features",
    k=2
)
for doc, score in results:
    print(f"* [SIM={score:.3f}] {doc.page_content} [{doc.metadata}]")

Similarity search with filtering

LambdaDB supports filtering using query string syntax:

results = vector_store.similarity_search(
    query="database",
    k=2,
    filter={"queryString": {"query": "source:docs"}}
)
for doc in results:
    print(f"* {doc.page_content} [{doc.metadata}]")

Maximal Marginal Relevance (MMR) search

MMR optimizes for both similarity to the query AND diversity among selected documents:

results = vector_store.max_marginal_relevance_search(
    query="LambdaDB features",
    k=2,
    fetch_k=10,  # Fetch 10 candidates
    lambda_mult=0.5,  # Balance between relevance (1.0) and diversity (0.0)
)
for doc in results:
    print(f"* {doc.page_content}")

Turn into retriever

You can also transform the vector store into a retriever for easier usage in your chains:

retriever = vector_store.as_retriever(
    search_type="mmr",
    search_kwargs={"k": 2, "fetch_k": 10}
)
retriever.invoke("What is LambdaDB?")

Supported search types:

"similarity": Standard similarity search (default)
"mmr": Maximal marginal relevance search
"similarity_score_threshold": Similarity search with a score threshold

Async operations

LambdaDBVectorStore supports async methods for all operations:

# Add documents
ids = await vector_store.aadd_documents(documents=documents)

# Delete documents
await vector_store.adelete(ids=["3"])

# Search
results = await vector_store.asimilarity_search(query="LambdaDB", k=2)
for doc in results:
    print(f"* {doc.page_content}")

# Search with score
results = await vector_store.asimilarity_search_with_score(query="database", k=2)
for doc, score in results:
    print(f"* [SIM={score:.3f}] {doc.page_content}")

Currently, async methods run synchronously as the LambdaDB client doesn’t support async operations yet.

Consistency control

LambdaDB supports two consistency modes:

Eventual consistency (default): Faster performance, but data may be up to ~1 minute stale after writes
Consistent reads: Immediate consistency, slight performance impact

# Use consistent reads for a specific operation
results = vector_store.similarity_search(
    query="LambdaDB",
    k=2,
    consistent_read=True
)

# Or set consistent reads as the default
vector_store = LambdaDBVectorStore(
    client=client,
    collection_name="my_collection",
    embedding=embeddings,
    default_consistent_read=True  # All reads will be consistent by default
)

Creating from texts

You can create a vector store and populate it with texts in one step:

from langchain_lambdadb.vectorstores import LambdaDBVectorStore

texts = [
    "LambdaDB is a serverless vector database",
    "It supports fast similarity search",
    "Perfect for RAG applications"
]

metadatas = [
    {"source": "docs"},
    {"source": "docs"},
    {"category": "features"}
]

vector_store = LambdaDBVectorStore.from_texts(
    texts=texts,
    embedding=embeddings,
    metadatas=metadatas,
    client=client,
    collection_name="my_collection",
    ids=["1", "2", "3"]
)

用于检索增强生成

Here’s a complete example using LambdaDB for RAG:

from langchain_lambdadb.vectorstores import LambdaDBVectorStore
from langchain_openai import OpenAIEmbeddings, ChatOpenAI
from langchain_core.prompts import ChatPromptTemplate
from langchain_core.runnables import RunnablePassthrough
from langchain_core.output_parsers import StrOutputParser
from lambdadb import LambdaDB
import os

# Initialize
client = LambdaDB(
    project_url=os.environ["LAMBDADB_PROJECT_URL"],
    project_api_key=os.environ["LAMBDADB_API_KEY"]
)

embeddings = OpenAIEmbeddings()
vector_store = LambdaDBVectorStore(
    client=client,
    collection_name="my_collection",
    embedding=embeddings
)

# Create retriever
retriever = vector_store.as_retriever(search_kwargs={"k": 3})

# Create RAG chain
template = """Answer the question based only on the following context:
{context}

Question: {question}
"""
prompt = ChatPromptTemplate.from_template(template)
model = ChatOpenAI()

chain = (
    {"context": retriever, "question": RunnablePassthrough()}
    | prompt
    | model
    | StrOutputParser()
)

# Use the chain
response = chain.invoke("What is LambdaDB?")
print(response)

Key features

Document size limits

Maximum document size: 50KB per document
The integration validates document sizes and raises an error if exceeded

Batch processing

Documents are automatically batched in groups of 100 for upsert operations
Stays within LambdaDB’s 6MB request limit

过滤

Supports LambdaDB’s query string syntax for metadata filtering
Example: filter={"queryString": {"query": "field:value"}}

Search options

Similarity search: Find documents similar to a query
MMR search: Balance similarity and diversity
Score thresholding: Filter results by similarity score
Consistent reads: Control read consistency vs. performance trade-off

API 参考

For detailed documentation of all LambdaDBVectorStore features and configurations, head to the API reference.

Additional resources

LambdaDB Documentation

连接这些文档到 Claude、VSCode 等工具，通过 MCP 获取实时答案。

在 GitHub 上编辑此页面或提交 issue。

Documentation Index

​设置

​凭证

​安装

​实例化

​Key parameters

​管理向量存储

​Add items

​Delete items

​Get items by ID

​查询向量存储

​相似度搜索

​Similarity search with scores

​Similarity search with filtering

​Maximal Marginal Relevance (MMR) search

​Turn into retriever

​Async operations

​Consistency control

​Creating from texts

​用于检索增强生成

​Key features

​Document size limits

​Batch processing

​过滤

​Search options

​API 参考

​Additional resources

设置

凭证

安装

实例化

Key parameters

管理向量存储

Add items

Delete items

Get items by ID

查询向量存储

相似度搜索

Similarity search with scores

Similarity search with filtering

Maximal Marginal Relevance (MMR) search

Turn into retriever

Async operations

Consistency control

Creating from texts

用于检索增强生成

Key features

Document size limits

Batch processing

过滤

Search options

API 参考

Additional resources