Search

This plugin is currently in beta. While it is considered safe for use, please be aware that its API could change in ways that are not compatible with earlier versions in future releases, or it might become unsupported.

Search from an embedding store.

Performs a semantic search using a query string.

yaml
type: "io.kestra.plugin.langchain4j.rag.Search"

Make a search query against an embedding store.

yaml
id: search_embeddings_flow
namespace: company.team

tasks:
  - id: ingest
    type: io.kestra.plugin.langchain4j.rag.IngestDocument
    provider:
      type: io.kestra.plugin.langchain4j.provider.GoogleGemini
      modelName: gemini-embedding-exp-03-07
      apiKey: "{{ secret('GEMINI_API_KEY') }}"
    embeddings:
      type: io.kestra.plugin.langchain4j.embeddings.KestraKVStore
    drop: true
    fromExternalURLs:
      - https://raw.githubusercontent.com/kestra-io/docs/refs/heads/main/content/blogs/release-0-22.md

  - id: search
    type: io.kestra.plugin.langchain4j.rag.Search
    provider:
      type: io.kestra.plugin.langchain4j.provider.GoogleGemini
      modelName: gemini-embedding-exp-03-07
      apiKey: "{{ secret('GEMINI_API_KEY') }}"
    embeddings:
      type: io.kestra.plugin.langchain4j.embeddings.KestraKVStore
    query: "Feature Highlights"
    maxResults: 5
    minScore: 0.5
    fetchType: FETCH

Dynamic NO

The embedding store provider

Dynamic NO

Maximum number of results to return

Dynamic NO

Minimum similarity score

Dynamic NO

The embedding model provider

Dynamic YES

Query string to search for

Dynamic YES

Default NONE

Possible Values

STOREFETCHFETCH_ONENONE

SubType string

List of matching text results

The count of the fetched or stored resources

Format uri

The output files URI in Kestra's internal storage

Only available when fetchType is set to STORE

Dynamic YES

Endpoint URL

Dynamic YES

Project location

Dynamic YES

Model name

Dynamic YES

Project ID

Dynamic NO

Dynamic YES

API endpoint

The Azure OpenAI endpoint in the format: https://{resource}.openai.azure.com/

Dynamic YES

Model name

Dynamic NO

Dynamic YES

API Key

Dynamic YES

Client ID

Dynamic YES

Client secret

Dynamic YES

API version

Dynamic YES

Tenant ID

Dynamic YES

API Key

Dynamic YES

Model name

Dynamic NO

Dynamic YES

Default https://api.deepseek.com/v1

API base URL

SubType string

Dynamic YES

Min items 1

List of HTTP ElasticSearch servers.

Must be an URI like https://elasticsearch.com: 9200 with scheme and port.

Dynamic NO

Basic auth configuration.

SubType string

Dynamic YES

List of HTTP headers to be send on every request.

Must be a string with key value separated with : , ex: Authorization: Token XYZ.

Dynamic YES

Sets the path's prefix for every request used by the HTTP client.

For example, if this is set to /my/path, then any client request will become /my/path/ + endpoint. In essence, every request's endpoint is prefixed by this pathPrefix. The path prefix is useful for when ElasticSearch is behind a proxy that provides a base path or a proxy that requires all paths to start with '/'; it is not intended for other purposes and it should not be supplied in other scenarios.

Dynamic NO

Whether the REST client should return any response containing at least one warning header as a failure.

Dynamic NO

Trust all SSL CA certificates.

Use this if the server is using a self signed SSL certificate.

Dynamic YES

API Key

Dynamic YES

Model name

Dynamic NO

Dynamic YES

API Key

Dynamic YES

Model name

Dynamic NO

Dynamic YES

API base URL

Dynamic YES

Model endpoint

Dynamic YES

Model name

Dynamic NO

Dynamic YES

Basic auth password.

Dynamic YES

Basic auth username.

Dynamic NO

Dynamic YES

Default {{flow.id}}-embedding-store

The name of the K/V entry to use

Dynamic YES

API Key

Dynamic YES

Model name

Dynamic NO

Dynamic YES

AWS Access Key ID

Dynamic YES

Model name

Dynamic YES

AWS Secret Access Key

Dynamic NO

Dynamic YES

Default COHERE

Possible Values

COHERETITAN

Amazon Bedrock Embedding Model Type

Dynamic YES

The database name

Dynamic YES

The database server host

Dynamic YES

The database password

Dynamic NO

The database server port

Dynamic YES

The table to store embeddings in

Dynamic NO

Dynamic YES

The database user

Dynamic NO

Default false

Whether to use use an IVFFlat index

An IVFFlat index divides vectors into lists, and then searches a subset of those lists closest to the query vector. It has faster build times and uses less memory than HNSW but has lower query performance (in terms of speed-recall tradeoff).

Dynamic YES

API Key

Dynamic YES

Model name

Dynamic NO

Dynamic YES

API base URL

Dynamic NO

Dynamic YES

The name of the index to store embeddings

Dynamic NO

​Search

Search