Search
This plugin is currently in beta. While it is considered safe for use, please be aware that its API could change in ways that are not compatible with earlier versions in future releases, or it might become unsupported.
Search from an embedding store.
Performs a semantic search using a query string.
type: "io.kestra.plugin.langchain4j.rag.Search"
Make a search query against an embedding store.
id: search_embeddings_flow
namespace: company.team
tasks:
- id: ingest
type: io.kestra.plugin.langchain4j.rag.IngestDocument
provider:
type: io.kestra.plugin.langchain4j.provider.GoogleGemini
modelName: gemini-embedding-exp-03-07
apiKey: "{{ secret('GEMINI_API_KEY') }}"
embeddings:
type: io.kestra.plugin.langchain4j.embeddings.KestraKVStore
drop: true
fromExternalURLs:
- https://raw.githubusercontent.com/kestra-io/docs/refs/heads/main/content/blogs/release-0-22.md
- id: search
type: io.kestra.plugin.langchain4j.rag.Search
provider:
type: io.kestra.plugin.langchain4j.provider.GoogleGemini
modelName: gemini-embedding-exp-03-07
apiKey: "{{ secret('GEMINI_API_KEY') }}"
embeddings:
type: io.kestra.plugin.langchain4j.embeddings.KestraKVStore
query: "Feature Highlights"
maxResults: 5
minScore: 0.5
fetchType: FETCH
NO
The embedding store provider
NO
Maximum number of results to return
NO
Minimum similarity score
NO
The embedding model provider
YES
Query string to search for
YES
NONE
STORE
FETCH
FETCH_ONE
NONE
List of matching text results
The count of the fetched or stored resources
uri
The output files URI in Kestra's internal storage
Only available when fetchType
is set to STORE
YES
Endpoint URL
YES
Project location
YES
Model name
YES
Project ID
NO
YES
API endpoint
The Azure OpenAI endpoint in the format: https://{resource}.openai.azure.com/
YES
Model name
NO
YES
API Key
YES
Client ID
YES
Client secret
YES
API version
YES
Tenant ID
YES
API Key
YES
Model name
NO
YES
https://api.deepseek.com/v1
API base URL
YES
1
List of HTTP ElasticSearch servers.
Must be an URI like https://elasticsearch.com: 9200
with scheme and port.
NO
Basic auth configuration.
YES
List of HTTP headers to be send on every request.
Must be a string with key value separated with :
, ex: Authorization: Token XYZ
.
YES
Sets the path's prefix for every request used by the HTTP client.
For example, if this is set to /my/path
, then any client request will become /my/path/
+ endpoint.
In essence, every request's endpoint is prefixed by this pathPrefix
.
The path prefix is useful for when ElasticSearch is behind a proxy that provides a base path or a proxy that requires all paths to start with '/'; it is not intended for other purposes and it should not be supplied in other scenarios.
NO
Whether the REST client should return any response containing at least one warning header as a failure.
NO
Trust all SSL CA certificates.
Use this if the server is using a self signed SSL certificate.
YES
API Key
YES
Model name
NO
YES
API Key
YES
Model name
NO
YES
API base URL
YES
Model endpoint
YES
Model name
NO
YES
Basic auth password.
YES
Basic auth username.
NO
YES
{{flow.id}}-embedding-store
The name of the K/V entry to use
YES
API Key
YES
Model name
NO
YES
AWS Access Key ID
YES
Model name
YES
AWS Secret Access Key
NO
YES
COHERE
COHERE
TITAN
Amazon Bedrock Embedding Model Type
YES
The database name
YES
The database server host
YES
The database password
NO
The database server port
YES
The table to store embeddings in
NO
YES
The database user
NO
false
Whether to use use an IVFFlat index
An IVFFlat index divides vectors into lists, and then searches a subset of those lists closest to the query vector. It has faster build times and uses less memory than HNSW but has lower query performance (in terms of speed-recall tradeoff).
YES
API Key
YES
Model name
NO
YES
API base URL
NO
YES
The name of the index to store embeddings
NO