TrieveRubyClient::SearchChunksReqPayload
Properties
| Name |
Type |
Description |
Notes |
| content_only |
Boolean |
Set content_only to true to only returning the chunk_html of the chunks. This is useful for when you want to reduce amount of data over the wire for latency improvement (typically 10-50ms). Default is false. |
[optional] |
| filters |
ChunkFilter |
|
[optional] |
| get_collisions |
Boolean |
Set get_collisions to true to get the collisions for each chunk. This will only apply if environment variable COLLISIONS_ENABLED is set to true. |
[optional] |
| get_total_pages |
Boolean |
Get total page count for the query accounting for the applied filters. Defaults to false, but can be set to true when the latency penalty is acceptable (typically 50-200ms). |
[optional] |
| highlight_delimiters |
Array<String> |
Set highlight_delimiters to a list of strings to use as delimiters for highlighting. If not specified, this defaults to ["?", ",", ".", "!"]. These are the characters that will be used to split the chunk_html into splits for highlighting. These are the characters that will be used to split the chunk_html into splits for highlighting. |
[optional] |
| highlight_max_length |
Integer |
Set highlight_max_length to control the maximum number of tokens (typically whitespace separated strings, but sometimes also word stems) which can be present within a single highlight. If not specified, this defaults to 8. This is useful to shorten large splits which may have low scores due to length compared to the query. Set to something very large like 100 to highlight entire splits. |
[optional] |
| highlight_max_num |
Integer |
Set highlight_max_num to control the maximum number of highlights per chunk. If not specified, this defaults to 3. It may be less than 3 if no snippets score above the highlight_threshold. |
[optional] |
| highlight_results |
Boolean |
Set highlight_results to false for a slight latency improvement (1-10ms). If not specified, this defaults to true. This will add `<b><mark>` tags to the chunk_html of the chunks to highlight matching splits and return the highlights on each scored chunk in the response. |
[optional] |
| highlight_threshold |
Float |
Set highlight_threshold to a lower or higher value to adjust the sensitivity of the highlights applied to the chunk html. If not specified, this defaults to 0.8. The range is 0.0 to 1.0. |
[optional] |
| highlight_window |
Integer |
Set highlight_window to a number to control the amount of words that are returned around the matched phrases. If not specified, this defaults to 0. This is useful for when you want to show more context around the matched words. When specified, window/2 whitespace separated words are added before and after each highlight in the response's highlights array. If an extended highlight overlaps with another highlight, the overlapping words are only included once. |
[optional] |
| page |
Integer |
Page of chunks to fetch. Page is 1-indexed. |
[optional] |
| page_size |
Integer |
Page size is the number of chunks to fetch. This can be used to fetch more than 10 chunks at a time. |
[optional] |
| query |
String |
Query is the search query. This can be any string. The query will be used to create an embedding vector and/or SPLADE vector which will be used to find the result set. |
|
| recency_bias |
Float |
Recency Bias lets you determine how much of an effect the recency of chunks will have on the search results. If not specified, this defaults to 0.0. We recommend setting this to 1.0 for a gentle reranking of the results, >3.0 for a strong reranking of the results. |
[optional] |
| score_threshold |
Float |
Set score_threshold to a float to filter out chunks with a score below the threshold. |
[optional] |
| search_type |
String |
Can be either "semantic", "fulltext", or "hybrid". If specified as "hybrid", it will pull in one page (10 chunks) of both semantic and full-text results then re-rank them using BAAI/bge-reranker-large. "semantic" will pull in one page (10 chunks) of the nearest cosine distant vectors. "fulltext" will pull in one page (10 chunks) of full-text results based on SPLADE. |
|
| slim_chunks |
Boolean |
Set slim_chunks to true to avoid returning the content and chunk_html of the chunks. This is useful for when you want to reduce amount of data over the wire for latency improvement (typically 10-50ms). Default is false. |
[optional] |
| tag_weights |
Hash<String, Float> |
Tag weights is a JSON object which can be used to boost the ranking of chunks with certain tags. This is useful for when you want to be able to bias towards chunks with a certain tag on the fly. The keys are the tag names and the values are the weights. |
[optional] |
| use_weights |
Boolean |
Set use_weights to true to use the weights of the chunks in the result set in order to sort them. If not specified, this defaults to true. |
[optional] |
Example
require 'trieve_ruby_client'
instance = TrieveRubyClient::SearchChunksReqPayload.new(
content_only: null,
filters: null,
get_collisions: null,
get_total_pages: null,
highlight_delimiters: null,
highlight_max_length: null,
highlight_max_num: null,
highlight_results: null,
highlight_threshold: null,
highlight_window: null,
page: null,
page_size: null,
query: null,
recency_bias: null,
score_threshold: null,
search_type: null,
slim_chunks: null,
tag_weights: null,
use_weights: null
)