Collections

Find specific information quickly by organizing and grouping knowledge sources.

Collections enhance how the agents interact with organizational knowledge. By grouping related sources into topic-based repositories, Collections enable more precise, context-aware information retrieval and intelligent knowledge management.

Feature overview

  • Better information retrieval: Organize knowledge sources into topic-specific repositories so the system searches within relevant content clusters instead of the entire knowledge base. This reduces the search space and improves retrieval speed.

  • More accurate answers: By organizing related sources together, collections create focused knowledge domains that reduce noise and interference from unrelated content.

  • Contextual understanding: Go beyond simple keyword matching. The system can better understand nuance and user intent by analyzing the current question, previous interactions, and configurable contextual parameters. This enables more precise responses to underlying needs.

  • Search types: Define how the system processes queries across your sources. Choose between Semantic, Full-text, or Hybrid approaches. Your selection applies whenever the system accesses this collection to retrieve information, ensuring consistent search behavior.

Each Collection is a standalone entity that contains knowledge sources relevant to its theme. The search functionality lets you find information within a specific collection, limiting results to its sources only. Each collection must be trained individually, and the system keeps previous versions for reference.

Creating a new collection

To create a new collection, go to the Collections section and select New Collection.

Add the collection name and the topics it covers in the description so the agent can understand when to use it. Without this field, the agent can’t activate the skill. The agent only “knows” what is defined in this description, so write it clearly, precisely, and with enough detail. The Collection description provides enough context for the agent to determine when it should be used, based on user questions or inputs. It must also indicate the topics covered by the documents in the collection. This helps the agent understand the scope of the available information.

When a user’s input relates to any of the described topics, the agent can trigger the corresponding knowledge collection, retrieve relevant information from its documents, and generate an appropriate response.

circle-info

You can add up to 200 Knowledge Sources and up to 200 Collections. Knowledge Sources can be organized into collections according to the needs of the project, with no limit per collection.

New collection modal

Search types

When configuring the Collection, you can select the search type that best fits your needs through a dropdown menu. Collections offers three distinct search approaches, each with its own strengths: Semantic, Full-text, and Hybrid.

Retrieves results based on meaning rather than exact word matches. It understands user intent and can find relevant information even when terminology differs.

Example: If a user asks, “How do I reset my password?”, Semantic search may return sources with phrases such as “password restoration procedure” or “how to change forgotten login credentials,” because it recognizes that these concepts are related.

Focuses on exact word matches within sources. It’s ideal for finding specific terminology, product names, or unique identifiers.

Example: If you search for “Model X500 error code 3021,” Full-text search looks for sources containing those exact terms, ensuring technical precision.

Combines Semantic and Full-text search. This approach often produces more relevant results, especially when a single method isn’t sufficient.

If you choose “Hybrid,” you’ll be prompted to enter percentage values to define how much each search type contributes to the final results.

Example: For a query such as “smartphone battery draining quickly” (for example, semantic: 60%, full-text: 40%), a Hybrid search might:

  • Use Semantic search (60%) to understand concepts related to battery conservation and power management

  • Use Full-text search (40%) to ensure that specific terms such as “smartphone” and “battery” appear in the results

This combination ensures you get sources that both specifically mention the key terms and understand the underlying issue of power consumption, even if they don't use the exact phrase "draining quickly."

Set a percentage when selecting Hybrid search

When to use each search type

  • Choose Semantic search when: Your content includes different ways of expressing similar concepts, or when users may use terminology that differs from what appears in your sources.

  • Choose Full-text search when: Precise terminology is critical, such as product codes, specific error messages, or technical documentation where exact wording matters.

  • Choose Hybrid search when: Your knowledge base includes both technical specifications and conceptual information, or when you want to balance exact matches with understanding user intent.

circle-info

For better results, especially in complex scenarios, consider using Hybrid search to combine semantic understanding with exact word matching.

Advanced settings

You can now customize each collection with advanced settings to fine-tune how information is retrieved using the following parameters:

  • Top K: Defines how many paragraphs the system considers when searching for information. With a lower K value, the model selects from the most likely options, resulting in more focused and relevant results and improving perceived accuracy.

  • Similarity Threshold: Sets the minimum relevance level required for results to be considered relevant. Lower values may return broader but less predictable results. Higher values produce more precise matches.

  • Previous user inputs: Customizes search based on earlier user messages. The default value is 0, meaning only the current message is considered. Adjust the slider to include previous messages. This helps the system understand context when a previously discussed topic is revisited. For example, if a user asks about “unlock new credit card” and then follows up with “how do I do it?”, the system understands that the follow-up refers to “unlock new credit card.”

Training

Now, you have to make sure that they are now part of the knowledge base. You do that by using the training process. Click on the button Training on the top right corner. Train each collection individually to keep the agent up to date with the latest information. After making changes, retrain the affected collections to ensure the knowledge base stays up-to-date.

You can train using the icon available in the list or via Training section.

Resfresh icon to train the selected collection
circle-info

Retrain the agent every time you add or edit a source, create or edit a question, change the document linked to the question.

The training process of a PDF may take a little longer than training TXT files. If you face any issues with specific files in cases of very large trainings (which may involve too many documents or PDFs close to the 100-page limit), wait a few moments and try again.

Delete Collection

When deleting collections, the system carefully handles associated content. Questions and sources within the deleted collection remain accessible, ensuring that no critical information is lost.

circle-exclamation

Last updated

Was this helpful?