Skip to main content

Datasets

Introduction

Datasets transform how your agents work with structured data. Instead of processing CSV files as plain text, Datasets allow your agents to query data like a database, delivering precise answers based on exact data points.

When you add a CSV file as a Dataset, Serenity* Star converts it into a queryable database table. Your agent can then filter, search, aggregate, and perform calculations on your data to provide accurate, data-driven responses.

Common Use Cases:

  • Product catalogs with inventory and pricing information
  • Customer databases for support agents
  • Policy documents with structured rules and guidelines
  • Pricing lists and availability tracking
  • Any structured data that requires precise querying and filtering

Understanding Datasets

Before creating your first dataset, let's understand the key concepts:

  • Dataset: A collection of one or more related tables. Think of it as a database that groups related information together.
  • Table: Each CSV file you upload becomes a table within your dataset. Tables contain rows and columns of structured data.
  • Queryable Data: Your agent can execute SQL queries against your data to find specific information, perform calculations, or filter results.

Create Your First Dataset

You can create datasets in two ways: manually from the Datasets module, or automatically when uploading CSV files through the Knowledge tab in the Agent Design Studio. We'll cover both methods.

Manual Dataset Creation

Use this method when you want full control over your dataset structure and plan to use the same data across multiple agents.

Step 1: Access the Datasets Module

Navigate to the Datasets module from the sidebar navigation.

Datasets sidebar access

If this is your first time accessing Datasets, you'll see an empty state prompting you to create your first dataset.

Step 2: Create a New Dataset

Click the "Create New Dataset" button to begin.

Create new dataset

Step 3: Fill in Dataset Details

You'll need to provide the following information:

  • Identifier: A unique identifier for your dataset (numbers and letters only, no spaces or special characters)

    • Example: ProductCatalog, CustomerData, HrPolicies
  • Display Name: A friendly name that will be shown throughout the interface

    • Example: Product Catalog, Customer Database, HR Policies
  • Description: A brief explanation of what data this dataset contains and its purpose

    • This helps the agent and team members understand the dataset content

Dataset creation form

Naming Conventions

Use descriptive identifiers with letters and numbers only (no spaces or special characters). We recommend using PascalCase like ProductCatalog or CustomerSupportData. This makes it easier to identify datasets as your library grows.

Step 4: Add Your First Table

To complete the dataset creation, you must add at least one table by uploading a CSV file.

  1. Click "Add Table" in the dataset view. If this is your first table, you'll see an empty state with the Add Table button.

    Add table button with empty state

  2. A side panel will open. Provide the table information:

    • Table Identifier: Unique name for this table (e.g., Products, Inventory)
    • Table Display Name: Friendly name (e.g., Products, Inventory Levels)
    • Table Description: What data this table contains
  3. Upload your CSV file by clicking Browse files or by dragging and dropping it

    Add table side panel

  4. After uploading, a new table card will appear showing the Pending status

    Table card with pending status

CSV Format Requirements

Your CSV file must:

  • Have column headers in the first row
  • Use consistent data types within each column
  • Use UTF-8 encoding

Step 5: Create the Dataset

Click "Create" to save the dataset and confirm all the changes you made.

Step 6: Wait for Processing

After creating the dataset, Serenity* Star will process all your CSV files in the background. This involves:

  • Analyzing the CSV structure
  • Inferring data types for each column
  • Creating a queryable database table
  • Generating a schema that helps your agent understand the data

You can track the processing status in the datasets grid. The status will change from Pending to In Progress to Completed.

Dataset processing status

Once the status shows Completed, your dataset is available for use.

Important

Any changes you make to datasets (adding tables, deleting tables, appending data, replacing data) from the datasets management screen will only take effect once you confirm the changes. Until then, your agents will continue using the current version of the dataset.

Automatic Dataset Creation from Agent Design Studio

When you upload CSV files directly in the Agent Design Studio's Knowledge tab, Serenity* Star automatically creates a special "Default Dataset" for that agent.

To upload CSV files, follow these steps:

  1. Navigate to your agent in the Agent Design Studio
  2. Go to the Knowledge tab
  3. Click "Add Knowledge" and select Files
  4. Upload one or more CSV files

For detailed instructions on uploading files to your agent's Knowledge, see the Knowledge documentation.

After uploading the files, Serenity* Star automatically:

  • Creates a dataset with identifier Default[YOUR-AGENT-CODE]
  • Names it Default dataset for [YOUR-AGENT-NAME]
  • Converts each CSV file into a separate table
  • Assigns the dataset to your agent

This default dataset appears in:

  • Your agent's Knowledge sources (each CSV file appears as an individual file, but they are tables in the default dataset that your agent can query)
  • The Datasets grid alongside your other datasets (for viewing and management)

Assign Datasets to Agents

Once you have created a dataset manually, you can assign it to one or more agents. Note that default datasets (created automatically from CSV uploads) can only belong to a single agent.

Step 1: Access Agent Knowledge Settings

  1. Navigate to your agent in the Agent Design Studio
  2. Go to the Knowledge tab
  3. Click "Add Knowledge"

Step 2: Select Dataset Option

From the side panel, select the Datasets option.

Add knowledge modal with datasets option

Step 3: Choose Your Dataset

A side panel will open showing all available datasets. Select the dataset you want to assign to your agent.

You can also create a new dataset directly from this panel by clicking "Create New Dataset".

Dataset selection panel

Step 4: Confirm Assignment

After selecting a dataset, it will appear in your agent's Knowledge sources list. Click "Save Changes" to finalize the assignment.

The dataset is now available for your agent to query during conversations.

Working with Tables

Once your dataset is created, you can manage its tables in various ways.

Add a New Table

To add additional tables to an existing dataset:

  1. Open the dataset from the Datasets module
  2. Click "Add Table"
  3. Provide the table details and upload your CSV file
  4. Click "Save" to confirm the changes to your dataset
  5. Wait for processing to complete

This is useful when you want to add related data to an existing dataset. For example, adding an Inventory table to complement your Products table.

Replace Table Data

Use Replace when you need to completely update a table's content with fresh data. This removes all existing rows and replaces them with the new data.

When to use Replace:

  • Daily inventory updates
  • Complete price list refreshes
  • Replacing outdated information

Important: The new CSV must have the same column structure (same number of columns and compatible data types) as the original table.

To replace a table:

  1. In the dataset view, locate the table you want to update

  2. Click the Replace button on the table card

    Table card with Replace button

  3. A side panel will open. Upload your new CSV file with the replacement data

    Replace table side panel

  4. The table will be marked for replacement

  5. Click "Save" to confirm the changes to your dataset

tip

While the replace operation is pending or in progress, your agent will continue to have access to the previous version of the table. The new data becomes available once processing is complete.

Append Data to Table

Use Append when you want to add new rows while preserving existing data. The new rows are added to the end of the table.

When to use Append:

  • Adding new products to a catalog
  • Accumulating historical data
  • Incremental updates without removing old records

Important: The CSV must match the existing table's structure exactly (same columns and data types).

To append data:

  1. Locate the table in your dataset

  2. Click the Append button on the table card

    Table card with Append button

  3. A side panel will open. Upload your CSV file with new rows

    Append data side panel

  4. The table will be marked for appending new data

  5. Click "Save" to confirm the changes to your dataset

tip

While the append operation is pending or in progress, your agent will continue to have access to the previous version of the table. The appended data becomes available once processing is complete.

Edit Table Records

You can edit data directly within Serenity* Star using the inline table editor.

  1. Click the Edit icon on the table card

    Table card with Edit button

  2. A paginated grid view of your data will open where you can:

    • Click any cell to edit its value
    • Use the search and filter options to find specific records
    • Navigate through pages of data
    • Click Save to apply your changes

    Table edit grid view

This is helpful for:

  • Correcting individual values
  • Making small updates without re-uploading the entire file
  • Quick data fixes

Download Table Data

You can export the current contents of any table as a CSV file. This downloads the current state of the data, including any edits you've made, not the original uploaded file.

From the Datasets Module:

  1. Open your dataset from the Datasets module

  2. Locate the table you want to download

  3. Click the Download button on the table card

    Download button on table card

  4. The table data will be exported as a CSV file

From the Agent Design Studio:

  1. Navigate to your agent in the Agent Design Studio

  2. Go to the Knowledge tab

  3. Locate the dataset in the knowledge sources list

  4. Click the Download icon next to the table name

    Download icon in Agent Design Studio

  5. The table data will be exported as a CSV file

Delete a Table

To remove a table from a dataset:

  1. Locate the table you want to remove in your dataset

  2. Click the Delete button on the table card

    Delete button on table card

  3. The table will be marked for deletion

  4. Click "Save" to confirm the changes to your dataset

Cannot Delete Last Table

A dataset must contain at least one table. If you try to delete the only table in a dataset, you'll need to delete the entire dataset instead.

Background Processing

When deleting a table that has already been created, the deletion operation runs in the background. Your agent will continue to have access to the table until the deletion process is complete.

Manage Datasets

View All Datasets

The Datasets module shows a grid of all your datasets with the following information:

  • Dataset name and identifier
  • Number of tables
  • Number of agents using this dataset
  • Current status
  • Creation date

Datasets grid overview

Use the search bar to quickly find specific datasets.

Edit Dataset Details

You can update a dataset's display name and description:

  1. Click on the dataset to open its details
  2. Click the Edit button
  3. Update the display name or description
  4. Save your changes
Identifier Cannot Be Changed

The dataset identifier is permanent and cannot be modified after creation. Choose your identifiers carefully when creating datasets.

Delete a Dataset

To delete a dataset:

  1. Locate the dataset in the grid
  2. Click the Delete button
  3. Confirm the deletion
Cannot Delete Assigned Datasets

You cannot delete a dataset that is currently assigned to any agent. First, remove the dataset from all agents, then delete it.

Best Practices

CSV Formatting

Do:

  • Include clear column headers in the first row
  • Use consistent data types within each column
  • Format dates as YYYY-MM-DD
  • Remove any merged cells or complex formatting

Don't:

  • Mix different data types in the same column
  • Use formulas or macros
  • Include multiple header rows
  • Use special characters that might cause parsing issues

Data Maintenance

  • Review regularly: Update your datasets as your source data changes
  • Use Replace for full updates: When refreshing entire tables, use Replace
  • Use Append for additions: When adding new records only, use Append
  • Test after updates: Always verify your agent works correctly after updating data
  • Clean up unused datasets: Remove datasets that are no longer needed

Supported CSV Format

  • Must be valid CSV (comma-separated values)
  • UTF-8 encoding
  • Headers required in first row
  • Consistent column count across all rows