# Dataset Dashboard

On the Datasets page, you can select a pre-created dataset and view dashboards that give you insights into the data processing tasks.&#x20;

Below is an example of a dataset dashboard and the associated each processing tasks.

<figure><img src="/files/5YlIyanqoCPv2eqv7mwD" alt=""><figcaption><p>Dataset used in a QNA recipe</p></figcaption></figure>

### **Optical Character Recognition (OCR)**

* OCR enables the extraction of text from images or scanned documents, making the data more accessible and searchable.
* **Count**: Number of dataset items processed using OCR.
* **Processing Status**: Indicates whether the OCR task was successful or if there were errors during processing.

### **Personally Identifiable Information (PII)**

* PII handling involves identifying and managing data that could potentially identify a specific individual, such as names, social security numbers, addresses, etc.
* **Count**: Number of dataset items scanned for PII.
* **Status**: Indicates success or errors in identifying and handling PII.

### **Chunking**

* Chunking is the process of splitting documents into smaller, manageable pieces, called chunks, which can be processed independently.
* **Count**: Number of dataset items that underwent the chunking process.
* **Status**: Indicates success or errors in chunking.

### **Embeddings**

* Embeddings are vector representations of data, such as words, sentences, or images, that capture the semantic meaning and relationships within the data.
  * **Count**: Number dataset items that underwent the embeddings generation process.
  * **Status**: Indicates success or errors in generating embeddings.

## Dataset Used in Batch Recipe

For the datasets that are used in Batch recipe, you see an additional chart of batch\_chain in the datasets dashboard.

<figure><img src="/files/FktLhaDE2wXMrTHRX3kx" alt=""><figcaption><p>Dataset used in batch recipe</p></figcaption></figure>

### **Batch-chain**

* Batch-chains refer to the sequence of tasks processed in batches to improve efficiency and manageability. This includes grouping data for processing and ensuring each step in the sequence is completed successfully.
* **Count**: Number of dataset items processed in the batch-chain.
* **Status**: Indicates whether each task in the batch-chain was successful or if errors were encountered.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://karini-ai.gitbook.io/karini-ai-documentation/dashboard-overview/dataset-dashboard.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
