Evaluating agent performance in Microsoft Copilot Studio

Creating agents is an iterative process. It involves understanding how your agents work, making changes, and seeing the results. This process helps you build better agents.

Copilot Studio 's activity tracking feature provides a visual map of the agent's input, decision, and output sequence within an activity, allowing you to identify problems and opportunities for improvement.

An activity map is created for each activity. An activity begins when an agent initiates a conversation or an external event triggers the agent. The map represents each activity using a node.

There are two forms of activity tracking:

  • Real-time activity map showing agent performance during testing.
  • Historical activity maps and records from previous operations.

Note :

This feature is only available for agents that have enabled the generation scheduling feature.

Only your interactions with the agent and instances where the agent uses your login information for authentication will be displayed. The agent must use Microsoft's built-in authentication to identify your interactions.

Real-time activity map during testing.

When you ask your agent a question in the Test your agent dashboard , the activity map displays a visual representation of the plan the agent has created. The activity map highlights errors, such as missing or invalid input or output parameters for actions, and shows the execution time of each step. You can also verify that the information your agent is sending or receiving is correct.

To have the activity map automatically appear when you send queries to your agent in the Test your agent table , select the three dots (…) and then turn on Show activity map when testing .

Images 1 of Evaluating agent performance in Microsoft Copilot Studio

Here, the user asked, "What will the weather be like in Toronto, Canada tomorrow?" You can see the action the agent chose to respond to, as well as the inputs and outputs.

Images 2 of Evaluating agent performance in Microsoft Copilot Studio

Check the event trigger.

When you test an event trigger, the test chat will display the trigger's payload as a message. Only you can see this message in Copilot Studio for testing purposes. Your agent users cannot see it. Use the payload messages to understand when and what information and instructions the trigger sends to your agent.

Follow along between topics

When testing your agent, you can track between topics. When you enable tracking, the activity map will show the nodes within the topic as they are executed when a topic is triggered as part of a plan. You can track the conversation flow.

Images 3 of Evaluating agent performance in Microsoft Copilot Studio

Note : Coordination activities generated within a topic will not appear in the activity map.

View the variables

You can view the variables used by the agent during testing. These variables can be system-wide variables, environment variables, system variables, or custom variables that the agent has access to.

To view the variables:

  1. Open your agent. The Overview page will appear.
  2. Select Test to open the Test your agent window .
  3. In the Test your agent panel , start a conversation with your agent by entering a query or prompt.
  4. To view the variables in the test, select Test > Variables . The Variables panel will open, displaying the variables used in the test.

Observe the agent's thought process during the test conversation.

The Chain of Thought (CoT) feature provides detailed information about the agent's reasoning process during testing. It shows the intermediate steps, considerations, and decisions the agent makes when processing input and producing output. This feature is useful for understanding complex interactions and debugging agent behavior.

Note: The Chain of Thought feature is only available for certain models such as GPT-5 Reasoning, Claude Sonnet, and Claude Opus.

To observe the agent's thought process or reasoning during the experiment, follow these steps:

  1. Open your agent. The Overview page will appear.
  2. Select Test to open the Test your agent window .
  3. In the Test your agent panel , start a conversation with the agent by entering a question or prompt.
  4. Before the agent responds, the reasoning process will appear in the Test your agent table , displaying the thought process for the agent's response.

History of operation

Whenever the agent starts an activity, including tests you create in Copilot Studio, the Activity page records that activity in real time. Access the Activity page to:

  • Review the interactions and decisions your agent made during an activity.
  • Identify instances where the agent's behavior doesn't align with your goals.
  • View the time it takes the agent to complete each task.
  • Find the details of the error.

The agent's activity history is available for:

  • Interactions in the Copilot Studio test chat
  • Agents are published on Microsoft Teams and Microsoft 365 Copilot channels.
  • The agents are published to the SharePoint channel.
  • Activities begin with an automatic trigger.

Note :

You must have a Microsoft Exchange license and an inbox to view the agent's activity history. The data is stored using Microsoft 365 services, which Azure's data terms and commitments do not manage. Instead, this feature is provided by Microsoft 365 services and stored in a location associated with your Exchange inbox. Microsoft 365's data storage terms and commitments govern historical activity data.

Administrators should use the Power Platform admin center to prevent Microsoft 365 from caching data. Disabling this setting will stop future data caching on Microsoft 365. Existing data stored in Microsoft 365 will be deleted or discarded according to the data retention policy established for Microsoft 365.

Review the following information on the Activity page :

  • A list of activities and their details.
  • The conversational process unfolds step-by-step within an activity.

List of activities

An activity is a unit of interaction by an agent that includes conversing with the user, registering and responding to events, or a combination of these activities.

In the list of activities, you can see:

  • Name : The name of the user interacting with the agent. If the activity is not user-related, such as when the agent performs actions without human prompting, the user will be displayed as Automated .
  • Channels : Channels are where interactions take place. Activities occurring within the Test your agent dashboard will have a Copilot Studio icon.
  • Date : The date the activity's first interaction began.
  • Completed steps : The number of steps the agent completed during operation.
  • Last step : The final step completed in the operation.
  • Status : The status of the activity (successful, failed, in progress).

Customize Activity table view

You can customize the Activity board view to focus on the information that is most relevant to you.

To customize the view of the Activity table:

  1. Access the agent's Activity page . A list of activities will appear.
  2. Select Edit columns . The Edit columns dialog box will appear with a list of available columns.
  3. In the Edit columns table :
    • Select the checkbox next to the column names you want to display.
    • Uncheck the checkboxes next to the column names you want to hide.
    • Drag and drop the columns to your desired location, or use the up and down arrows.
  4. Select Save to confirm and update the Activity table view.

Images 4 of Evaluating agent performance in Microsoft Copilot Studio

View the conversation in an activity.

On the Activity page , you can view the conversation within an activity as a visual activity map, or both the conversation transcript and the visual activity map. By default, the Transcript + Map view is displayed.

1. To view a conversation within an activity, select an activity.

  • To see the transcript and map working together, select View and then click Transcript + Map view .
  • To view only the active map, select View and then click Map view .

Images 5 of Evaluating agent performance in Microsoft Copilot Studio

2. To return to the list of activities, select the Back icon.

The Transcript + Map view displays the conversation that takes place with the agent during an activity, along with a visual activity map. The transcript captures user input, trigger payloads, and agent responses. Use the transcript to see how your agent responds to user conversations. You can also view the trigger payload messages your agent receives from event triggers.

Observe the agent's thought process.

The Chain of Thought (CoT) feature provides detailed information about the agent's reasoning process within an operation. It shows the intermediate steps, considerations, and decisions made by the agent when processing input and producing output. This feature is useful for understanding complex interactions and debugging agent behavior. You can only view the agent's thought process for a specific operation in Map view.

Note : The Chain of Thought feature is only available for certain models such as GPT-5 Reasoning, Claude Sonnet, and Claude Opus.

To see the agent's thought process or reasoning for a specific action:

  1. In the Activity tab , select the activity you want to review. By default, the Transcript + Map view will be displayed.
  2. Below the activity map, select the Reasoning arrow . The agent's thought process will appear.

Pinning or unpinning works.

On the Activity page , you can pin important activities to the top of your activity list for easy access or unpin them when they are no longer needed.

  • To pin an activity, hover over it and select the Pin icon next to its name.
  • To unpin an activity, hover over the pinned activity and select the Unpin icon next to its name.

View all agent activity in the activity map.

A map view is a visual representation of the input, decision, and response flow that occurs within an activity. Each activity your agent, user, or trigger performs has a node. When viewing the activity map, select a node to see details about the inputs, decisions, and outputs for that activity.

The following image is an example of what you see when you select a Knowledge node. It shows:

  • The query that the agent used to search for knowledge sources. This query may differ from a user-written query or input data provided by a trigger.
  • The response generated by the agent was based on knowledge sources.
  • The sources that the agent referenced.
  • Other sources that the agent searched but did not use to generate a response. The agent searched these sources but did not find information relevant to the query.

Images 6 of Evaluating agent performance in Microsoft Copilot Studio

In this example, you can see the details of an action node for "Get the forecast tomorrow" displaying the inputs and outputs:

Images 7 of Evaluating agent performance in Microsoft Copilot Studio

Select Edit to view and change the configuration related to the selected activity.

Rationale

Rationale provides an explanation of how an agent decides to call a particular tool. The AI ​​generates the rationale on demand, based on the agent's metadata and activity. Rationale is displayed to knowledge sources or connectors with a Completed status . Select Show rationale to display the rationale.

Images 8 of Evaluating agent performance in Microsoft Copilot Studio

When examining agent behavior, use Rationale to better understand why the agent chooses to call a particular tool or fill in parameters. It helps you troubleshoot agent behavior. However, because the AI ​​generates the reason, it may not be entirely accurate. Use your best judgment when considering the provided reason.

Agent status

The following table provides information about the different states an agent may encounter during its operation:

Send feedback

If you have suggestions on how to improve agent quality, please submit your feedback using the Feedback icon  in the Activity tab .

Images 9 of Evaluating agent performance in Microsoft Copilot Studio

Close
Category

System

Windows XP

Windows Server 2012

Windows 8

Windows 7

Windows 10

Wifi tips

Virus Removal - Spyware

Speed ​​up the computer

Server

Security solution

Mail Server

LAN - WAN

Ghost - Install Win

Fix computer error

Configure Router Switch

Computer wallpaper

Computer security

Mac OS X

Mac OS System software

Mac OS Security

Mac OS Office application

Mac OS Email Management

Mac OS Data - File

Mac hardware

Hardware

USB - Flash Drive

Speaker headset

Printer

PC hardware

Network equipment

Laptop hardware

Computer components

Advice Computer

Game

PC game

Online game

Mobile Game

Pokemon GO

information

Technology story

Technology comments

Quiz technology

New technology

British talent technology

Attack the network

Artificial intelligence

Technology

Smart watches

Raspberry Pi

Linux

Camera

Basic knowledge

Banking services

SEO tips

Science

Strange story

Space Science

Scientific invention

Science Story

Science photo

Science and technology

Medicine

Health Care

Fun science

Environment

Discover science

Discover nature

Archeology

Life

Travel Experience

Tips

Raise up child

Make up

Life skills

Home Care

Entertainment

DIY Handmade

Cuisine

Christmas

Application

Web Email

Website - Blog

Web browser

Support Download - Upload

Software conversion

Social Network

Simulator software

Online payment

Office information

Music Software

Map and Positioning

Installation - Uninstall

Graphic design

Free - Discount

Email reader

Edit video

Edit photo

Compress and Decompress

Chat, Text, Call

Archive - Share

Electric

Water heater

Washing machine

Television

Machine tool

Fridge

Fans

Air conditioning

Program

Unix and Linux

SQL Server

SQL

Python

Programming C

PHP

NodeJS

MongoDB

jQuery

JavaScript

HTTP

HTML

Git

Database

Data structure and algorithm

CSS and CSS3

C ++

C #

AngularJS

Mobile

Wallpapers and Ringtones

Tricks application

Take and process photos

Storage - Sync

Security and Virus Removal

Personalized

Online Social Network

Map

Manage and edit Video

Data

Chat - Call - Text

Browser and Add-on

Basic setup