Perform evaluation, error analysis, and tuning Questions

Practice questions for Perform evaluation, error analysis, and tuning topic in GitHub Agentic AI Developer. 36 questions covering this domain.

36 questions8 easy19 medium9 hard

medium

In the VS Code Chat Debug view, which section is the best place to verify that custom instructions or an agent description were actually included in t...

medium

Evaluation shows an agent repeatedly picks the wrong external tool for a task. Which tuning action best matches GH-600?

medium

A team wants objective evaluation signals for an agent that updates dependencies. Which signal source best matches GH-600?

easy

Which root-cause category is explicitly listed in the GH-600 study guide for agent failures?

hard

During a long agent session, the model's answer appears cut off. According to VS Code troubleshooting guidance, what is the best next step?

easy

Which evaluation setup best aligns with GH-600 guidance for agent tasks?

hard

An enterprise administrator wants to measure pull request outcomes for work created by Copilot cloud agent. Which metric is explicitly available in Co...

medium

An agent produced the wrong change, and you need to identify why. Which evidence sources does GH-600 say to inspect?

medium

In the VS Code Chat Debug view, which section lets you verify the inputs and outputs of tools that were invoked during a request?

Q10

hard

You want to use the /troubleshoot command to ask why a session was slow and which customizations loaded. What must be enabled first?

Q11

medium

A path-specific .instructions.md file seems to have no effect. According to VS Code troubleshooting guidance, what should you check first?

Q12

medium

Which debug view is designed to visualize interactions between agents and subagents during a complex run?

Q13

hard

Evaluation shows an agent keeps following obsolete intermediate notes and misses the current task direction late in long runs. Which tuning action bes...

Q14

easy

What is the primary purpose of the Logs view in the Agent Debug panel?

Q15

easy

Which Chat Debug section lets you confirm the exact text that was sent as your request, including resolved # mentions?

Q16

medium

The AI answers generically and seems unaware of repository files. Which Chat Debug section should you inspect first?

Q17

medium

An expected MCP tool never runs during a request. Which check best distinguishes 'the tool was unavailable' from 'the model chose not to use it'?

Q18

medium

Which Agent Debug view shows aggregate statistics such as total tool calls, token usage, error count, and overall duration?

Q19

medium

A task says, 'Update logging only under src/api and do not change runtime behavior.' Which evaluation setup best matches that development intent?

Q20

hard

A failure might involve reasoning mistakes, tool behavior, and workflow state. Which evidence set provides the strongest basis for root-cause analysis...

Sign in to see all 36 questions

Create a free account to browse all questions — completely free during our launch phase.