Free Guide for HR Leaders
The Calibration Playbook
How to run fair, bias-resistant calibration sessions — and what data you actually need before walking into the room.
Most calibration sessions are advocacy contests. Loud managers win. Quiet contributors lose. This playbook shows you how to fix that.
Why calibration sessions go wrong
Calibration is supposed to fix rating inconsistency. In most organizations, it makes things worse — just more slowly, with more stakeholders in the room.
The theory is sound: bring managers together, compare ratings, align on standards, produce fair outcomes. The reality is messier. Calibration sessions become advocacy contests. The employees who leave with strong ratings are often the ones with the loudest managers, not the highest performers.
When managers walk in with their ratings and their recency-skewed mental summaries, the first person to speak sets the anchor. Dissent is socially costly. Changing a colleague's rating requires political capital most managers won't spend on someone else's report.
The problem isn't the format. It's the data — or rather, the lack of it.
What calibration looks like without data
Managers with louder voices, more political capital, or more confidence advocate harder. Ratings calibrate to advocacy skill, not performance.
The first rating voiced becomes the anchor. 80% of initial ratings survive the process unchanged — not because they were accurate, but because nobody pushes back.
Senior managers' opinions carry more weight than they should. Employees visible to leadership get rated differently than equally performing employees who aren't.
The six biases that hijack calibration rooms
These patterns appear in nearly every calibration session. They're not malicious — they're structural. And they interact.
Recency bias
The last 4–6 weeks dominate the mental picture. Eleven months of strong work disappears behind one rough sprint — or one visible win right before review season.
Halo effect
One standout achievement colors the whole rating. The employee who led the Q4 launch "is a top performer" — even if the other 50 weeks of their year were average.
Affinity bias
Managers rate employees they like, relate to, or work closely with more favorably. Remote workers and employees from different backgrounds consistently lose.
Central tendency
Risk-averse managers cluster everyone in the middle. Avoids conflict and accountability. Produces ratings that are meaningless for talent decisions.
Visibility bias
Employees who are more visible — in meetings, on Slack, in the office — get rated higher. The quiet contributor doing cross-team foundational work loses to the person who makes noise.
Anchoring
The first rating voiced becomes the anchor. Subsequent discussion adjusts around it. The first speaker has outsized influence on final outcomes — regardless of their data.
The data you need before you walk in
Walking into calibration with only manager ratings is like running a board meeting with only one person's opinion. You need independent inputs before the discussion starts.
Calibrated manager ratings
Not raw ratings — calibrated ones. Every manager has a distribution tendency: some rate high, some rate harshly, some cluster in the middle. A 4 from a lenient rater is worth less than a 4 from a strict rater. Adjust ratings against each manager's historical distribution before the session. Without this, you're comparing opinions with more opinions.
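One simple way to make that adjustment is a z-score: re-express each rating against the manager's own historical mean and spread, then map it back onto the shared scale. A minimal sketch in Python, assuming a 1–5 scale and a per-manager rating history; the function name and sample data are illustrative, not any particular HRIS schema:

```python
from statistics import mean, stdev

def calibrate(rating, history, scale_mean=3.0, scale_sd=1.0):
    """Standardize a rating against the manager's own historical
    distribution (z-score), then map it back to the shared 1-5 scale."""
    mu, sd = mean(history), stdev(history)
    z = (rating - mu) / sd if sd > 0 else 0.0
    return max(1.0, min(5.0, scale_mean + z * scale_sd))  # clamp to scale

lenient_history = [4, 5, 4, 4, 5, 4]  # a manager who habitually rates high
strict_history = [2, 3, 2, 3, 3, 2]   # a manager who habitually rates low

print(calibrate(4, lenient_history))  # ~2.4: below the shared midpoint
print(calibrate(4, strict_history))   # 5.0 (clamped): well above it
```

The same raw 4 lands on opposite sides of the midpoint once each manager's habits are removed — which is exactly the comparison the session needs.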
Peer contribution nominations
Structured data on who employees identify as high contributors and trusted collaborators. This is the only way to see cross-functional impact that a single manager's view can't capture. Organizational network analysis (ONA) automates this — asking employees structured questions about collaboration and trust. The output shows who the organization values, independent of who their manager values.
Performance trend data
Current ratings in isolation are noisy. An employee moving from a 3 to a 4 to a 5 over three cycles is a different risk/opportunity than someone flat at 4. Trend data separates rising stars from plateaued performers. It also catches the declining performer before they become a problem — not after.
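Separating those cases is a small computation: fit a slope through the last few cycle ratings and bucket the result. A hedged sketch, assuming at least two cycles stored oldest to newest; the thresholds are illustrative, not a standard:

```python
def trend(ratings):
    """Least-squares slope of ratings across equally spaced cycles.
    Assumes at least two cycles, ordered oldest to newest."""
    n = len(ratings)
    x_mean = (n - 1) / 2
    y_mean = sum(ratings) / n
    cov = sum((x - x_mean) * (y - y_mean) for x, y in enumerate(ratings))
    var = sum((x - x_mean) ** 2 for x in range(n))
    return cov / var

def classify(ratings, eps=0.25):
    slope = trend(ratings)
    if slope > eps:
        return "rising"
    if slope < -eps:
        return "declining"
    return "plateaued"

print(classify([3, 4, 5]))  # rising: +1.0 per cycle
print(classify([4, 4, 4]))  # plateaued: 0.0
print(classify([5, 4, 3]))  # declining: -1.0
```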
Collaboration signal data
Passive signals from Slack, email, calendar, GitHub, and Jira show who is doing real cross-team work versus working in a silo. The employee contributing to five other teams' projects but rated by only one manager is massively underrepresented in traditional calibration. This data makes invisible contributions visible.
How to structure a fair calibration session
Structure matters more than facilitation skill. A well-structured session produces consistent outcomes even with average facilitators.
Pre-calibration: alignment on standards (1 week before)
Send calibration packets to all managers. Share the performance rating rubric with behavioral examples. Collect ratings independently before the session — then lock them. Preventing post-submission changes removes one layer of social pressure from the room.
Open with data, not discussion (first 15 minutes)
Before any verbal discussion, show the aggregate distribution: how many employees are in each bucket across all managers. Point out statistical outliers. Note where ONA scores diverge from manager ratings. This reframes the session from "defend your ratings" to "explain these patterns."
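That opening view takes only a few lines of analysis: bucket counts across the whole population, plus a flag for any manager whose average sits far from the room's. A minimal sketch, assuming ratings arrive as (manager, rating) pairs; the 0.75 outlier threshold is an illustrative choice:

```python
from collections import Counter, defaultdict

ratings = [("ana", 5), ("ana", 5), ("ana", 4), ("ben", 3),
           ("ben", 2), ("ben", 3), ("cal", 3), ("cal", 4)]  # toy data

# Aggregate distribution: how many employees in each bucket overall.
overall = Counter(r for _, r in ratings)
print(dict(sorted(overall.items())))  # {2: 1, 3: 3, 4: 2, 5: 2}

# Per-manager means, flagged when they sit far from the room's mean.
by_manager = defaultdict(list)
for manager, r in ratings:
    by_manager[manager].append(r)

room_mean = sum(r for _, r in ratings) / len(ratings)
for manager, rs in sorted(by_manager.items()):
    m = sum(rs) / len(rs)
    flag = "  <-- statistical outlier" if abs(m - room_mean) > 0.75 else ""
    print(f"{manager}: mean {m:.2f}{flag}")
```

Putting this on screen first gives the room a shared factual starting point before anyone advocates.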
Prioritize edge cases, not sequential review
Going employee by employee is how you run out of time and rubber-stamp the last 40%. Focus discussion on three types: employees where manager rating and ONA scores diverge significantly, employees on the boundary between two rating levels, and employees nominated as potential HiPos or flight risks. Ratify the rest without extended debate.
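The divergence-first queue is straightforward to build before the session. A sketch covering the first two edge-case types, assuming manager ratings and ONA scores are already on the same 1–5 scale; both thresholds are illustrative:

```python
# Each record: (employee, manager_rating, ona_score), same 1-5 scale.
people = [("dee", 4.0, 2.1), ("eli", 3.0, 3.2),
          ("fay", 3.5, 3.4), ("gus", 2.0, 4.6)]

DIVERGENCE = 1.0  # manager view vs. peer view gap worth discussing
BOUNDARY = 0.25   # within this distance of a rating-level cutoff

def needs_discussion(rating, ona):
    if abs(rating - ona) >= DIVERGENCE:
        return "diverges from ONA"
    if abs(rating - round(rating)) >= BOUNDARY:
        return "on a rating boundary"
    return None

# Hardest cases first; everyone else is ratified without debate.
for name, rating, ona in sorted(people, key=lambda p: -abs(p[1] - p[2])):
    reason = needs_discussion(rating, ona)
    print(f"{name}: {f'discuss ({reason})' if reason else 'ratify'}")
```

HiPo and flight-risk nominations, the third edge-case type, would come from a separate nomination feed and simply join the front of this queue.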
Document the why, not just the what
"Moved from 4 to 3 because ONA data showed limited cross-team contribution despite strong manager advocacy" is a defensible record. "4" with no context isn't. Documentation protects the organization in disputes and creates accountability for calibration decisions.
What's in the playbook
10 pages of practical tools and frameworks. Not theory — templates you can use in your next calibration cycle.
Why calibration sessions fail
The advocacy problem, the data problem, and the politics problem — and why better facilitation alone won't fix any of them.
The six calibration biases
Detailed breakdown of recency, halo, affinity, central tendency, visibility, and anchoring — with the specific data fix for each.
Pre-session data requirements
The calibration prep packet template. What to send managers before the session, and how to prepare the four data types that make calibration work.
Session structure framework
The four-phase calibration framework. Opening script, edge case prioritization, and documentation requirements that hold up in audits.
Templates and checklists
Pre-calibration checklist, rating rubric with distribution targets, 3-hour session agenda, and post-calibration action checklist.
AI in the calibration session
What AI can and can't do in calibration — and how Confirm uses ONA data, AI profiles, and real-time bias detection to cut session time by 73%.
Free with a Confirm demo request. No spam. Instant access.
Frequently asked questions
What is performance calibration?
Performance calibration is the process of aligning manager ratings across teams to ensure consistency and fairness. Without calibration, you get leniency in some teams, harsh grading in others, and no consistency in how compensation, promotion, and performance management decisions are made. Calibration sessions bring managers together to review ratings against a shared standard.
What are the most common biases in calibration sessions?
The six most common calibration biases are: recency bias (last few weeks dominate the full-year assessment), halo effect (one achievement colors the whole rating), affinity bias (managers favor employees they like), central tendency (managers cluster everyone in the middle), visibility bias (visible employees rated higher than quieter contributors), and anchoring (the first rating voiced shapes all subsequent discussion). These biases interact and compound — which is why facilitation training alone doesn't fix them.
What data do you need before a calibration session?
Fair calibration requires four types of data: calibrated manager ratings (adjusted for each manager's distribution tendency), peer contribution data from ONA or structured nominations, performance trend data across 2–3 cycles, and collaboration signal data showing cross-team contribution. Walking in with only manager ratings means calibrating opinions against more opinions.
How do you run a fair calibration session?
Four phases: pre-calibration data distribution and independent rating collection, session opening with aggregate data before any verbal discussion, edge-case prioritization (don't go employee by employee — start with the hardest cases), and documentation of the reasoning behind every rating change. The most common mistake is going sequentially through the list, which exhausts time before the most important discussions happen.
How does AI help with calibration?
AI contributes to calibration in three areas: generating calibration profiles at scale (pulling ONA data, performance history, and collaboration signals into structured pre-reads for every employee), flagging statistical bias patterns (detecting when a manager's ratings consistently diverge from peer nominations), and real-time language monitoring during sessions (flagging gendered language, personality-based rationale, and specificity gaps). AI surfaces data — humans make the decisions.
How long should a calibration session take?
A well-prepared session for 40–60 employees should take 2–4 hours. Teams that run 2–3 day calibration marathons usually lack pre-session data — managers arrive unprepared and the session becomes the data-gathering exercise. When calibration packets are sent in advance and ratings are submitted before the session, the discussion shifts from information-gathering to decision-making.
See calibration with real data
Confirm brings ONA data, AI-generated profiles, and bias detection into your calibration process. Most teams complete full calibration in 2–4 hours.
Free for HR leaders and CHROs. No commitment required.
