Blog post

How to Run Your First Performance Calibration Session — 60-Minute Framework

A practical 60-minute script for running your first calibration session. Includes pre-read checklist, agenda template, facilitator talking points, and post-session actions.

How to Run Your First Performance Calibration Session — 60-Minute Framework
Last updated: March 2026

How to Run Your First Performance Calibration Session — 60-Minute Framework

You've gathered ratings. Managers have submitted feedback. Now you're facing the actual meeting. It's your first calibration session. You're wondering: Will it run long? Will managers fight each other? What do I actually say?

Here's the good news: A calibration session doesn't need to be a marathon. Sixty minutes is enough. But only if you have a script.

This is that script.

What You're Actually Doing in 60 Minutes

Calibration isn't a strategy meeting. It's a structured discussion where managers align on whether ratings are fair and consistent.

You're answering three questions:

  1. Did everyone use the same rating scale? (A 4 in one team means the same thing as a 4 in another team.)
  2. Are there unjustified pay gaps? (Two people in similar roles with the same work, but different ratings.)
  3. Are we missing context? (One manager's team all scores high because they don't see the bigger picture.)

That's it. Sixty minutes is enough because you're not reinventing decisions. You're checking for consistency.

The 60-Minute Breakdown

Minutes 0–5: Context & Rules (5 min)

You say (facilitator):

"We're here to make sure ratings are fair. This isn't about negotiating. If we find that similar work got different ratings, we'll talk about it. The goal is consistency, not pain. You know your people best. We're asking you to explain your thinking, and we'll make a call based on evidence."

Then clarify:

  • This meeting locks in final ratings. (No "I'll think about it.")
  • Time limit: 2–3 minutes per person. (You'll keep it tight.)
  • Decision authority: HR facilitator (you) makes the final call if there's disagreement.

Hand out or reference on screen:

  • The rating definitions. (One page. Everyone should have this before the meeting.)

Example:

  • Rating 3: Meets expectations. Gets the job done consistently.
  • Rating 4: Exceeds expectations. Takes on stretch work, mentors others.
  • Rating 5: Far exceeds. Drives company strategy, exceptional impact.

What it sounds like when you run this right:

"Sarah, you've prepped. Marcus, you've prepped. If we see a Sarah-Marcus comparison where they're rated 3 and 5 but did similar work, we'll ask questions. That's how we get consistent. OK? Let's go."

Time checkpoint: You've used 5 minutes. 55 left.


Minutes 5–40: Calibration Review (35 min)

This is the core. One person at a time. 2–3 minutes each.

For each employee, this is the rhythm:

Step 1: Manager Speaks (60 seconds) The person's direct manager goes first. They say:

  • The rating (1–5)
  • One sentence: Why this rating
  • Any context (new in role, stretch project, documented issue)

Example: "Sarah is a 4. She led the client retention project solo, brought in $500K in contracts, and documented the playbook so others could replicate it. She's ready for advancement."

Step 2: Other Managers React (60 seconds) Peer managers who know this person, or work in adjacent teams, speak up:

  • Do they agree?
  • Any different context?
  • Have they worked with this person?

Example: "I agree. I've seen her present to clients. She's articulate. That's a 4."

Step 3: You Ask (30 seconds) You (HR facilitator) ask one clarifying question if the rating is:

  • Surprising (doesn't fit the pattern of other ratings)
  • Outlying (is this person's a 5 while everyone else is a 3?)
  • Risky (a rating of 1 with no documented improvement plan)

Example questions:

  • "Is there specific evidence? A project? Feedback?"
  • "How does that compare to [other person] who we rated a 3?"
  • "Have you documented this concern with them?"

Step 4: You Decide (30 seconds) You confirm or adjust:

  • "We're confirming Sarah as a 4." (Or: "We're moving her to a 3 because the impact isn't quite there yet.")
  • Move to the next person.

For a team of 20 people: 20 people × 3 minutes = 60 minutes max. If you keep people tight (and you will), you're done in 50.

What it sounds like when you run this right:

You: "OK, Michael. Next."

Manager: "Michael is a 3. He completes his work on time, gets good feedback, but hasn't taken on anything beyond his role yet."

Peer manager: "I'd agree with that."

You: "Got it. Confirmed as a 3. Sarah, you're next."

[No debate. No politicking. Just data in, rating out.]

Troubleshooting: What if someone disagrees?

Sometimes a peer manager will say: "Wait, I'd rate Michael a 4. He helped my team with X."

Don't let this spiral. Ask one follow-up:

  • "What did he do that felt like a 4?"

Listen to the answer. Then you decide:

  • If it's new information that changes things, adjust.
  • If it's a difference in interpretation, pick one: "We're keeping him a 3 based on job scope. The help he gave your team is good work, but it's not a stretch assignment yet. Let's track growth in Q2."

Move on. This should take 90 seconds max.

Time checkpoint: After 20 people, you've used ~35 minutes. 25 left.


Minutes 40–50: Decisions & Implications (10 min)

Now that ratings are confirmed, flag the immediate actions.

You review:

  • Anyone rated a 1: Is there a performance improvement plan (PIP) in place? If not, you need one.
  • Anyone rated a 5: What's their development plan? When do you promote them?
  • Anyone rated 2: Are they on a growth track, or are they a risk?

Example:

"We confirmed three 5s: Sarah, Marcus, and Lee. I want to check: Sarah and Marcus are both ready for promotion by Q3. Lee is new, so probably Q4. Is that your sense?"

Managers confirm or adjust. You document.

"We confirmed one 1: David. I don't see a PIP on file. We need to create one and get David feedback within the week. Who's doing that?"

Someone volunteers (usually the manager). Done.

What it sounds like when you run this right:

"Three 5s. Two are getting promoted. One is getting a development plan. Two 2s: We think they're growth-oriented, so we're not flagging them yet, but we'll watch Q2. Any concerns? ... OK, I'll document that."

This is not a long discussion. Five to ten minutes max. You're setting post-calibration actions, not debating. Those actions happen after the meeting.

Time checkpoint: You've used ~45 minutes. 15 left.


Minutes 50–60: Wrap & Documentation (10 min)

You recap:

"Here's what happened: Confirmed 14 threes, 5 fours, 1 five. One 1 with a PIP action. Outliers were Sarah (4, mentor-ready) and David (1, needs structured feedback). You'll get the full results by email tomorrow."

You set expectations:

"Final ratings are locked as of now. Next step: We use these ratings to make pay decisions. That conversation is separate, and you'll hear about it by Friday. Next calibration is [date]."

You thank them:

"This took focus. You showed up ready. That's how calibration works."

Document immediately (during the call, if possible):

  • Final rating for each person
  • Attendees
  • Any outlier decisions and rationale
  • Post-calibration actions (PIPs, promotion plans, etc.)

Send it within 48 hours.

Time checkpoint: 60 minutes. Done.


The Pre-Session Prep (Do This Before You Walk Into the Room)

You can't run the meeting well without prep. This happens the week before.

Send the Pre-Read (1 week before)

Send each manager a spreadsheet or document with:

  • Employee name
  • Their rating (what the manager submitted)
  • Key accomplishment from the review cycle (one sentence)
  • Tenure (how long they've been with the company)
  • Any documented concerns (is there a PIP, formal warning, or public complaint?)

Managers review this and think: "Are my ratings consistent with everyone else's?" They come to the meeting prepared.

Example pre-read:

Name Manager Rating Key Accomplishment Tenure Notes
Sarah Chen Alex 4 Led client retention project, $500K impact 2.5 yrs Ready for advancement
Michael Roberts Alex 3 Completes work on time, solid feedback 1.5 yrs No growth stretch yet
David Park Alex 1 Inconsistent delivery, missed two deadlines 0.5 yrs Needs structured feedback

Managers see this, and they're not surprised in the room. They've thought about Sarah being a 4.

Confirm Logistics (1 day before)

  • Who's attending: List of managers + you (HR)
  • Where: A room or call where you can run 60 minutes uninterrupted
  • Materials: Printed or digital copy of rating definitions
  • Technology: Timer (or just a clock). You're keeping time tight.

Get Your Mindset Right

As the facilitator, your job is:

  1. Move the conversation fast. No tangents. One question per person, max.
  2. Surface inconsistencies. "Sarah's a 4, Marcus is a 3, same role, similar impact. What's the difference?"
  3. Make the call. If people disagree, you decide. Not a vote. Not consensus. Your call.
  4. Document immediately. Ratings lock. No "I'll think about it."

Don't be defensive. Don't lecture. Just move.


The 1-Page Agenda Template (Print This, Hand It Out)

PERFORMANCE CALIBRATION SESSION
Date: [DATE]
Time: 60 Minutes
Attendees: [MANAGER NAMES], [HR NAME]

AGENDA:
0:00–0:05    Opening & Rating Definitions           [Facilitator]
0:05–0:40    Person-by-Person Review (20 people)    [All]
0:40–0:50    Decisions & Next Steps                 [Facilitator]
0:50–1:00    Recap & Documentation                  [Facilitator]

RATING DEFINITIONS (Reference During Meeting):
1 = Does not meet expectations. PIP required.
2 = Meets some expectations. Growth opportunity.
3 = Meets expectations. Reliable contributor.
4 = Exceeds expectations. Ready for more responsibility.
5 = Far exceeds expectations. Top performer.

KEY RULES:
• Time limit: 2–3 minutes per person
• One follow-up question per outlier
• HR facilitator makes final call if disagreement
• Ratings are locked after this meeting
• Results distributed by email within 48 hours

NEXT STEPS AFTER CALIBRATION:
1. Document final ratings (today)
2. Create PIPs for any 1 ratings (by EOW)
3. Development plans for 5 ratings (by next Friday)
4. Pay decisions made separately (by Friday)

Print this. Hand it out. Reference it. Keeps people focused.


When Things Go Wrong (And They Might)

Issue 1: A Manager Argues About Their Rating

What they say: "Sarah should be a 5. She's the best person on my team."

What you do: Ask one question: "What makes her a 5 versus a 4?"

If they say something like: "She's just the best," that's not data. Say: "I need specifics. What did she do that no one else did?"

If they give evidence: "She led a $500K project, mentored three people, and moved us to a new platform," and that's different from the 4 you confirmed, then move her to a 5.

But don't let them argue. You're not debating. One question, one answer, you decide. Move on.

Don't say: "Everyone thinks you're biased." Do say: "Help me understand what that looks like."

Issue 2: Someone Says, "I Don't Know This Person Well Enough to Rate Them"

What you do: "That's fair. Then we go with what their direct manager said, unless someone else knows them. Anyone?"

If no one knows them, the rating stands.

Issue 3: You're Getting Behind on Time

20 people × 3 minutes = you should be at minute 40 when you're done with calibration.

If you're at minute 50 and still going, cut the remaining people short.

2 minutes per person, no follow-ups unless it's a 1 or 5.

"I know you have more context, but we're tight on time. The rating stands. If something urgent comes up later, we'll revisit."

This sounds harsh. It's not. Calibration isn't about perfect decisions. It's about consistent decisions. Tight timelines force consistency.

Issue 4: Someone Gets Emotional

A manager gets upset that their report was rated lower than expected.

You say: "I hear that this is frustrating. Let's talk about what this rating means next steps. If they're a 2, it means growth opportunity, not 'you're bad at your job.' We can talk about what they need to get to a 3 next quarter."

Move on. If they need to debrief, do it after the meeting.


After the Meeting: The Follow-Up Checklist

Within 24 hours:

  • Document final ratings (name, rating, attendees, date)
  • Flag any ratings of 1 for immediate PIP action
  • Flag any ratings of 5 for promotion/development planning

Within 48 hours:

  • Send managers the full results (spreadsheet or summary)
  • Confirm who's creating PIPs for 1 ratings
  • Schedule development conversation for 5 ratings

Within 1 week:

  • All 1-rated employees have a PIP conversation and written plan
  • All 5-rated employees have a development conversation
  • Pay decisions made separately using calibrated ratings

Keep: A record of attendees, final ratings, and any decisions. You'll reference this next cycle.


Common Mistakes (And How to Avoid Them)

Mistake 1: No Rating Definitions

Managers think a 4 means different things. Someone's 4 is another's 3.

Fix: Print the rating definitions. Post them. Reference them. "Is this person a 4 or 5? Check the definition. Does she exceed expectations AND take on stretch work? Yes to both? That's a 4."

Mistake 2: Letting One Manager Dominate

One talkative manager takes 10 minutes per person. The meeting runs 2 hours.

Fix: Use a timer. Audibly. "You've got 2 minutes. What's the rating?" Not rude. Just enforcing the structure.

Mistake 3: Confusing Calibration With Pay Discussions

"This person should be a 5, but we don't have budget for a raise."

Fix: Separate them. Rate based on performance. Decide pay later. This cycle, a 3. Next cycle, if they're a 4, they get a raise.

Mistake 4: No Documentation

You discuss, decide, then... nothing is written down. Next quarter, no one remembers.

Fix: Document during the call. Even a quick note: "Sarah 4 (mentor-ready). Marcus 3 (growth track). David 1 (PIP)." Write it while you're in the meeting. Send it within 48 hours.

Mistake 5: Ignoring Patterns

All your high performers are from one department. All your low performers are from another.

Fix: After the meeting, look at your data. Are there demographic differences? Team differences? If yes, ask why. This is your bias detector.


How to Know You Did It Right

After the meeting, ask yourself:

  • Did we discuss every person in 60 minutes?
  • Did we surface rating inconsistencies?
  • Did managers feel heard (not bulldozed)?
  • Did we lock in final ratings (not "we'll decide later")?
  • Did we flag follow-ups (PIPs, promotions)?
  • Did we document it?

If you checked all of these, you ran a good calibration. You didn't need to make the "perfect" decision on every person. You made consistent decisions, and you documented your thinking.

That's a win on your first try.


Your Next Calibration Will Be Easier

Your second calibration takes half the time. You have:

  • A template (the one above)
  • Last cycle's ratings for comparison
  • Managers who know the format
  • Documentation that shows your thinking

Calibration is most painful the first time. After that, it's a rhythm.


How Confirm Automates the 60-Minute Framework

Running this meeting manually (with email, spreadsheets, and notes) adds overhead you don't need.

Confirm centralizes calibration:

  • Ratings and justifications are already in one place
  • Pre-read data is auto-compiled
  • You flag inconsistencies in real-time (the system shows you outliers)
  • Attendees and timestamps are automatic
  • Next cycle, last cycle's ratings are available for comparison
  • Documentation happens as you make decisions

The 60-minute meeting is still about people having a conversation. But Confirm removes the data-gathering busywork. You focus on the conversation, not the spreadsheets.

Your first calibration with Confirm: 60 minutes, fully structured. Your second: 40 minutes, with 3 cycles of history for comparison.


Your Next Step

If you're running your first calibration:

  1. Schedule the meeting. Pick a team of 15–25 people to start. Larger groups = longer meetings.
  2. Do the pre-work. Send the pre-read one week before. It changes everything.
  3. Prep your agenda. Print the 1-page template above. Hand it out. Stick to it.
  4. Run it tight. Timer visible. Time limit per person. You're in control.
  5. Document immediately. Email results within 48 hours. Next cycle is easier.

Calibration isn't complicated. It's a 60-minute conversation with structure. Get the structure right, and you eliminate months of rating inconsistency.

Your first calibration sets the standard for everything that comes next.


Ready to streamline your calibration process? Confirm brings all your performance data into one platform: rating definitions, pre-reads, inconsistency detection, and documentation all built in. Learn how Confirm simplifies calibration and gets your team on the same page.


Key Takeaways

  • 60 minutes is enough. With structure, you calibrate 20 people in one hour.
  • The pre-read is half the battle. Send managers data one week early. They come prepared.
  • Keep it simple. One person at a time. Manager speaks. Others react. You decide.
  • Tight time limits force consistency. When people have 3 minutes, they can't debate forever.
  • Document immediately. Ratings lock. Next cycle, you have a baseline.
  • Your second calibration is easier. First one is the setup. After that, it's a repeatable rhythm.
  • Calibration is your bias detector. Look for patterns. Missing context. Unjustified gaps.

See Confirm in action

See why forward-thinking enterprises use Confirm to make fairer, faster talent decisions and build high-performing teams.

G2 High Performer Enterprise G2 High Performer G2 Easiest To Do Business With G2 Highest User Adoption Fast Company World Changing Ideas 2023 SHRM partnership badge — Confirm backed by Society for Human Resource Management

Ready to see Confirm in action?