Diversity & Inclusion
Measure Belonging: Practical Guide for Public Sector Teams

Measure Belonging: Practical Guide for Public Sector Teams

Pietro Beer • 6 April 2026

A circle diagram illustrating how to measure belonging, contrasting "In" behaviors like open communication with "Out" behaviors like passive aggressiveness.

Table of contents

The fastest way to measure belonging is to combine a short survey, a few outcome metrics, and follow-up listening
What belonging actually measures
The signals worth tracking first
How to build a score that is actually usable
Which tools to use and what each one misses
How to read the results without fooling yourself
What to do when the score is weak
A practical cadence for public-sector teams

Belonging is one of those culture signals that leaders feel before they can prove it. There is a practical answer to how to measure belonging: turn it into a small set of repeatable signals that show whether people feel accepted, respected, safe, and able to contribute. For public-sector teams, that matters because weak belonging shows up quickly in turnover risk, silence in meetings, and uneven experience across grades, locations, and protected groups.

The fastest way to measure belonging is to combine a short survey, a few outcome metrics, and follow-up listening

Use 5-8 Likert items to build a belonging score, then report both the mean and the top-two-box favourable rate.
Segment results by team, grade, location, tenure, and protected characteristics where sample size allows.
Pair perception data with retention, absence, grievance, internal mobility, and bullying or harassment signals.
Run one annual benchmark plus quarterly or monthly pulses after change, not a one-off survey.
Protect anonymity with a minimum reporting threshold of 10-15 responses, or whatever your governance standard requires.

What belonging actually measures

I do not treat belonging as a soft feeling with no operational value. It sits between culture and outcomes: inclusion creates the conditions, belonging reflects whether people actually experience them, and engagement shows how much energy they are willing to invest.

Concept	What it asks	What it usually captures
Inclusion	Can people participate fairly?	Access, voice, respect, opportunity, psychological safety
Belonging	Do people feel accepted and part of the group?	Authenticity, connection, trust, being valued, fit
Engagement	Do people bring effort and commitment?	Pride, motivation, advocacy, discretionary effort, intent to stay

The distinction matters. A team can be highly engaged and still feel cliquey. It can also feel welcoming on the surface while people quietly believe they cannot speak up or progress. If I want to understand belonging properly, I need to measure the felt experience, not just the mood.

That is why the next step is not to ask a single global question, but to identify the specific signals that make belonging visible in daily work.

The signals worth tracking first

When I build a belonging measure, I look for four layers of evidence rather than one headline number.

Direct sentiment - items about being accepted, respected, safe, heard, and able to be authentic.
Behavioural outputs - whether people stay, move internally, speak up, or take leave.
Equity gaps - whether one grade, department, location, or demographic group is consistently lower than the rest.
Qualitative clues - comments about meetings, managers, recruitment, progression, and everyday micro-behaviours.

If a team scores well overall but one subgroup is 8 to 12 points lower, I treat that gap as the real story. In public-sector settings, that split often appears before it becomes a retention issue or a grievance issue.

The Civil Service People Survey is a useful model here because it does not collapse everything into one culture score. It separates engagement, inclusion and fair treatment, and bullying and harassment, which is much closer to how belonging actually works in practice.

That layered view leads naturally to the question of what your score should look like and how short it can be without becoming flimsy.

Infographic shows how to measure belonging: +56% job performance, -50% turnover risk, +167% NPS, -75% sick days.

How to build a score that is actually usable

If you want something stable enough to track over time, use a brief validated scale rather than an improvised one-off question. Recent work in PLOS One developed a 7-item Belonging at Work Scale and found it reliable across several employee groups, which is exactly the kind of structure I would borrow when I need comparability and rigour.

When I design a score, I usually keep it simple:

Use one idea per item.
Keep the response scale consistent, usually a 5-point agree/disagree scale.
Cover a balanced mix of authenticity, voice, fairness, connection, and safety.
Calculate a composite score from the same items every time.
Report the mean score and the share who respond positively.

Domain	Example item	Why it helps
Authenticity	I can be myself at work.	Shows whether people are masking to fit in.
Voice	My views are listened to.	Captures whether people feel psychologically included.
Fairness	People like me are treated fairly here.	Links belonging to equity and trust.
Connection	I feel part of my team.	Measures day-to-day group attachment.
Growth	People like me can progress here.	Connects belonging to opportunity and retention.

I usually avoid overcomplicating the scale. Too many items slow response rates, and too many reverse-worded statements can muddy the data. Five to eight well-written items are normally enough for a recurring pulse, while a longer diagnostic survey can be reserved for deeper reviews.

Once the score is defined, the next question is how to collect it in a way that does not distort the results you are trying to understand.

Which tools to use and what each one misses

No single method is enough on its own. A survey tells you how much belonging people feel; conversations and operational data tell you why that number looks the way it does.

Method	Best for	Main limitation	My take
Annual or biannual survey	Benchmarking and trend tracking	Slower to reflect change	Best backbone measure for a public-sector dashboard.
Quarterly pulse	Checking whether interventions are moving the dial	Can be noisy if the sample is small	Useful after a policy change, restructure, or manager intervention.
Stay interviews or focus groups	Understanding what is driving the numbers	Small samples can overrepresent loud voices	Best used to explain patterns, not to replace the score.
Operational metrics	Seeing outcomes that often follow low belonging	They are lagging indicators	Track turnover, absence, internal moves, grievances, and promotion rates.
Open text comments	Finding the specific behaviours people notice	Hard to compare unless coded consistently	Excellent for themes like exclusion, voice, and manager behaviour.

Accessibility matters here as much as wording. If people cannot complete the survey easily on a phone, with assistive technology, or in their preferred language or format, the sample is already biased before the first answer is collected.

I also prefer anonymous collection for belonging work unless there is a strong reason not to. People are much more likely to be honest about exclusion, bias, or disrespect when they believe the feedback cannot be traced back to them.

That still leaves one difficult task: reading the results without telling yourself a comforting story.

How to read the results without fooling yourself

The most common mistake is to celebrate a healthy organisation-wide score and miss the fact that a few groups are clearly struggling. I would rather have a slightly messy dashboard that shows the truth than a polished one that hides the problem.

Here is the order I use when I review belonging data:

Look at the overall trend first.
Check response rate before trusting the score.
Cut the data by team, grade, location, and tenure.
Compare demographic groups where sample size is large enough to protect anonymity.
Look for alignment with manager trust, development access, workload, and fairness measures.
Cross-check with turnover, sickness absence, grievances, and internal mobility.

Small samples need discipline. As a rule of thumb, I would avoid reporting very small groups at all, and I would treat 10 to 15 responses as a minimum floor only if your governance rules permit it. For trend shifts, I pay more attention when I see a change of 3 to 5 points repeated across more than one pulse, because one noisy month can mislead you.

In the UK public sector, location and grade often matter as much as identity categories, so I always check those first. A low score in one directorate may have more to do with line management, workload, or local culture than with the organisation as a whole.

Once you have a credible read on the data, the point is not to admire the chart. It is to decide what to change.

What to do when the score is weak

Low belonging is useful only if it changes behaviour. In my experience, the best interventions are targeted at the driver behind the score, not just the score itself.

If the weakest area is	Likely issue	First move
Authenticity	People are masking to fit in	Review meeting norms, speak-up behaviour, and manager capability.
Voice	People do not think their views matter	Change decision-making routines and make follow-through visible.
Fairness	Development, reward, or workload feels uneven	Audit promotion, assignments, flexible working, and performance processes.
Safety	There is fear of speaking up	Strengthen reporting routes and deal directly with bullying or harassment.
Connection	People feel isolated from the team or organisation	Improve onboarding, team rituals, and cross-team relationships.

I would not expect a campaign alone to move these numbers very far. If workloads stay unmanaged, if line managers are not trained, or if promotion decisions remain opaque, the belonging score tends to stay stubbornly flat.

The stronger approach is to assign an owner, define a fix, and measure the same items again after the change has had time to work. That is where a short, reliable cadence becomes more valuable than a big annual launch.

A practical cadence for public-sector teams

If I were starting from scratch in a council, department, or arm’s-length body, I would keep the rollout simple and disciplined.

Weeks 1-2: choose 5-8 belonging items, one open comment question, and a reporting threshold.
Weeks 3-4: test accessibility, launch the survey, and explain why anonymity and response rates matter.
Month 2: cut the results by team, grade, location, tenure, and safe demographic groupings.
Month 3: pick two driver areas to fix, not ten.
Month 6: rerun the pulse and check whether the movement is real.

The short version is this: belonging is best measured as a repeated, segmented, mixed-method signal, not as a single vanity metric. If you get the question design right, protect the data properly, and act on the gaps people are showing you, the score becomes useful rather than decorative.

Frequently asked questions

Weak belonging can lead to high turnover, silence in meetings, and uneven experiences across different groups, directly impacting team effectiveness and public service delivery. Measuring it helps identify and address these issues proactively.

Combine a short survey (5-8 Likert items for a belonging score) with key outcome metrics (retention, absence, grievances) and follow-up listening. Segment results by team, grade, and demographics for deeper insights.

Inclusion focuses on fair participation, belonging on feeling accepted and part of the group, and engagement on effort and commitment. Belonging measures the actual felt experience, bridging inclusion and engagement.

Track direct sentiment (acceptance, safety), behavioral outputs (turnover, speaking up), equity gaps across groups, and qualitative clues from comments. This layered approach provides a comprehensive view.

Identify the weakest area (authenticity, voice, fairness, safety, connection) and target interventions there. For example, if authenticity is low, review meeting norms and manager capability. Assign ownership and measure again after changes.

Rate the article