There's a specific moment I've seen in almost every CRM engagement I've walked into. Someone pulls up a report — pipeline by stage, email performance, agent resolution rates, whatever the number is — and there's a pause. A flicker of something. Not quite confidence. Not quite doubt. Just a quiet, private uncertainty about whether the number on the screen actually means anything.
Nobody says it out loud. The meeting moves on. Decisions get made.
But that flicker? That's the symptom. And the disease underneath it is bad data.
CRM data quality is one of those problems that organisations know exists but find genuinely difficult to prioritise. It's not dramatic. It doesn't cause a single visible failure. It just quietly corrupts every decision made from it, every campaign built on top of it, and every report presented to leadership as evidence of how the business is doing.
The lie your CRM is telling you right now
Here's what I mean when I say your CRM is lying. It's not that the data is fabricated. It's that the data is incomplete, inconsistent, or structurally wrong in ways that make it look accurate while producing misleading conclusions.
A few examples from real engagements:
A marketing team runs an email campaign to their "warm leads" segment. The segment pulls 4,200 contacts. What nobody knows is that 800 of them are duplicates (same person imported twice under slightly different names), 600 are contacts who unsubscribed through a form that wasn't properly connected to the suppression list, and 300 are demo contacts from the initial HubSpot setup that were never deleted. The campaign goes out to 2,500 real warm leads — and 1,700 people who should never have received it.
A sales manager presents the monthly pipeline report. It shows $2.3M in deals at the "Proposal Sent" stage. What the report doesn't capture is that 40% of those deals haven't been updated in 90+ days, because sales reps update CRM records inconsistently — some after every call, some never. The real pipeline is somewhere around $1.4M. The manager presents the $2.3M number to the CEO. The CEO makes a hiring decision based on it.
A support team reports 94% CSAT. What the report doesn't capture is that only 18% of tickets triggered a CSAT survey, and surveys were more likely to go out on resolved tickets than escalated ones. The 94% is measuring a self-selecting sample biased toward satisfied customers. The real number is meaningfully lower.
Bad data doesn't just give you wrong answers. It gives you confident wrong answers — which is significantly more dangerous than no data at all.
5 symptoms of data rot — which ones do you have?
Data quality problems don't announce themselves. They accumulate quietly until they're so embedded in normal operations that people stop questioning whether the data is right and just accept that "the CRM is always a bit off." Here are the five most common signs I see:
1. People export to spreadsheets to "actually work with the data." When your team bypasses the CRM for real analysis, it's because they don't trust what the CRM shows them. The workaround is the symptom. The lack of trust is the disease.
2. No one can answer a basic segmentation question without a caveat. "How many active customers do we have?" "About 3,500 — but that might include some lapsed ones, I'm not sure." If basic questions about your own database require hedging, the data is in trouble.
3. Your email unsubscribe rate is creeping up. When contacts receive communications they never opted into, or get the same email multiple times, or get messages addressed to a name that's not theirs — they unsubscribe. Or worse, they mark it spam. Deliverability damage from bad data takes months to repair.
4. Sales reps have "their own system." A notepad. A personal spreadsheet. A folder of business cards. When CRM data is unreliable, experienced salespeople route around it and keep their own records. You lose visibility, you lose institutional knowledge when they leave, and you never get the data quality that comes from consistent input.
5. Reporting produces debates about methodology rather than decisions. If your monthly review meeting spends 20 minutes arguing about which numbers are right before anyone can discuss what to do about them — the data quality problem has reached leadership level.
What it's actually costing — beyond the obvious
Most organisations think about bad CRM data as an inconvenience. The real cost is significantly larger, and most of it is invisible on any standard P&L.
| Cost category | What it looks like | Typical impact |
|---|---|---|
| Marketing waste | Campaigns sent to wrong, duplicate, or disengaged contacts | 10–25% of budget |
| Sales inefficiency | Reps spend time qualifying leads that were never real | 3–5 hrs/rep/week |
| Deliverability damage | Spam complaints from contacts who shouldn't have been emailed | Months to recover |
| Bad strategic decisions | Hiring, investment, or pricing choices made from false pipeline data | Uncalculable |
| Team morale and trust | People stop using the CRM properly because it doesn't reflect reality | Self-reinforcing |
| After a proper cleanup | Reliable data, trusted reports, decisions made with confidence | Recoverable in weeks |
How to calculate the damage yourself
You don't need a consultant to get a rough sense of how much bad data is costing you. Here's a framework I use in audits that you can run yourself in an afternoon:
Step 1 — Duplicate rate. Take a random sample of 500 contacts from your CRM. How many have a duplicate record (same email, same company, slightly different name)? Multiply by your total contact count. If 12% of your sample has a duplicate, you probably have 12% excess contacts in your database — which means 12% of every campaign you send is going to people at least twice.
Step 2 — Critical field completion rate. Pick 5 fields that should be populated for every record (email, lifecycle stage, source, company, last activity date). What percentage of your contacts have all 5 filled in? Anything below 70% is a serious gap. Below 50% and your segmentation is essentially guesswork.
Step 3 — Lifecycle stage accuracy check. Pull 50 random contacts flagged as "Customer." Email or call 5 of them. How many are actually active customers? How many have churned, never purchased, or have no idea why they're in your database? This is uncomfortable — but it's the fastest way to understand how much you can trust your pipeline numbers.
Step 4 — Workflow audit. How many active workflows do you have? How many of them have been reviewed in the last 90 days? For each active workflow, can you name its purpose, its trigger, and its intended audience without opening it? If you can't answer those questions for 80% of your workflows, you have automation running on data you don't understand — and that combination is dangerous.
Where to actually start the fix
The instinct when you discover a data quality problem is to try to fix everything at once. That's how you end up six weeks into a cleanup project with a half-migrated database, confused stakeholders, and a CRM that's temporarily worse than when you started.
Start smaller. Start with the highest-impact, lowest-risk interventions first.
Freeze before you clean. Before touching any data, document what exists. Don't delete workflows you don't understand — pause them and create a log of what each one does. Deleted data is a lot harder to recover than paused automations.
Build your suppression list first. Identify every contact category that should never receive communications — unsubscribes, bounces, competitors, internal test contacts, demo data. Get that list locked down before any campaign goes out. This is the single highest-leverage action for protecting your sender reputation.
Deduplicate with rules, not guesswork. HubSpot's native deduplication tool handles most email-based duplicates. For anything more complex, tools like Dedupely or Insycle give you matching rules and merge logic you can review before applying. Never bulk-merge without a rollback plan.
Set up ongoing hygiene, not one-time cleanup. The most common mistake is treating data quality as a project rather than a process. Clean data requires continuous governance — regular deduplication runs, property completion reports, lifecycle stage audits. Build these into your monthly operations, not your quarterly sprints.
The businesses running on clean, well-governed CRM data make faster decisions, run more effective campaigns, and trust the numbers in their pipeline reviews. That's not because they have better technology. It's because someone made data quality a priority and kept it there.
If your team is working around the CRM instead of inside it — that's the signal. And it's fixable faster than most people think.
Suspect your CRM data is costing you? Let's find out.
I audit HubSpot and Zendesk portals, quantify the data quality gaps, and deliver a clear fix roadmap with timelines and fixed-fee pricing. You'll know exactly what's broken and exactly what it costs to fix before any work begins.
