A structured approach to identify growth via data
Revenue discovery is the systematic process of using existing historical data and real-time behavioral signals to identify unexploited financial value within a business ecosystem. For mid-market data teams, the ability to spot new revenue across fragmented systems is often the difference between stagnation and hitting the next growth milestone.
In our experience working with scaling SaaS and e-commerce companies, we have found that most organizations are sitting on a goldmine of information that remains trapped in siloed tools. To spot new revenue effectively, you must move beyond high-level dashboards and into granular, event-level analysis that correlates product usage with financial outcomes. This post outlines the technical frameworks and data engineering practices we use to help our clients move from reactive reporting to proactive revenue identification.
A systematic framework to spot new revenue within existing customer data
The most efficient way to increase ARR (Annual Recurring Revenue) is rarely through new customer acquisition alone. It is found in the gaps between how customers currently use your product and how they could be using it. When we conduct an AI Stack Audit for our clients, we often find that "revenue leaks" are actually invisible opportunities that simply haven't been quantified.
To spot new revenue using this approach, we look at three primary vectors:
- Feature-Value Correlation: Identifying which specific product features are most closely tied to account expansion.
- Churn Signal Inversion: Analyzing why customers leave and finding the "mirrored" behaviors of customers who stay and grow.
- Cross-Platform Arbitrage: Combining CRM (Customer Relationship Management) data with product telemetry to find users who are under-licensed for their actual usage volume.
Analyzing the Feature-Value Gap
Most data teams track feature adoption, but few map that adoption directly to the dollar value of the contract. By joining your Stripe or Chargebee billing data with your Segment or Mixpanel event data in BigQuery, you can build a model that shows exactly which features precede a seat upgrade.
| Opportunity Type | Data Sources | Key Indicator |
|---|---|---|
| Upsell Potential | CRM + Product Usage | User reaches 80% of a feature limit 3 months in a row |
| Cross-sell Match | Marketing Automation + Product | Customer uses Feature A but hasn't enabled Feature B (which often follows A) |
| Retention Recovery | Support Desk + Billing | High support ticket volume paired with low login frequency |
How to find new revenue opportunities using SQL and dbt
Finding the "white space" in your revenue requires more than just a BI (Business Intelligence) tool. It requires a robust transformation layer where business logic is codified. We use dbt (data build tool) to create "Revenue Opportunity" models that flag specific accounts for the sales team to contact.
In our work, we suggest building a dedicated dbt schema for revenue intelligence. This schema should not just mirror your source data; it should calculate composite scores for "Expansion Readiness."
Example dbt Logic for Expansion Signals
A common pattern we implement involves identifying accounts where the number of active users exceeds the purchased seat count or where usage growth is accelerating significantly.
-- models/marts/revenue/fct_expansion_leads.sql
WITH account_usage AS (
SELECT
account_id,
date_trunc(event_timestamp, MONTH) AS usage_month,
count(distinct user_id) as active_users,
sum(action_count) as total_actions
FROM {{ ref('stg_product_events') }}
GROUP BY 1, 2
),
account_subscriptions AS (
SELECT
account_id,
current_seat_count,
monthly_recurring_revenue
FROM {{ ref('stg_billing_subscriptions') }}
WHERE status = 'active'
)
SELECT
u.account_id,
s.current_seat_count,
u.active_users,
(u.active_users - s.current_seat_count) AS seat_deficit,
u.total_actions,
CASE
WHEN u.active_users > s.current_seat_count THEN 'Over-capacity'
WHEN u.total_actions > (SELECT avg(total_actions) * 1.5 FROM account_usage) THEN 'Power User'
ELSE 'Steady'
END AS expansion_category
FROM account_usage u
JOIN account_subscriptions s ON u.account_id = s.account_id
WHERE u.usage_month = date_trunc(current_date, MONTH)By surfacing this model to a Sales Ops dashboard, your team can find new revenue opportunities that were previously hidden in the logs. This shifts the data team from being a cost center to a direct contributor to the sales pipeline.
Strategies to spot untapped revenue in your marketing funnel
Marketing attribution is often treated as a way to prove ROI (Return on Investment), but its true power lies in identifying where spend is failing to capture available demand. To spot untapped revenue, we look for "high-intent, low-conversion" segments.
These are users who enter the funnel through high-intent keywords or landing pages but drop off before the demo stage. Often, the data shows that these users come from a specific industry or use a specific tech stack that your product supports, but your automated messaging isn't addressing those specific needs.
Identifying High-Value Segments
When we build data foundations for our clients, we focus on enriching the lead data early. By using APIs (Application Programming Interfaces) like Clearbit or Apollo, we can join firmographic data with funnel behavior. If you see that "Series B Fintech Companies" have a 50% higher LTV (Lifetime Value) but a lower conversion rate from lead to MQL (Marketing Qualified Lead), you have found a data driven revenue opportunity. The fix might be as simple as creating a dedicated landing page for that segment or adjusting the lead scoring weights.
Ready to fix your data foundation?
Book a free diagnostic call and find out where your stack stands.
Book a CallLeveraging predictive models for data driven revenue opportunities
Once you have a clean data foundation (BigQuery, dbt, and a primary BI tool), you can move into predictive analytics. Predictive modeling is not about magic; it is about probability. We help teams build "Next Best Action" models that tell account managers which customer is most likely to buy an add-on product this month.
To find these data driven revenue opportunities, we often look at:
- Temporal Patterns: Does expansion usually happen in month 4 or month 10 of a contract?
- Persona Overlap: Does adding a "Financial Controller" user role to an account usually lead to a higher-tier plan upgrade?
- Integrations: Accounts with 3 or more active integrations typically have a 20% higher retention rate and 15% higher expansion rate.
By quantifying these patterns, the data team can provide the CRO (Chief Revenue Officer) with a prioritized list of accounts. This is far more effective than the standard "shotgun" approach to sales outreach.
Why data quality is the foundation of revenue discovery
You cannot spot new revenue if you do not trust your data. In our experience, many revenue opportunities are missed because of duplicate records in the CRM or broken event tracking in the product. If one system says a customer is "churned" and another says they are "active," the sales team will likely ignore the account entirely to avoid an awkward conversation.
We advocate for a "Data Contract" approach where the engineering team and the data team agree on the schema and meaning of critical revenue events. This ensures that when your dbt models flag an opportunity, it is based on reality.
Critical Revenue Data Checklist
Before you can reliably spot new revenue, ensure your stack handles these four areas:
- Identity Resolution: A single user ID must persist across the website, the product, and the support desk.
- Timestamp Standardization: All events should be recorded in UTC to avoid "impossible" sequences of events in global companies.
- Null Management: Your models must account for missing billing data without breaking the entire expansion pipeline.
- UAT (User Acceptance Testing): The sales team should validate a subset of the "opportunities" to ensure the logic matches their on-the-ground experience.
Building the infrastructure for continuous revenue discovery
Scaling a data team involves moving from "one-off" requests to automated systems. A production-ready revenue intelligence system requires more than just a few SQL queries; it needs a managed infrastructure.
We recommend using Terraform to manage your data warehouse permissions and dbt to manage your transformation logic. This ensures that your revenue models are version-controlled and peer-reviewed. When we work with data teams in our Learn AI Bootcamp, we emphasize that the most valuable AI agents are those that act on high-quality, transformed data to automate the outreach for these revenue opportunities.
Imagine an AI agent that monitors your "Expansion Leads" dbt model and automatically drafts a personalized email for the account manager, including the specific usage stats that justify the upgrade. This is the level of automation that separates modern data teams from traditional reporting departments.
Frequently Asked Questions About Revenue Discovery
How can a data team help the sales department spot new revenue?
A data team can help the sales department by building "propensity to buy" models. These models analyze historical data to find behaviors that preceded past upgrades, such as a sudden increase in API calls or a new department joining the platform. By surfacing these leads in the CRM, sales reps can focus on high-probability targets.
What are the most common places to find untapped revenue?
Untapped revenue is most commonly found in "zombie" accounts that are paying for a low-tier plan but have high usage volume, and in high-churn segments that could be saved with targeted product interventions. Additionally, cross-selling features to customers who already use complementary features is a major source of growth.
Do we need a dedicated data scientist to find revenue opportunities?
No, most revenue opportunities can be found using standard SQL and a well-structured data warehouse. A skilled Analytics Engineer can build the models needed to identify expansion and churn risks. A data scientist is only required once you move into complex, multi-variable predictive modeling or deep learning for sentiment analysis.
How does dbt help in spotting new revenue opportunities?
dbt allows you to codify business logic (like what defines a "Power User") in one place. By centralizing this logic, you ensure that everyone from the CFO to the Account Executive is looking at the same definition of an opportunity. It also allows you to test your logic and ensure that your revenue flags are accurate.
What is the first step to becoming a more revenue-focused data team?
The first step is to perform a deep join between your financial data (Stripe, HubSpot) and your product usage data. Identifying the gap between what customers pay and how much they use the product is the fastest way to prove the value of your data team to the executive suite.
Ready to build your revenue intelligence engine?
Moving from static reports to a system that actively identifies growth requires a solid data foundation and a clear strategy. If you are a leader on a scaling data team, we can help you accelerate this transition.
We cover these exact workflows in our Learn AI Bootcamp, where we help teams move from basic SQL to production-grade AI agents and automated revenue discovery systems. If you want a more targeted assessment of your current capabilities, our AI Stack Audit provides a comprehensive roadmap for your data infrastructure. Or, if you prefer to discuss your specific architecture and goals, book a free consultation with our team.