Waterfall enrichment cascades a contact request through multiple data providers until a verified email or phone returns. The operator playbook orders the cascade by cost first, accuracy second, and deep search last, which lifts coverage from 55 to 85 percent and lands verified contacts for roughly $0.06 to $0.18 each at a thousand row volume.

Most operators do not run a real waterfall. They run Apollo first, sigh when half the rows come back empty, paste the leftovers into Hunter, then call it a strategy. The cost model is invisible. The hit rate is whatever it is. The cascade was never designed, it just accreted.

This is the operator playbook. What waterfall enrichment actually is, the five provider layers worth knowing, the order that minimizes cost without giving up coverage, the geographies that flip the order on its head, and the operating system pattern that runs the whole cascade from one prompt.

What waterfall enrichment actually is

Waterfall enrichment is a sequential lookup. You send a row (a name, a company, sometimes a LinkedIn URL) into a stack of data providers, in a defined order. Provider one tries to return the email or phone. If it does, the cascade stops and you pay only that provider. If it does not, the row falls through to provider two, then three, until either a verified contact returns or the row exhausts the stack.

The pattern matters because no provider covers the whole market. Apollo is strong on US tech contacts and weak on French manufacturing. ZoomInfo is strong on US enterprise and weak on bootstrapped European SaaS. Hunter is fast and cheap and only deep on public domain patterns. Stack the right providers in the right order and you compound their strengths without paying for all of them on every row.

The single source alternative is one vendor for every row, accepting whatever coverage that vendor happens to have. According to FullEnrich's published numbers, single source typically returns work emails for 40 to 60 percent of a B2B list, and a configured waterfall lifts that to 80 percent or higher. That gap is the whole game.

If you are new to the broader enrichment landscape, the lead enrichment overview explains the field categories (emails, phones, firmographics, intent) before this article narrows in on the waterfall pattern specifically.

Why waterfall enrichment beats single source enrichment

Three reasons the cascade wins, and they compound.

The first is coverage. A list that gets 55 percent coverage from a single vendor leaves 45 percent of your ICP unreachable. If you spent two weeks building that target list, you just threw away two weeks of work on the half you cannot contact. Waterfall recovers most of that 45 percent by routing to a different provider that happens to have the row.

The second is cost discipline. The cascade only charges you for the layer that succeeded. If your cheap first provider returns the email on 60 percent of rows, you never pay your expensive deep search provider for those 60 percent. Most operators assume waterfall is more expensive because it touches more vendors. The math runs the other way as long as you order the cascade correctly.

The third is data quality through consensus. When two providers in the cascade return the same email, that is a signal the email is real. When they disagree, you have a tie breaker decision to make and you can route the row to verification. Single source enrichment cannot give you that signal because there is no second opinion.

The classic counterargument, raised most directly in Cognism's pros and cons piece, is that mixing providers creates compliance risk in regulated regions and pushes you into murkier consent territory. The argument has teeth in EU outbound specifically, and we will come back to it in the geography section.

The five provider layers and what each one is best at

You do not need fifteen providers in a cascade. You need five layers, each with a job, and one provider per layer that fits your ICP.

Layer one, cheap pattern based. Hunter, Snov, Prospeo. These tools guess emails from a name plus a domain using public patterns ("firstname.lastname@", initials, dot variants) and verify the guess with an SMTP check. Cost per successful find is near zero. Hit rate is real on companies with predictable email schemas and falls apart on companies that use generic catch all addresses or rotating aliases.

Layer two, bulk B2B database. Apollo, ZoomInfo, Lusha. These vendors maintain enormous proprietary contact databases. The unit cost per email is small if you have a seat plan, larger if you pay per credit. Coverage is strong on US mid market and weak on EU and APAC depending on the vendor.

Layer three, specialist API. Crustdata, People Data Labs, Datagma. These are API first contact and firmographic providers, often with stronger international coverage than the bulk databases, and built to plug into agent workflows rather than a UI. The unit cost is higher than layer two, the data is often fresher.

Layer four, bundled waterfall. FullEnrich, BetterContact, Clay's waterfall. These are managed waterfalls that internally cascade through 10 to 20 providers and charge you only for verified results. They are themselves a waterfall, which means they are best used as a single layer inside your own waterfall, not as the whole pipeline.

Layer five, deep search. ContactOut, Lusha Mobile, Selligence, mobile phone specialists. The most expensive lookups in the cascade, used to recover the 10 to 20 percent of rows that nothing else found, or to find direct dial mobile phones (the highest cost field by an order of magnitude).

Two design notes. First, layers two and three are interchangeable for most cascades and depend on ICP shape (bulk for US tech mid market, API specialist for international or signal heavy work). Second, layer five should never run on every row. It runs on the 10 to 20 percent of rows that survived four prior layers, otherwise it eats the budget.

Building the cascade in the right order

The order is cost first, accuracy second, deep search last. Pricing dictates the structure because the whole point of the cascade is to stop on the cheapest provider that returns a verified result.

Start the cascade with layer one. Hunter and pattern based finders return results on something like 30 to 50 percent of rows at near zero marginal cost. If you skip this layer because the absolute hit rate is mediocre, you are paying layer two for rows that layer one would have caught for free.

Run layer two next. Apollo or ZoomInfo for US heavy work, Lusha for direct dials in some EU regions. This is the workhorse layer and most of your coverage will come from here.

Layer three is your specialist routing. If your target list is heavy on European or APAC contacts, swap layer two and three. The bulk databases get weaker, the API specialists get stronger. Crustdata is built for this kind of programmatic cascade work and integrates naturally with signal based outbound when you want to add a hiring or funding trigger on top of the row.

Layer four runs only on rows that survived the first three layers. This is where you pay a managed waterfall to do its own waterfall, which sounds redundant and is actually the cheapest way to consume that bundled service.

Layer five runs on the residual, only for the fields where the cost is justified by the value. A verified mobile phone for a target VP at an active buying signal is worth $1. A mobile phone for a row your SDR will never call is worth zero.

A common mistake worth flagging here is running every layer in parallel and accepting the first reply. That trades cost discipline for speed and only makes sense when the row is time critical (an active web visitor, a fresh job change). For batch enrichment, sequential is cheaper.

Cost per verified contact math

Here is the math nobody publishes. Take a list of 1000 rows you want to enrich for verified work emails.

Layer one hits 35 percent. Cost per layer one find is roughly $0. Running total: 350 verified emails, $0 spent.

Layer two runs on the remaining 650 rows. Hit rate on the survivor pool drops to 50 percent (the easy rows already cleared). Unit cost is around $0.10 per find. Running total: 350 + 325 = 675 verified emails, $32.50 spent.

Layer three runs on the remaining 325 rows. Hit rate around 40 percent. Unit cost around $0.20 per find. Running total: 675 + 130 = 805 verified emails, $58.50 spent.

Layer four runs on the remaining 195 rows. Hit rate around 50 percent through a bundled waterfall. Unit cost around $0.15 per find (these vendors only charge on success). Running total: 805 + 97 = 902 verified emails, $73.10 spent.

Layer five runs on the final 98 rows. Hit rate around 25 percent. Unit cost around $0.50 per find. Running total: 902 + 24 = 926 verified emails, $85.10 spent.

That is roughly $0.09 per verified email across a 1000 row list, with coverage at 92.6 percent. The naive comparison is running ZoomInfo single source at a flat seat cost: similar dollar spend, coverage at 55 to 65 percent, and you are out of pocket on the rows that returned nothing. The Datablist analysis of revenue impact, which models a 45 percent revenue lift from raising coverage from 55 percent to 80 percent, shows what the missing rows cost in pipeline terms.

The build versus buy decision sits inside this math. Building the cascade yourself with five separate API contracts costs roughly a week of engineering plus monthly maintenance, and the savings appear only if you push more than a few thousand rows a month. Below that volume, a managed waterfall like FullEnrich is cheaper than the engineering time.

FullEnrich as the bundled orchestrator

FullEnrich is the cleanest example of layer four built right. It cascades through 15 plus underlying providers on every request, charges you only on verified return, and bundles work email, personal email, and mobile phone into one credit ledger so you do not maintain five vendor contracts.

The current credit math, fetched from the FullEnrich pricing page on 2026-06-05: 1 credit per verified work email, 3 credits per personal email, 10 credits per mobile phone, 0.25 credits for person and company enrichment standalone. The free trial is 50 credits and paid plans start around $29 per month for 500 credits, with unused credits rolling over for three months on monthly plans and a year on annual plans.

What that means in operator terms. At Starter, you can verify 500 work emails for $29, which lands you at $0.058 per verified email, roughly in line with the cascade math above. Phones cost ten times more because phones are ten times more expensive to source. Personal emails sit in between because personal email databases are smaller and slower to refresh.

Two real cautions. The first is that FullEnrich is itself a waterfall, so dropping it as layer four inside a larger waterfall (one that started with Hunter and Apollo) means you are paying a managed waterfall to do less work than it is priced for. That is fine economically, you only pay on success, but it is worth being explicit so you do not run FullEnrich twice on the same row. The second is that the bundled waterfall is opaque on which underlying provider returned the result, which matters less for outbound and more for compliance reporting in EU work.

Hit rate tuning by ICP and geography

The cascade order is not fixed. ICP and geography rewrite it.

US tech mid market. Apollo or ZoomInfo as layer two is almost always the right call. Hunter as layer one catches the predictable domains. Crustdata or PDL as layer three for the rows the bulk databases miss. Layer four and five as residual cleanup. Expect 85 to 95 percent coverage.

EU mid market. This is where ordering breaks. Apollo and ZoomInfo are weaker on European contacts, Lusha is strong on certain markets, Cognism is the proprietary single source player that GDPR conscious teams default to. A defensible cascade is Hunter (layer one), Cognism or Lusha (layer two), Crustdata (layer three), FullEnrich (layer four). Skip the most aggressive deep search providers for EU outbound where consent and source matters for compliance.

APAC mid market. Most US centric databases are thin here. The cascade often starts at layer three (API specialists with stronger international coverage) and ends at layer five faster than US work does. Coverage tops out lower (typically 65 to 80 percent) because the underlying data simply does not exist at US density.

Late stage enterprise. ZoomInfo or Apollo lead, but the deep search layer (mobile specialists) matters more because the decision makers you want to reach use direct dials, not the office switchboard. Expect to pay more per row, and only run the cascade after a tight ICP filter so you are not enriching unqualified accounts at premium pricing.

Bootstrapped or seed stage targets. None of the bulk databases will reliably have these companies. Layer one (pattern based) and layer three (API specialists with web crawling like PDL) carry most of the load. Layer two is often a waste of credits because the company does not exist in the bulk database yet.

The honest version of geography tuning is that you do not nail it on the first cascade run. You sample 200 rows, measure hit rate per layer, and reorder. Then you run the rest. Operators who skip the sample run pay for a misordered cascade across the full list.

Common mistakes that blow the cost model

Five failure modes, in order of how often they happen.

Enriching before deduping. A 5000 row export from Sales Navigator typically contains 20 to 30 percent duplicates and another 15 to 20 percent out of ICP rows. Running the cascade on those rows spends real money returning data you will throw away. Deduplicate by company and by contact, run the ICP filter, then enrich.

Enriching fields the message will never use. If your outbound sequence does not reference job tenure, do not enrich job tenure. The waterfall has a credit for every field you add. Operator first mile work is deciding which fields actually drive the message. Everything else is fat.

Running every provider in parallel for batch work. Parallel is for time critical signal triggered enrichment (a hot web visit, a fresh hiring announcement). For batch enrichment, sequential is cheaper because the cascade stops on the first hit instead of paying every provider that found the same row.

Treating verification as optional. A verified email is not just an email pattern. It is an email that an SMTP check confirmed deliverable. Skipping verification ships bounces into HubSpot and into your sending infrastructure, which costs you sender reputation and eventually deliverability. Every layer in your cascade should verify before returning.

Forgetting the human review checkpoint on ambiguous rows. When two providers return different emails for the same person and verification likes both, surface the row for a human glance. Three seconds of operator attention beats a wrong message sent to the wrong inbox.

Yalc as the skill that runs the waterfall

Most teams build the cascade as a Make or Zapier or n8n graph: provider one node, conditional, provider two node, conditional, deduper, verifier, CRM writer. The graph works until somebody changes a provider API or until somebody else on the team needs to edit the flow. At 30 to 40 nodes, the graph is unreadable and edits cascade in unintended ways.

The alternative is to run the cascade as a markdown configured skill on top of an operator OS. Each layer is a function. The order is a config file. The CRM write is a function. The whole pipeline reads like a recipe and edits like one. When Apollo deprecates an endpoint, you edit one function in one file. When a row pattern fails, you read the log instead of debugging a node graph.

This is the architecture pattern the operator playbook for B2B lead generation sits inside. Humans own first mile (which fields to enrich, which ICP slice to target) and last mile (the call, the reply, the deal). The waterfall is middle mile work, the kind of mechanical orchestration that compounds when you let an operating system run it instead of paying a SaaS UI to host it.

Yalc is one example of this pattern. Markdown configured, locally installed, talks to Crustdata, FullEnrich, and HubSpot through real APIs, runs the waterfall from one Claude Code conversation. Every row enriched, every provider tried, every reply tagged feeds the next run with sharper data. The cascade gets cheaper over time because the operator learns which providers to skip for which ICPs and edits the config file accordingly.

The operator template for a thousand contact run

Run this checklist before you start the cascade.

  1. Pull the raw list (Sales Nav, Crustdata search, signal trigger export). Get the raw row count.
  2. Deduplicate by LinkedIn URL or by company plus name. Cut roughly 20 percent of rows.
  3. Apply the ICP filter (industry, size, geography, signal). Cut another 15 to 25 percent.
  4. Decide which fields the outbound message actually needs. Usually work email plus maybe LinkedIn URL. Mobile phone only for the highest tier rows.
  5. Sample 200 rows through the cascade. Measure hit rate per layer.
  6. Reorder the cascade if a layer is paying too much for too little. Geographic edits live here.
  7. Run the rest. Verify every result before writing to HubSpot.
  8. Log the final coverage and cost per verified contact. Use those numbers to refine the next run.

The discipline is in the first four steps, not in the cascade itself. The cascade is mechanical once the inputs are clean. Operators who skip steps one through four pay two to three times more per verified contact than operators who do them properly.

That is what running a real waterfall enrichment cascade looks like. Not five separate vendor logins and a Friday afternoon of CSV merging. One operator OS, one config file, one prompt, and a coverage number that climbs every time you iterate.

FAQ

What is waterfall enrichment?

Waterfall enrichment is a sequential lookup pattern where a contact row is sent through multiple data providers in a defined order. The cascade stops on the first provider that returns a verified email or phone, so you only pay for the layer that succeeded. The pattern raises coverage from 40 to 60 percent (typical single source) to 80 percent or higher.

How does waterfall enrichment work?

You configure an ordered list of data providers. Each row enters the cascade at provider one. If provider one returns a verified result, the cascade stops and you are charged only that provider. If not, the row falls through to provider two, then provider three, until the row is enriched or the stack is exhausted. The cascade is usually run in batches and is most cost efficient when the cheapest providers run first.

How many data providers should you include in a waterfall?

Five layers is the practical ceiling for most B2B operators. One pattern based finder (Hunter), one bulk database (Apollo or ZoomInfo), one API specialist (Crustdata or PDL), one bundled waterfall (FullEnrich), and one deep search (mobile or hard to reach data). Adding more than five rarely improves coverage and always slows the cascade.

How much does waterfall enrichment cost?

At a thousand contact volume, a well ordered cascade lands verified work emails for roughly $0.06 to $0.18 each. Mobile phone numbers cost roughly ten times that per verified result because phone data is harder to source. A managed waterfall like FullEnrich starts at $29 per month for 500 credits, where one work email is one credit and one mobile phone is ten credits.

Should you build a waterfall enrichment system yourself or use a bundled tool?

If you run more than a few thousand enrichments per month and you want compliance reporting, geographic routing, or signal triggered runs, building the cascade yourself on an operator OS is cheaper and more flexible. Below that volume, a managed waterfall like FullEnrich is cheaper than the engineering time you would spend on integration.

What types of contact information can waterfall enrichment find?

Verified work emails, verified personal emails, verified mobile phone numbers, direct dials, LinkedIn URLs, job titles, tenure, firmographic data (revenue, headcount, funding), and technographic data (what software the company uses). Each field has its own cascade and its own cost curve. Phones cost more than emails, personal emails cost more than work emails, mobiles cost the most.

Is waterfall enrichment better than LinkedIn Sales Navigator alone?

Sales Navigator is a search and targeting tool, not an enrichment tool. It returns names, titles, and company context but does not give you verified work emails or phone numbers. A real outbound pipeline uses Sales Navigator (or a Crustdata search) as the sourcing layer and a waterfall as the enrichment layer that turns those names into contactable records.

What happens when providers in the cascade return conflicting data?

When two providers return different emails for the same person, route the conflict to verification. If both verify, surface the row for a human review checkpoint. If one verifies and the other does not, take the verified one. If neither verifies, drop the row. Conflicting data is a signal, not a failure mode, and the cascade should treat it that way.