Pricing

Willingness-to-Pay Survey Design for SaaS Pricing

Design willingness-to-pay surveys that produce actionable pricing intelligence. Covers Van Westendorp, Gabor-Granger, and conjoint analysis — with the tradeoffs, sample requirements, and interpretation framework for each method.

SaaS Science TeamMay 31, 20269 min read
willingness to paysaas pricingpricing researchvan westendorpconjoint analysis

Pricing research built on instinct produces pricing built on assumptions. WTP surveys replace assumptions with quantified demand data — specific, segmented evidence about what buyers will pay, for which configurations, at what conversion probabilities.

The limitation of WTP surveys is that they measure stated preference, not revealed preference. People say they will pay more than they actually do. Managing that gap — through survey design, sample selection, and calibration — is the difference between WTP research that guides good decisions and research that produces confident-looking bad decisions.

See Your Growth Ceiling NowTry Free

Choosing the Right WTP Method

Three methods cover 95% of SaaS WTP research needs:

Van Westendorp Price Sensitivity Meter

Best for: Early-stage pricing discovery, single-product pricing, teams without research budget for sophisticated methods

How it works: Four questions per respondent:

  1. At what price would this product be so cheap that you would question its quality?
  2. At what price would this product be a good deal — you'd buy without hesitation?
  3. At what price would this be getting expensive, though you'd still consider it?
  4. At what price would this be too expensive to consider?

Plot the cumulative distribution of answers for each question. The intersections identify:

  • PMC (Point of Marginal Cheapness): where "too cheap" responses first exceed "not a bargain" — below this, you're signaling low quality
  • PME (Point of Marginal Expensiveness): where "too expensive" responses first exceed "not expensive" — above this, you're in price resistance territory
  • IDP (Indifference Price Point): where "bargain" = "expensive" — this is the psychological midpoint of acceptability
  • OPP (Optimal Price Point): where "too cheap" = "too expensive" — the most defensible price from both directions

Sample requirement: 50–150 respondents Time to run: 3–7 days Cost: Low — can be executed via Typeform or SurveyMonkey

Limitation: PSM does not model feature bundles, plan tiers, or trade-offs. It tells you the acceptable range for a single price point, not which configuration of features buyers prefer at different price levels.

Gabor-Granger Analysis

Best for: Direct demand curve measurement, identifying revenue-maximizing price points, validating a specific price change

How it works: Present each respondent with a single price point and ask: "If this product cost $X per month, would you purchase it?" Vary the price across respondents (sequential presentation within a respondent also works but introduces anchoring bias).

Example price sequence: $29, $49, $79, $99, $149, $199

From the responses, build a demand curve: at $29, 72% would purchase; at $49, 58%; at $79, 41%; at $99, 30%; at $149, 18%; at $199, 10%.

Revenue index at each price: $29×72 = $2,088; $49×58 = $2,842; $79×41 = $3,239; $99×30 = $2,970; $149×18 = $2,682; $199×10 = $1,990.

The revenue-maximizing price in this example is $79. Price points above or below produce less total revenue in expectation.

Sample requirement: 50 per price point tested (250+ for 5 price points) Time to run: 5–10 days Cost: Low to moderate

Limitation: Does not capture feature bundle effects or how plan structure affects purchase probability.

Conjoint Analysis

Best for: Feature-to-tier assignment, plan structure optimization, multi-attribute pricing decisions

How it works: Present respondents with pairs or sets of product configurations — each with different feature combinations and price points — and ask which they prefer. Statistical decomposition of choices reveals the value (part-worth utility) each attribute contributes.

For SaaS plan structure, conjoint reveals:

  • Which features drive the most incremental WTP
  • Which plan configurations are dominated (preferred by no segment)
  • How price sensitivity varies by feature bundle
  • The optimal plan structure for maximizing coverage across willingness-to-pay segments

Sample requirement: 150–400 respondents Time to run: 2–4 weeks (design, fielding, analysis) Cost: Significant — requires survey design expertise and statistical analysis software (Sawtooth, Qualtrics)

Calibrating Survey WTP to Real Purchase Behavior

Stated WTP is systematically higher than revealed WTP. The standard calibration factors:

Survey ContextCalibration Factor
Anonymous web survey, general population0.60–0.65 (multiply stated WTP by 0.60–0.65)
Panel survey, qualified demographics0.65–0.70
In-person or phone interview with product demo0.75–0.80
Survey of existing paying customers0.80–0.85
Survey of qualified prospects, active evaluation0.80–0.85

The calibration factor is applied to the WTP estimates before using them to set prices. If your Gabor-Granger analysis suggests a revenue-maximizing price of $99 from an anonymous web survey, the calibrated estimate is $59–$64 for actual purchase behavior. Setting at $99 based on raw survey data would exceed actual WTP for most buyers.

These calibration factors derive from academic research comparing stated and revealed preferences across pricing studies, including work published in the Journal of Marketing Research and replicated in commercial pricing research contexts.

Segmentation: Where the Real Intelligence Lives

Aggregate WTP results mask the variance across buyer segments that is most actionable for pricing architecture.

Run WTP analysis separately for:

  • Company size: enterprise buyers typically show 3–5× the WTP of SMB buyers for identical products
  • Use case: different use cases have different ROI density and therefore different WTP
  • Acquisition channel: self-serve organic signups have lower WTP than sales-qualified inbound
  • Role: economic buyer WTP exceeds technical evaluator WTP by 20–40%

When segments show significantly different WTP distributions, this is the signal for plan differentiation. If SMB buyers cluster around $49–$79 and mid-market buyers cluster around $149–$249, you have two tiers with a clear price boundary. Offering only one price point is leaving revenue on the table from mid-market buyers while still charging SMB buyers at their upper bound.

This segmentation analysis feeds directly into pricing model decisions — the WTP distribution by segment determines whether a good-better-best tier structure or a seat-based structure better captures value across the customer range.

Qualitative Follow-up: Understanding the Numbers

WTP surveys tell you what; qualitative interviews tell you why. The combination is far more actionable than either alone.

After analyzing quantitative WTP data, conduct 5–10 interviews with respondents whose answers are in the highest-WTP segment and 5–10 with those in the lowest-WTP segment. The questions:

  • What would you be able to do with this product that you cannot do today?
  • How does the value compare to the alternatives you are currently paying for?
  • What would make you feel that the price was completely justified?
  • What would make you feel that the price was too high?

The high-WTP interviews reveal the value framing, use cases, and ROI narratives that your highest-paying customers use to justify the purchase. These become your pricing page messaging and the proof points that reduce price objection friction.

The low-WTP interviews reveal the objections, competitive alternatives, and value gaps that explain price resistance. These inform either product investment (to close the value gap) or buyer qualification criteria (to avoid these buyers in your sales motion).

For how this connects to systematic pricing page optimization, see pricing page conversion experiments — the qualitative findings from WTP interviews are the raw material for the copy and structure tests that lift conversion.

Embedding WTP Signals in Product Instrumentation

Survey-based WTP research is a point-in-time measurement. Behavioral signals embedded in product instrumentation provide continuous WTP monitoring — no survey required.

Feature engagement depth as WTP proxy. Users who actively use premium or high-value features have higher WTP for the plan that includes those features. Track feature usage by plan tier and identify which features in the free or entry tier have high engagement — those are candidates for tier migration (moving them to a higher tier to capture the revealed WTP).

Pricing page scroll and engagement. Track what percentage of pricing page visitors scroll past each plan, hover over feature comparisons, and click on "see all features" toggles. Users who spend 90+ seconds on the pricing page and engage with comparison tables have higher WTP than those who bounce after 10 seconds. The engagement distribution gives you a proxy for how many visitors are genuinely evaluating vs. passively browsing.

Trial feature usage before churn vs. conversion. Analyze which features converted users (who became paying customers) used during their trial vs. which features churned trial users used. The features with the highest differential (much higher usage among converters) are the ones that drive perceived value — and therefore WTP. Pricing tiers that gate these features will command higher prices with lower conversion resistance.

Support ticket content analysis. Price objections in support tickets ("too expensive," "looking for alternatives," "comparing pricing") are revealed WTP signals. The frequency and customer segment of these tickets identifies which segments are at the edge of their WTP with current pricing — either candidates for a lower-priced tier or candidates for better value communication.

These behavioral signals do not replace quantitative WTP surveys but they provide continuous, unsolicited data that triangulates with survey findings. A segment that shows high feature engagement AND low pricing objection tickets AND high conversion rates is a segment with strong WTP alignment at current prices. A segment that shows high engagement AND high pricing objection tickets AND low conversion rates is a segment where WTP is below current pricing — a signal for either a new tier or better value messaging.

See Your Growth Ceiling Now

Calculate when your SaaS growth will plateau — free, no signup required.

Calculate Your Growth Ceiling

Conclusion

WTP surveys are a force multiplier for pricing decisions. They convert pricing from a debate about what feels right into a data-driven process anchored in actual buyer economics.

The design choices that determine quality: method selection matched to research question, sample selection from the right buyer population, calibration factors applied before acting on results, and segmentation analysis that reveals where plan differentiation should happen.

The most common failure mode is treating WTP survey results as literal purchase prices rather than distributional signals that require calibration and interpretation. Raw survey output is input to pricing judgment, not a replacement for it.

Run the survey, calibrate the results, segment the distribution, and then interview the extremes. That combination produces pricing intelligence that survives contact with actual buyers.

Frequently Asked Questions

What is willingness-to-pay (WTP) research in SaaS?
WTP research measures what customers and prospects would actually pay for your product or a specific configuration of features. It produces a distribution of price acceptability — not a single number — that informs where to set prices, how to structure tiers, and which features justify premium positioning.
What is the Van Westendorp Price Sensitivity Meter?
Van Westendorp PSM uses four questions to identify an acceptable price range: at what price is the product too cheap to trust? at what price is it a bargain? at what price is it getting expensive? at what price is it too expensive to consider? The intersection of response curves defines the Range of Acceptable Prices (RAP) and the Indifference Price Point (IPP).
What is Gabor-Granger analysis?
Gabor-Granger presents respondents with specific price points and asks whether they would purchase at that price. By varying the price systematically across respondents, it produces a demand curve showing purchase probability at each price point. The revenue-maximizing price is where price × probability is maximized.
How much does stated WTP overestimate actual purchase behavior?
Research consistently shows stated WTP exceeds revealed WTP (actual purchase behavior) by 20–40%. The adjustment depends on commitment depth: anonymous web survey responses require a 35–40% downward adjustment; responses from qualified ICP prospects with real purchase context require a 15–20% adjustment. Always apply a calibration factor before using survey WTP to set prices.
How many respondents do you need for a WTP survey?
Van Westendorp: minimum 50, recommended 100–150 for stable curves. Gabor-Granger: minimum 50 per price point tested, so 5 price points requires 250 total responses. Conjoint analysis: minimum 150, recommended 200–400 depending on attribute count. For B2B SaaS where ICP access is limited, Van Westendorp or Gabor-Granger are often the only feasible options.
Should you survey existing customers or prospects for WTP research?
Both, separately. Existing customers reflect committed buyers who have already decided your product is worth something — they will give you higher WTP estimates and are more willing to pay price increases. Prospects reflect uncommitted evaluators who are more price-sensitive. The gap between the two populations tells you about your pricing power with current customers vs. new customer acquisition.
What is MaxDiff and how does it differ from conjoint in pricing research?
MaxDiff (maximum difference scaling) measures the relative importance of features or plan elements rather than the price at which bundles are purchased. It is used to determine which features should be in each tier and which drive the most willingness to pay, before running conjoint to estimate the bundle price. The two methods are complementary, not competitive.

Related Posts