FAQ

Frequently Asked Questions

Get answers to common questions about Pave.
For more details, feel free to contact us.

Getting started

What problem does Pave solve?

Pave ingests raw bank transactions, balances, and other signals, cleans and enriches them into decision‑ready attributes, and applies product‑specific scores so you can approve more good borrowers while reducing losses. In practice, this yields higher‑quality approvals, fewer bad outcomes, and faster time‑to‑decision without extra engineering burden.

How is Pave different from an aggregator or a BI tool?

Aggregators move raw data but don’t make decisions; BI tools visualize inputs but don’t resolve data‑quality and modeling gaps. Pave turns raw feeds into reliable attributes, scores, and explainability designed specifically for underwriting and collections, reducing manual triage and tightening the loop from insight to production action.

Which product segments are covered (PL, EWA, SMB)?

Consumer personal loans, earned‑wage access/cash advance, and SMB credit. Each has domain‑specific attributes (for example, next‑pay and pay‑frequency for EWA; affordability and DSCR‑style trends for SMB) and score configurations.

What are the fastest first wins?

Stronger income detection to reduce false negatives and lift approvals, ACH timing to cut returns and fees, and threshold tuning on top cohorts to unlock yield with bounded risk. We stage these in weeks, then expand with KPI guardrails.

Can non‑technical stakeholders self‑serve?

Yes. Portfolio and cohort views, deciles, and reason codes enable policy evaluation without SQL. Leaders get KPI summaries, analysts export artifacts, and engineers wire a stable endpoint—supporting fast pre‑reads and decisive live reviews.

Which KPIs matter most during evaluation?

Approvals, delinquency and returns, expected loss, net yield, CAC‑to‑approval, and time‑to‑value. We structure readouts against these KPIs with attribution to policy changes.

Data sources and enrichment

What data sources do you support?

We start with your reality: bank transactions and balances (aggregators or statements), then layer in credit bureau features, loan tapes, processor events, and self‑reported data. You can begin with SFTP/warehouse exports and graduate to Pave APIs for real‑time attributes and scoring as you scale.

What transaction history window is required?

We recommend 6–12 months to capture seasonality and pay cycles. If you only have 90 days, we can produce directional results and clearly call out confidence limits. During rollout, feature windows are standardized and point‑in‑time labeling is documented for reproducibility.

How do you handle multi‑account users and co‑borrowers?

We unify accounts at the borrower level, deduplicate overlapping signals, and prevent double counting inflows that move between a user’s own accounts. For co‑borrowers, we expose combined‑ and per‑borrower views so policy can consider shared obligations while retaining auditability.
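
As a rough illustration of the deduplication step (not Pave's actual pipeline), the sketch below matches an inflow to an equal, near‑in‑time outflow from another of the same borrower's accounts and excludes it from income‑style aggregates; the record shape, amount tolerance, and time window are all assumptions.

```python
from dataclasses import dataclass
from datetime import date

@dataclass
class Txn:
    account_id: str
    posted: date
    amount: float  # positive = inflow, negative = outflow

def flag_internal_transfers(txns: list[Txn], window_days: int = 3) -> set[int]:
    """Indexes of inflows that look like transfers between the borrower's own
    accounts: an outflow of the same size from a different account, nearby in time."""
    outflows = [t for t in txns if t.amount < 0]
    internal = set()
    for i, inflow in enumerate(txns):
        if inflow.amount <= 0:
            continue
        for out in outflows:
            if (out.account_id != inflow.account_id
                    and abs(-out.amount - inflow.amount) < 0.01
                    and abs((out.posted - inflow.posted).days) <= window_days):
                internal.add(i)
                break
    return internal

txns = [
    Txn("checking", date(2024, 5, 1), -500.00),  # money leaving checking
    Txn("savings",  date(2024, 5, 2),  500.00),  # same money arriving in savings
    Txn("checking", date(2024, 5, 3), 2100.00),  # payroll deposit, kept as income
]
print(flag_internal_transfers(txns))  # {1}: the savings inflow is not new income
```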

What if our data is messy or incomplete?

We normalize merchants and MCCs, reconcile raw display names with cleaned names, extract store numbers and locations, and align timestamps with recurrence patterns. Missing fields are surfaced with staleness and confidence indicators plus conservative fallbacks; any imputation is labeled so it never masquerades as raw data.

What does “coverage” mean in practice?

Coverage is the share of traffic with high‑confidence tags, merchants, MCCs, and locations. We publish coverage dashboards, track precision/recall, and show how changes affect downstream lift so you can prioritize the highest‑ROI fixes.
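
A minimal sketch of that coverage calculation, assuming a per‑transaction confidence field and a 0.9 cutoff (both illustrative):

```python
def coverage(transactions: list[dict], field: str, min_confidence: float = 0.9) -> float:
    """Share of transactions carrying a high-confidence value for `field`."""
    if not transactions:
        return 0.0
    covered = sum(
        1 for t in transactions
        if t.get(field) is not None
        and t.get(f"{field}_confidence", 0.0) >= min_confidence
    )
    return covered / len(transactions)

sample = [
    {"merchant": "Starbucks", "merchant_confidence": 0.97},
    {"merchant": "UNKNOWN 4411", "merchant_confidence": 0.40},
    {"merchant": None},
]
print(f"merchant coverage: {coverage(sample, 'merchant'):.0%}")  # 33%
```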

How do you validate tagging accuracy?

Golden datasets, regression tests, human‑in‑the‑loop checks for high‑impact domains (payroll, rent, utilities), and partner side‑by‑sides with precision/recall metrics to keep improvements durable.

How do you handle store numbers and location enrichment?

A hybrid of regex extraction and NER with place matching against reference datasets, shipped with precision/recall tracking on golden sets to quantify gains and avoid regressions.
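
For the regex half of that hybrid, here is a sketch of store‑number extraction from raw descriptors; the pattern and the example descriptor formats are illustrative, not the production ruleset.

```python
import re

# Raw descriptors often look like "WALMART #1234" or "CVS/PHARMACY STORE 04512".
STORE_NUMBER = re.compile(r"(?:#|STORE\s*|NO\.?\s*)(\d{3,6})\b", re.IGNORECASE)

def extract_store_number(descriptor: str) -> str | None:
    """Return the store number embedded in a transaction descriptor, if any."""
    match = STORE_NUMBER.search(descriptor)
    return match.group(1) if match else None

for raw in ["WALMART #1234 ANYTOWN TX", "CVS/PHARMACY STORE 04512", "UBER *TRIP"]:
    print(raw, "->", extract_store_number(raw))
```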

Modeling, calibration, and explainability

Can Pave work alongside our existing model?

Yes. Most customers ensemble Pave with in‑house or bureau scores. We provide score‑to‑score maps, decile tables, and threshold simulations so you can introduce Pave as an additive signal and later converge on a unified policy when desired.
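
For readers who want to see what a decile table involves, here is a generic construction from scored applications and observed outcomes; it is a sketch of the idea, not Pave's exact report format.

```python
import random
from statistics import mean

def decile_table(scores: list[float], outcomes: list[int], n_bins: int = 10) -> list[dict]:
    """Rank by score (descending), split into equal-count bins, and report the
    observed bad rate per bin so thresholds can be read off the table."""
    ranked = sorted(zip(scores, outcomes), key=lambda p: p[0], reverse=True)
    size = len(ranked) // n_bins
    rows = []
    for d in range(n_bins):
        chunk = ranked[d * size:(d + 1) * size] if d < n_bins - 1 else ranked[d * size:]
        if not chunk:
            continue
        rows.append({
            "decile": d + 1,
            "score_floor": min(s for s, _ in chunk),
            "bad_rate": round(mean(o for _, o in chunk), 4),
        })
    return rows

# Toy usage: 100 applications with synthetic scores and 0/1 default outcomes.
random.seed(7)
scores = [random.random() for _ in range(100)]
outcomes = [1 if random.random() > s else 0 for s in scores]  # lower scores default more
for row in decile_table(scores, outcomes):
    print(row)
```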

What validation gates must a model change pass before production?

Every change goes through staged gates: offline validation on recent holdouts, parity checks against prior versions, fairness and bias scans on protected segments, and performance stability by cohort. We then run a shadow test to confirm live payload parity, followed by a limited rollout with KPI bands and automatic rollback triggers. Promotion requires documented results and sign‑off from Product, Risk, and Engineering.
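
As a purely hypothetical sketch of what KPI bands and rollback triggers can look like in configuration form (the names, bounds, and trigger rule are invented for illustration):

```python
# Hypothetical rollout guardrails for a limited-traffic model promotion.
ROLLOUT_GUARDRAILS = {
    "traffic_share": 0.10,                              # limited rollout slice
    "kpi_bands": {
        "approval_rate":   {"min": 0.28, "max": 0.40},
        "ach_return_rate": {"max": 0.03},
        "d15_delinquency": {"max": 0.05},
    },
    "rollback": {"consecutive_breaches": 2, "action": "revert_to_prior_model"},
}

def breaches(observed: dict, bands: dict) -> list[str]:
    """Return the KPI names whose observed values fall outside their bands."""
    out = []
    for kpi, limits in bands.items():
        value = observed.get(kpi)
        if value is None:
            continue
        if ("min" in limits and value < limits["min"]) or \
           ("max" in limits and value > limits["max"]):
            out.append(kpi)
    return out

print(breaches({"approval_rate": 0.26, "ach_return_rate": 0.02},
               ROLLOUT_GUARDRAILS["kpi_bands"]))  # ['approval_rate']
```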

Do you provide explainability and adverse action support?

Yes. Decisions expose top drivers mapped to reason‑code language and human‑readable attributes. Declines can export compliant adverse‑action text; approvals can capture policy notes for borderline passes.

How do you map new scores to existing thresholds?

We build a score‑to‑score mapping over representative windows, confirm offers and declines align to objectives, and then flip the decision source to avoid distribution shocks.
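
One common way to build such a mapping is quantile matching over a shared historical window; the sketch below is a generic illustration of that idea, not Pave's exact procedure.

```python
import bisect

def quantile_map(old_scores: list[float], new_scores: list[float], old_threshold: float) -> float:
    """Translate a cutoff on the incumbent score into the value of the new
    score that approves the same share of the historical population."""
    old_sorted = sorted(old_scores)
    new_sorted = sorted(new_scores)
    # Percentile of the existing cutoff within the old distribution...
    pct = bisect.bisect_left(old_sorted, old_threshold) / len(old_sorted)
    # ...becomes an index into the new score's distribution.
    idx = min(int(pct * len(new_sorted)), len(new_sorted) - 1)
    return new_sorted[idx]

# Toy example: an incumbent cutoff of 620 maps to the new-score value that
# sits at the same percentile of the evaluation window.
old = [550, 580, 610, 620, 650, 700, 720]
new = [0.32, 0.41, 0.47, 0.52, 0.61, 0.74, 0.80]
print(quantile_map(old, new, old_threshold=620))  # 0.52
```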

Can leadership‑friendly explainability be provided?

Yes. We translate model effects into portfolio KPIs with odds‑to‑default tables and before/after decision examples so non‑technical leaders can sign off confidently.

How do you avoid leakage and look‑ahead bias?

Features are computed with explicit as‑of timestamps anchored to underwriting dates, and labels exclude post‑decision information. Holdouts and shadow runs confirm live behavior matches backtest assumptions.
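
A minimal sketch of that as‑of discipline, assuming a simple transaction record with a posted date; everything on or after the underwriting date is excluded by construction.

```python
from datetime import date, timedelta

def point_in_time_features(txns: list[dict], as_of: date, window_days: int = 90) -> dict:
    """Compute features using only transactions visible on the underwriting
    date; anything posted on or after `as_of` is excluded."""
    start = as_of - timedelta(days=window_days)
    visible = [t for t in txns if start <= t["posted"] < as_of]
    inflow = sum(t["amount"] for t in visible if t["amount"] > 0)
    outflow = -sum(t["amount"] for t in visible if t["amount"] < 0)
    return {"as_of": as_of, "inflow_90d": inflow, "outflow_90d": outflow,
            "net_cash_flow_90d": inflow - outflow}

txns = [
    {"posted": date(2024, 3, 1), "amount": 2500.0},
    {"posted": date(2024, 4, 1), "amount": 2500.0},
    {"posted": date(2024, 4, 20), "amount": -1800.0},
    {"posted": date(2024, 5, 2), "amount": 2500.0},  # after the decision date: excluded
]
print(point_in_time_features(txns, as_of=date(2024, 5, 1)))
```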

What if live performance diverges from backtests?

We check cohort mix, data freshness, and routing, then evaluate calibration and thresholds. If the population shifted, we recommend targeted recalibration or a short retrain, validated in shadow with KPI bands.

Backtesting, evaluation, and lift

How quickly can we run a backtest?

If you have 6–12 months of transactions and outcomes (even approvals‑only), we typically return a first backtest in 1–3 weeks. The readout covers approvals, 5/15/30‑day delinquency, ACH returns, and expected loss by cohorts and amount buckets, then highlights quick wins for shadow and A/B.

Do we need denied applicants?

Denied‑applicant data reduces selection bias and strengthens calibration, but it isn't required to start. Many teams begin with approvals and add reject inference later. We document trade‑offs, run sensitivity checks, and incorporate denials when you're ready so thresholds reflect the true decision frontier.

What does a standard evaluation plan look like?

Backtest to quantify potential, shadow test to harden data flows and calibrations, then a controlled A/B that isolates the score or policy change, with weekly reviews and pre‑agreed stop/go criteria.

How do you measure lift credibly?

We track approvals, 5/15/30‑day delinquency, ACH returns, expected loss, and net yield, segmented by amount bucket, tenure, channel, and cohort. We include confidence bands so you can tune thresholds against portfolio objectives.
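
As an example of the confidence bands mentioned above, a binomial (Wilson) interval around each segment's observed rate shows when two cohorts genuinely differ; this is a standard statistical sketch, not a Pave‑specific method.

```python
from math import sqrt

def wilson_interval(bad: int, total: int, z: float = 1.96) -> tuple[float, float]:
    """95% Wilson confidence interval for an observed delinquency/return rate."""
    if total == 0:
        return (0.0, 1.0)
    p = bad / total
    denom = 1 + z**2 / total
    center = (p + z**2 / (2 * total)) / denom
    half = (z / denom) * sqrt(p * (1 - p) / total + z**2 / (4 * total**2))
    return (max(0.0, center - half), min(1.0, center + half))

# e.g. 18 delinquent loans out of 400 in one amount bucket
low, high = wilson_interval(18, 400)
print(f"4.5% observed, 95% CI [{low:.1%}, {high:.1%}]")
```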

Which KPIs matter most to executives?

Approvals, delinquency and returns, expected loss, net yield, CAC‑to‑approval, and time‑to‑value. We structure readouts against these KPIs with attribution to policy changes.

Can we export sample payloads and schemas?

Yes. Example requests and responses for attributes and scores, plus SFTP schemas for batch, are ready to drop into Postman and integration tests.

Deployment, APIs, and rollout

Can we start with batch exports and move to APIs?

Yes. Start with SFTP/warehouse exports for speed; graduate to real‑time endpoints once value and governance are proven. Schemas are versioned across both modes to avoid rework.

What’s the path from pilot to full production?

Once A/B KPIs are met, finalize thresholds, lock governance and monitoring, expand coverage, and align finance and product on pacing and volume. Maintain a regular performance cadence to keep rollout controlled.

How do you manage schema changes safely?

Each field has a versioned contract and deprecation window. We publish change notes, run regression suites, and provide pre‑release testing so downstream systems don’t break mid‑quarter.

How do you handle discrepancies between async and sync flows?

We document which fields power the UI versus decisions, use feature flags for risky changes, and provide side‑by‑side payloads to reconcile behavior and align partner expectations.

What happens if a connector fails?

We degrade gracefully with alerting, retries, safe cache fallbacks, and clear status for Ops and Support. Post‑incident, we document root cause and prevention.

Collections and payment timing

Can you predict next payday to time collections?

Yes. By combining observed cadence with inflow variance and pay‑window heuristics, we predict the next expected pay date per user. Teams schedule ACH attempts when balances are likely sufficient, reducing returns and improving recovery without adding friction.
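
A simplified sketch of the cadence idea, assuming payroll deposits have already been identified; the production logic also weighs inflow variance and pay‑window heuristics.

```python
from datetime import date, timedelta
from statistics import median, pstdev

def predict_next_payday(payroll_dates: list[date]) -> tuple[date, float]:
    """Estimate the next pay date from observed deposit cadence.
    Returns (predicted_date, gap_std_days) so callers can widen the
    scheduling window when the cadence is noisy."""
    dates = sorted(payroll_dates)
    if len(dates) < 2:
        raise ValueError("need at least two observed payroll deposits")
    gaps = [(b - a).days for a, b in zip(dates, dates[1:])]
    typical_gap = median(gaps)          # ~14 for biweekly, ~30 for monthly
    noise = pstdev(gaps) if len(gaps) > 1 else 0.0
    return dates[-1] + timedelta(days=typical_gap), noise

deposits = [date(2024, 3, 1), date(2024, 3, 15), date(2024, 3, 29), date(2024, 4, 12)]
next_pay, jitter = predict_next_payday(deposits)
print(next_pay, f"+/- {jitter:.1f} days")  # 2024-04-26 +/- 0.0 days
```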

How do you reduce ACH returns and collection costs?

We provide an ACH return‑risk score with timing recommendations to route high‑risk cases to safer strategies and schedule attempts around cash arrival. With debit‑vs‑ACH decisioning and capped retries, teams typically see fewer NSFs, lower processing costs, and steadier cash flow.

How do you time debit vs ACH retries?

Combine return‑risk and next‑pay predictions to stage attempts when balances are likeliest to clear and cap retries to avoid fatigue—easy to A/B against your current cadence.
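
A toy sketch of that staging logic, with an assumed 0.5 risk cutoff, 4‑day spacing, and a cap of three attempts; all of these knobs are illustrative.

```python
from datetime import date, timedelta

def schedule_attempts(next_payday: date, return_risk: float,
                      max_attempts: int = 3) -> list[date]:
    """Stage debit/ACH attempts around predicted cash arrival.
    High return-risk accounts wait until just after payday; low-risk accounts
    can be attempted a day earlier. Retries are capped to avoid fee fatigue."""
    if return_risk >= 0.5:
        first = next_payday + timedelta(days=1)
    else:
        first = next_payday - timedelta(days=1)
    # Space any retries a few days apart, up to the cap.
    return [first + timedelta(days=4 * i) for i in range(max_attempts)]

print(schedule_attempts(date(2024, 4, 26), return_risk=0.7))
# [2024-04-27, 2024-05-01, 2024-05-05]
```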

Can you support rent and utility detection use cases?

Yes. We codify on‑time and late windows, support partial payments, and tune by channel. For high‑stakes uses (tenant screening, reporting), we pair models with rules and validation sets to maintain precision.
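
As a minimal illustration of codified on‑time and late windows, assuming a monthly due day and a configurable grace period (partial payments and channel tuning are omitted):

```python
from datetime import date

def classify_rent_payment(due_day: int, paid: date, grace_days: int = 5) -> str:
    """Label a rent payment against its monthly due date with a grace window."""
    due = paid.replace(day=min(due_day, 28))  # clamp so every month has the day
    days_late = (paid - due).days
    if days_late <= 0:
        return "early/on-time"
    return "on-time (grace)" if days_late <= grace_days else "late"

print(classify_rent_payment(due_day=1, paid=date(2024, 4, 4)))   # on-time (grace)
print(classify_rent_payment(due_day=1, paid=date(2024, 4, 15)))  # late
```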

What if we lack balances on transactions?

We use per‑account balance snapshots reconciled to transactions; if we must impute, we label it clearly and quantify sensitivity so decision makers and Compliance understand its influence.

How do you separate primary vs secondary accounts?

Rules flag primary accounts (recurring payroll, share of spend), and borrower‑level aggregates weight them appropriately. The logic is auditable and tunable to your portfolio.

Segmentation and policy tuning

Can we prioritize specific channels or tenures?

Yes. Policy curves by channel, tenure, pay frequency, and amount bucket—with guardrails—let you tune approvals to strategic segments while protecting portfolio quality.

Do you support custom segments by amount buckets?

Yes. We provide bucket‑specific deciles and policy curves and guard against sparse data so you don’t over‑fit small cohorts.
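
A sketch of the sparse‑data guard, assuming a minimum sample size below which a bucket inherits the portfolio‑level cutoff; the numbers are illustrative.

```python
def bucket_threshold(scores: list[float], target_approval_rate: float,
                     fallback: float, min_sample: int = 500) -> float:
    """Pick a score cutoff for one amount bucket. If the bucket is too thin
    to estimate reliably, fall back to the portfolio-level cutoff instead of
    over-fitting a sparse cohort."""
    if len(scores) < min_sample:
        return fallback
    ranked = sorted(scores, reverse=True)
    cutoff_index = max(0, int(len(ranked) * target_approval_rate) - 1)
    return ranked[cutoff_index]

portfolio_cutoff = 0.62
thin_bucket = [0.70, 0.55, 0.80]  # far below min_sample
print(bucket_threshold(thin_bucket, 0.4, portfolio_cutoff))  # 0.62 (fallback)
```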

Can Pave replace our existing model over time?

Yes. Most customers start by ensembling Pave with in‑house or bureau scores. We provide score‑to‑score maps, decile tables, and threshold simulations so you can introduce Pave as an additive signal and later converge on a unified, Pave‑led policy when the evidence supports it.

Can you support SMB cash‑flow underwriting?

Yes. We translate transaction streams into SMB‑appropriate affordability and stability features and augment with statements when needed—supporting automated or analyst‑assisted flows.

Monitoring, reliability, and governance

What’s your approach to monitoring and alerts?

We track data freshness, attribute latency, score distribution health, approval/return swings, and cohort mix. Alerts include context for quick diagnosis; weekly reviews summarize trends and actions.

How do you monitor and respond to model drift?

We monitor input distributions, live AUC/KS, decile stability, and cohort composition. Divergence triggers calibration or threshold remapping; material shifts trigger a retrain on recent cohorts, validated in shadow before ramping, with rollbacks and written rationale for governance.
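
One widely used drift signal is the population stability index (PSI) between the training‑period score distribution and live traffic; the sketch below bins on baseline deciles and uses the conventional 0.1/0.25 rules of thumb, all as illustrative assumptions.

```python
import random
from math import log

def psi(baseline: list[float], live: list[float], n_bins: int = 10) -> float:
    """Population Stability Index between two score samples, binned on the
    baseline's deciles. Roughly: <0.1 stable, 0.1-0.25 watch, >0.25 investigate."""
    base_sorted = sorted(baseline)
    # Bin edges at baseline deciles.
    edges = [base_sorted[int(len(base_sorted) * q / n_bins)] for q in range(1, n_bins)]

    def shares(sample: list[float]) -> list[float]:
        counts = [0] * n_bins
        for x in sample:
            counts[sum(1 for e in edges if x > e)] += 1
        return [max(c / len(sample), 1e-6) for c in counts]  # avoid log(0)

    return sum((a - e) * log(a / e) for a, e in zip(shares(live), shares(baseline)))

random.seed(1)
train = [random.gauss(0.60, 0.10) for _ in range(5000)]
prod  = [random.gauss(0.55, 0.12) for _ in range(5000)]  # shifted live population
print(f"PSI = {psi(train, prod):.3f}")
```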

How do we keep costs predictable as we scale?

Commercial levers (tiered pricing, staged rollouts) plus technical levers (payload minimization, selective recompute, caching for high‑volume paths). We size both and set a no‑surprises schedule.

What is your privacy and compliance posture?

We minimize PII, encrypt in transit/at rest, enforce least‑privilege and audit trails, separate raw sources from decision artifacts, and document lineage, reason codes, and adverse action support for regulated workflows.

How does Pave support regulated lenders?

Strong data governance, clear lineage, versioned models and endpoints, explainability, and adverse action support—suited to partner and regulator reviews—shorten approvals and reduce re‑work.

Transform your data sources into Attributes and Scores

Discover how to accelerate your model development from months to days.
