PayWave, a digital payments company, noticed that transactions using a VPN are flagged as fraudulent more often than transactions without a VPN. The fraud team wants to know whether VPN usage itself causes fraud risk to increase, or whether the relationship is driven by other factors such as cross-border activity.
You are given aggregate transaction data split by VPN usage and by cross-border status. First, quantify the observed correlation between VPN usage and fraud. Then evaluate whether that relationship still holds after controlling for cross-border transactions, and explain why correlation alone does not establish causation.
| Group | Transactions | Fraudulent Transactions | Fraud Rate |
|---|---|---|---|
| No VPN | 80,000 | 1,200 | 1.50% |
| VPN | 20,000 | 800 | 4.00% |
| Segment | Group | Transactions | Fraudulent Transactions | Fraud Rate |
|---|---|---|---|---|
| Domestic | No VPN | 70,000 | 700 | 1.00% |
| Domestic | VPN | 5,000 | 75 | 1.50% |
| Cross-border | No VPN | 10,000 | 500 | 5.00% |
| Cross-border | VPN | 15,000 | 725 | 4.83% |
Assume fraud labels are finalized and transaction assignment to these groups is accurate.