Skip to main content
Modern Value Investing

Using Alternative Data Without Losing Focus

Pomegra Learn

Using Alternative Data Without Losing Focus

The proliferation of alternative data—satellite imagery of parking lots, credit card transaction data, web traffic logs, trucking GPS signals—has created a new mythology around investing: that the investor with the best data wins.

This is partly true. An investor who knew Tesla's Fremont factory was running three-shift production in 2019 had edge. An investor who noticed Shopify sellers growing faster than expected in 2020 had early signal.

But alternative data is a tool, not a strategy. Many investors have buried themselves in data—spending $10k monthly on satellite imagery subscriptions, hiring data scientists to parse web logs—only to discover that the signal is either: (a) already known to the market, (b) misinterpreted, or (c) overwhelmed by fundamental factors that traditional analysis would have identified faster.

Quick definition: Alternative data is any information source outside traditional financial statements, earnings calls, and news—including satellite imagery, credit card transactions, shipping data, and web traffic. Used correctly, it validates theses. Used incorrectly, it creates false confidence in weak ideas.

Key Takeaways

  • Alternative data works best when it confirms a thesis already identified through fundamental research, not as the primary input to stock selection
  • High-quality alternative data (satellite imagery of factory activity, real transaction data from large retailers) provides validation; noisy data (web scrapers, incomplete transaction sets) creates false signals
  • The advantage window is narrow: once data becomes available to hedge funds and sophisticated investors, retail investors should assume the opportunity is priced in
  • The value of alternative data lies not in surprising upside but in validating that fundamental assumptions (factory utilization, store traffic, usage trends) are correct
  • For most value investors, traditional analysis—reading 10-Ks, understanding competitive positioning, calculating intrinsic value—still dominates edge-building

When Alternative Data Genuinely Helps

Validating capacity utilization: A company claims its factories are running at 80% capacity utilization. Satellite imagery of parking lots and electrical usage can validate this. If imagery shows full parking lots and high nighttime power usage, the claim is likely honest. If parking lots are empty at 6 PM, you have discovered a red flag.

Tracking store traffic trends: A retail company claims in-store traffic is stable. Credit card data aggregators (Affinity Solutions, Facteus) can show whether store visits are rising or falling. This is most useful when public guidance suggests stability but data shows deterioration—early warning of a miss.

Monitoring subscription churn: For SaaS businesses, credit card data can reveal how many customers are churning. If a company claims 92% net retention but credit card data shows subscription cancellations accelerating, you have found a discrepancy.

Identifying secular trends early: Satellite imagery of Chinese construction sites, shipping data from ports, or container-tracking data can identify boom or bust cycles before traditional economic data. This is valuable for cyclical businesses where timing is everything.

Real-World Examples

Palantir and geospatial analysis (pre-IPO, 2015–2020): Investors who understood that satellite imagery and geospatial data would become essential to defense, agriculture, and commodity trading positioned early in companies capturing that trend. The thesis was simple and fundamental; alternative data validated the market opportunity.

Tesla factory utilization (2019–2020): An investor who used satellite imagery to confirm that Tesla's Fremont factory was operating three shifts confirmed that production claims were conservative. This validated a bullish thesis based on fundamental analysis.

Amazon warehousing expansion (2020–2021): Satellite imagery tracked Amazon's extraordinary warehouse buildout during COVID-19 lockdowns. An investor watching this could have seen the revenue inflection coming before it appeared in quarterly results.

Bed Bath & Beyond store closures (2022): Credit card transaction data would have shown dramatic store traffic declines months before the company formally announced store closings. An investor using this data could have shorted or exited earlier than those relying solely on quarterly earnings.

The False Prophets Problem

Alternative data has spawned an entire industry of vendors promising to deliver edge through datasets. Many are misleading.

Incomplete datasets: A credit card data vendor claiming to track all U.S. retail transactions actually tracks transactions from partner credit cards only—typically 20–40% of all transactions. The data is biased toward credit users, omitting cash and debit. Drawing conclusions from this incomplete picture often leads to wrong signals.

Aggregation delay: Satellite imagery is real-time, but financial conclusions require analysis. By the time a hedge fund analyst concludes that factory utilization is weak, market participants are already repositioning. The advantage window is hours to days, not weeks.

Correlation vs. causation: A dataset shows that web scraper data indicates declining visits to a company's website. Investors conclude sales are falling. But website traffic can fall for reasons unrelated to business health (site redesign, marketing pullback, or platform changes). The data is noisy.

Overfitting: The most dangerous alternative data mistake is using historical data to "prove" a hypothesis, then assuming it will hold prospectively. A pattern in parking lot traffic might have predicted results in 2018–2019, but be irrelevant in 2024 due to changed conditions.

When NOT to Use Alternative Data

When fundamental questions are unanswered: If you do not understand a company's core business, competitive advantages, or cash generation, alternative data will not save you. It is a supplement to fundamental analysis, not a replacement.

When the time frame does not match your holding period: Alternative data is most useful for near-term validation (next 3–6 months). If your thesis is a 5-year compounding story, satellite imagery of factory activity in Q1 will not drive investment returns.

When costs exceed potential edge: A $10k/month satellite imagery subscription is only worth it if the edge it provides generates >$50k annual alpha on your portfolio. For a $1M portfolio, this is a 5%+ annual benefit—a high bar that few investors clear.

When the thesis is obvious: If you are using alternative data to validate that inflation is high, consumer spending is weak, or factory utilization is declining, you are likely confirming what the market already knows. The data may be well-sourced, but the insight is stale.

A Framework for Using Alternative Data Responsibly

Step 1: Start with fundamental analysis. Identify a mispriced business through traditional research (10-K analysis, competitive positioning, valuation). Develop a thesis: "Factory utilization is X, not Y."

Step 2: Identify a single data point to validate. Do not order five datasets. Pick one: satellite imagery to confirm factory activity, or credit card data to confirm store traffic. Make the data do specific work.

Step 3: Understand the data's limitations. A dataset is not reality; it is a proxy. Satellite imagery shows parking lots, not production volume. Credit card data shows a sample, not the whole market. Adjust your conclusions accordingly.

Step 4: Set a decision threshold. Before analyzing the data, decide: "If factory parking lots are empty, I will reduce position size by 50%. If they are full, I will hold." Make the rule before the data arrives, so you are not unconsciously interpreting favorable data favorably.

Step 5: Treat the validation as a datapoint, not gospel. If alternative data conflicts with your fundamental thesis, ask why. Maybe the data is wrong. Maybe your thesis is wrong. Maybe the time frame is off. Use the discrepancy as a question, not a death knell.

FAQ

Should I subscribe to an alternative data vendor? Only if you have a specific thesis where one data point would materially change a multi-million-dollar decision, and you have validated that the data is reliable (triangulate with other sources, test historical accuracy). For most retail investors under $5M in capital, the costs exceed the benefits.

Can I use free alternative data sources? Yes. Zillow and Redfin provide real estate data. Government databases (Census, trade data) provide economic signals. Credit card company press releases sometimes reveal spending trends. Twitter discussions among practitioners often surface real-time observations that are more reliable than vendor data.

Is satellite imagery really useful? Yes, but for specific purposes: tracking physical infrastructure (factory expansion, real estate development, agricultural land use). It is less useful for validating intangible factors (brand strength, customer satisfaction, management competence).

How far ahead does alternative data get you? In efficient markets, not far. Once a smart hedge fund or data scientist discovers a signal, it typically diffuses within weeks. The edge window for alternative data is hours to months, not years. This suits short-term traders but is less relevant for long-term value investors.

What's the biggest mistake investors make with alternative data? They treat it as a proxy for a thesis instead of a confirmation tool. A dataset showing website traffic is rising does not mean a business will compound at 20% annually. It means one assumption (traffic growth) has been validated. The thesis still depends on dozens of other factors.

  • The law of diminishing returns on information: When more data creates more noise than signal
  • Fundamental analysis as the anchor: Why financial statements remain the most important information source
  • Timing and catalysts: Why even perfect information is worthless if the catalyst is years away
  • Confirmation bias in data analysis: How to avoid cherry-picking data that supports your thesis

Summary

Alternative data is a supplement, not a foundation. Its best use is validating a thesis developed through fundamental analysis, allowing an investor to confirm that assumptions about factory utilization, customer behavior, or competitive positioning are correct. Most value investors will generate greater edge by deeply understanding a business's fundamentals, competitive position, and intrinsic value than by pursuing every new data vendor.

The discipline is recognizing when alternative data is genuinely useful (confirming specific, falsifiable claims) versus when it is noise wrapped in quantitative wrapping paper. The best investors use alternative data lazily: they identify one specific question, find the cheapest data source that answers it, and set a decision threshold before the data arrives.

Next

Blending Deep Value with High Quality