Using Alternative Data and Channel Checks Responsibly

The practical point: alternative data can give you a 2-6 week edge on quarterly results, but only if you triangulate it with fundamentals, respect legal boundaries, and accept that 30-50% of alt-data signals are noise that fails to replicate out of sample.
Why Alternative Data Matters (and Why It Can Mislead)
Alternative data refers to non-traditional information sources--web traffic, app downloads, credit card panels, satellite imagery--used to estimate company performance before official filings. Hedge funds spent an estimated $1.7 billion on alternative data in 2020, growing 20-25% annually since.1
The point is: you are not buying "insight"; you are buying partial, noisy signals that require calibration. A single alt-data source might explain 15-40% of revenue variance for e-commerce, but only 5-10% for B2B software with enterprise contracts.
Types of Alternative Data (What You Can Access)
Web Traffic Data (SimilarWeb, Semrush): Estimates monthly visits, time-on-site, and page views. Works best where online engagement directly correlates with revenue. Key signals: month-over-month visit growth, bounce rate changes, geographic shifts. Limitation: estimates can be 15-30% off versus actual server data.
App Download and Usage Data (Sensor Tower, data.ai): Tracks downloads, DAU/MAU, session duration. The point is: downloads measure acquisition; DAU/MAU measures retention--and retention drives revenue. A >20% DAU/MAU ratio is strong for consumer apps. Limitation: install estimates can be 20-40% off for smaller apps.
Credit Card Panel Data (Second Measure, Earnest): Aggregates anonymized transactions from panels representing 2-5% of U.S. consumers. Provides direct revenue proxy for consumer-facing companies. Key signals: spend growth vs. prior year, customer count vs. spend per customer. Limitation: panels skew by bank and demographics; B2B revenue invisible.
Satellite and Geolocation Data: Tracks parking lot traffic (retail), oil storage levels (energy), construction activity (real estate). Key signals: car counts vs. prior quarter, storage fill rates. Limitation: weather and local events create noise; interpretation requires domain expertise.
Legal and Ethical Boundaries (The Line You Do Not Cross)
The point is: legal alternative data comes from observable public behavior or consensual data sharing; illegal data comes from stolen information, hacking, or material non-public sources.
What Is Legal:
- Web scraping of publicly accessible pages (check terms of service)
- Aggregated, anonymized transaction data from panels with user consent
- Satellite imagery of public spaces
- Job posting analysis from public job boards
What Crosses the Line:
- Corporate espionage: obtaining internal documents, hacking systems
- Tipping: receiving material non-public information from insiders
- PII exposure: using data that identifies individuals without consent
Why this matters: the SEC has prosecuted cases where expert network consultants provided material non-public information disguised as "industry insights." If your data source cannot be explained publicly, you have a problem.
Channel Check Methodology (Primary Research Done Right)
A channel check is direct contact with suppliers, distributors, or customers to assess real-time performance. This is legal when done correctly.
The Right Way:
- Identify yourself and your purpose. You are an investor researching the industry.
- Ask about industry trends, not company secrets. "How is demand?" not "What did Company X order?"
- Aggregate multiple sources. One distributor is anecdote; five are data.
- Document your process. Written notes of calls, dates, topics.
The Wrong Way: Pretending to be a customer, asking employees to violate confidentiality, paying for insider access, relying on a single source.
The point is: channel checks are mosaic theory in action--combining public data with non-material observations to form a thesis. The mosaic is legal; the stolen piece is not.
Triangulating Alt Data with Fundamentals
Never trade on alt data alone. You triangulate: does this signal align with financials, guidance, and industry data?
The Framework:
- Alt data signal: Web traffic up 18% month-over-month
- Fundamental check: Prior guidance implied 10-12% revenue growth. Is 18% traffic plausible?
- Industry context: Competitors showing similar trends?
- Historical calibration: How did traffic-to-revenue conversion play out in prior quarters?
The test: can you explain divergence? Traffic up but revenue flat? Maybe conversion dropped. App downloads surging but engagement flat? Maybe paid acquisition is spiking with low-quality users. Why this matters: divergence is often a data quality issue, not an alpha signal.
Limitations and Lag Issues
Coverage Gaps: E-commerce has strong alt-data coverage; B2B software has weak coverage (enterprise deals not in panels); financial services are limited (AUM and fees invisible).
Lag: Alt data is not real-time. Web traffic: 7-14 day lag. Credit card data: 2-4 week lag. The point is: by the time you see the signal, fast-moving hedge funds may have already traded.
Signal Decay: Research shows alt-data alpha decays quickly. Web traffic signals lost 50% of predictive power within 2 years of becoming widely available.2
Worked Example: Web Traffic to Revenue Estimate
You analyze RetailCo (hypothetical), an e-commerce company. Consensus expects $1.15 billion Q4 revenue (+9% YoY).
Step 1 -- Gather Traffic Data:
- October visits: 42.3 million
- November visits: 48.1 million (+13.7% MoM)
- December pace: 51.2 million
Q4 total: ~141 million visits vs. ~125 million prior year (+12.8% YoY traffic growth).
Step 2 -- Estimate Revenue: Using 3.2% conversion rate and $85 AOV from prior filings:
- 141M x 3.2% x $85 = ~$384M/month
- Q4 implied: $1.15-$1.20 billion (supports a 2-4% beat)
Step 3 -- Triangulate:
- Credit card panel shows +11% spend growth (aligns)
- Competitor traffic up 8% (RetailCo outperforming)
Step 4 -- Size Position: Traffic data has 15-20% error bars. You assign 60% probability to beat, 30% in-line, 10% miss. This supports modest overweight--not aggressive positioning.
What experience teaches: alt data gave you a directional signal, but error bars prevented over-commitment--which is exactly how it should work.
Implementation Checklist (Tiered by ROI)
Essential (high ROI):
- Verify legal sourcing: can you explain where this data came from publicly?
- Check coverage: is this company well-represented in the data source?
- Triangulate: does alt data align with fundamentals?
- Assign error bars: what is the historical accuracy?
High-Impact (calibration workflow):
- Backtest signal vs. results for 4-8 quarters before trading
- Track signal decay: is alpha shrinking?
- Combine multiple alt-data sources to reduce noise
Optional (for active traders):
- Subscribe to real-time feeds ($10,000-$100,000+ annually)
- Build conversion/AOV models by company
The Takeaway
The takeaway: alternative data and channel checks are tools for sharpening estimates, not replacing fundamental analysis. Practitioners who use alt data profitably treat it as one input among many--triangulating across sources and respecting the 30-50% noise rate that separates signal from speculation.
Footnotes
Related Articles

ROE, ROIC, and Economic Profit Explained
Master return on equity, return on invested capital, and economic profit calculations with DuPont decomposition, sector benchmarks, and worked examples

Documenting a Thesis and Update Triggers
Learn about documenting a thesis and update triggers with practical examples and actionable frameworks for equity analysis

Tax Treatment of Qualified vs. Ordinary Dividends
Tax Treatment of Qualified vs. Ordinary Dividends provides a systematic framework for interpreting market information and identifying opportunities. Mastering this concept helps investors build sustai