Skip to main content
ESG Ratings and Their Disagreements

The ESG Data Gap Problem: Missing, Unreliable, and Self-Reported Data

Pomegra Learn

Why Is ESG Data Quality Such a Persistent Problem?

The ESG investing industry faces a structural data problem: the information that investment-grade ESG analysis requires either does not exist, is not disclosed by companies, or is disclosed in ways that are inconsistent, unverified, and difficult to compare. Financial analysts have spent over a century developing standardized accounting frameworks, mandatory disclosures, and external audit requirements to ensure that financial data is reliable enough for investment decisions. ESG data does not have that infrastructure — most ESG metrics are voluntary, unaudited, and produced using company-chosen methodologies that change year to year. Understanding the data gap problem is essential for using ESG ratings critically.

Quick definition: The ESG data gap problem refers to the inadequacy of available ESG data for investment-grade analysis — caused by incomplete disclosure (companies do not report many relevant ESG metrics), unreliable self-reporting (disclosed data is unverified), inconsistent methodologies (companies measure the same metrics differently), and coverage gaps (rating agencies cannot assess what companies do not disclose).

Key takeaways

  • Most ESG data is still self-reported without mandatory external audit or verification, in contrast to financial data where external audits are legally required for public companies.
  • The most significant data gaps are for: scope 3 greenhouse gas emissions (rarely fully disclosed), supply chain social conditions (almost never directly measured), biodiversity impact (minimal standardized metrics), and private company ESG data (minimal coverage by rating agencies).
  • Disclosure coverage varies dramatically by company size and geography: large US and European companies disclose significantly more ESG data than small-cap companies and companies in emerging markets.
  • Rating agencies use imputation and estimation to fill data gaps — but imputed scores are significantly less reliable than disclosed data, and many providers do not clearly distinguish imputed from disclosed scores.
  • Regulatory developments (CSRD, ISSB S1/S2, SEC climate rules) are the primary mechanism for improving ESG data quality, by requiring standardized, audited ESG disclosure.

The Disclosure Voluntariness Problem

Unlike financial reporting — where SEC rules require specific financial disclosures, GAAP defines measurement methodologies, and external auditors provide independent verification — most ESG reporting remains voluntary in most jurisdictions. This creates a structural bias:

Companies disclose what makes them look good: In a voluntary system, companies emphasize metrics where they perform well and omit metrics where they perform poorly. A company that has reduced energy consumption will prominently report energy intensity reduction; a company with declining diversity metrics may not report diversity data at all. ESG ratings built primarily on voluntary corporate disclosures inherit these selection biases.

Methodologies vary: Two companies that both report "carbon emissions per unit of revenue" may use different revenue denominators, different emission factor databases, different organizational boundary definitions, and different base years. Without standardized methodologies, comparability across companies is limited even when the same metric is reported.

Data changes year to year: Companies periodically restate historical ESG data when they change methodologies, acquire or divest businesses, or receive updated emission factors. This makes time-series comparison difficult and creates reliability concerns when analyzing ESG trends.

The Verification Gap

For financial reporting, external auditors must provide reasonable assurance on financial statements, and auditor standards create legal accountability for audit quality. ESG reporting has a weaker assurance framework:

Limited assurance vs. reasonable assurance: Most corporate sustainability reports receive "limited assurance" at best from external verifiers — a lower assurance standard than the "reasonable assurance" applied to financial audits. Limited assurance means the verifier has not found material errors, not that the data is confirmed accurate.

Voluntary assurance: In most jurisdictions (outside CSRD requirements in Europe), ESG assurance is voluntary. Many companies produce sustainability reports with no external verification at all. Investors using unverified ESG data have no independent confirmation that reported metrics reflect actual performance.

Assurance scope limitations: Even when sustainability reports receive external assurance, the scope typically covers only a subset of reported metrics — often the most quantitative and easily verifiable ones (carbon emissions, water use, health and safety incidents). Qualitative disclosures about policies, programs, and management quality are rarely subject to independent verification.

ESG data quality spectrum

The Scope 3 Emissions Problem

Scope 3 greenhouse gas emissions — all indirect emissions in a company's value chain, both upstream (supply chain) and downstream (product use) — represent the vast majority of most companies' total carbon footprint. A consumer goods company may have scope 1 and 2 emissions of 1 million tons CO₂e and scope 3 emissions of 100 million tons. Climate analysis that ignores scope 3 is missing the most significant part of the picture.

But scope 3 data is:

  • Rarely disclosed comprehensively: Most companies disclose only selected scope 3 categories; full value chain scope 3 disclosure remains uncommon even among large companies.
  • Methodologically inconsistent: Scope 3 emissions are typically estimated (not measured) using emission factors applied to spend data or activity proxies. Different companies use different emission factor databases and different estimation methodologies, making peer comparison unreliable.
  • Subject to double-counting: The same emissions appear in one company's scope 3 and another company's scope 1 or 2, creating double-counting challenges when aggregating portfolio scope 3 footprints.

CSRD and ISSB S2 are pushing for improved scope 3 disclosure, but data quality challenges will persist even as disclosure improves.

Supply Chain Social Data

Social ESG metrics face even more severe data gaps than environmental metrics:

Supply chain labor conditions: The most significant labor risks in many industries — garment manufacturing, electronics, agricultural commodities — exist in tier-2 and tier-3 suppliers that have no disclosure obligations and limited audit access. A clothing brand may disclose excellent labor conditions in tier-1 manufacturing facilities while having no visibility into sub-suppliers' practices. Rating agencies have virtually no direct data on these conditions.

Forced labor screening: Identifying forced labor or child labor in complex global supply chains requires on-the-ground investigation that no ESG rating agency conducts systematically. Proxy indicators — country of origin, industry, audit coverage — are used instead, but these are weak substitutes for actual supply chain monitoring.

Healthcare access and community impact: Social metrics related to community impact, indigenous community consultation, and access to essential services are almost entirely qualitative and self-reported. Rating agencies assess companies' stated policies and programs, not their actual community impact.

The Small-Cap and Emerging Market Coverage Gap

ESG data coverage deteriorates sharply below large-cap and outside major developed markets:

Small-cap disclosure: Most ESG rating agencies provide limited coverage for companies outside major equity indices. Small-cap companies lack the ESG reporting resources of large-cap peers and receive less ESG data collection attention from rating agencies. Investors in small-cap ESG strategies must either accept very limited ESG data or supplement with bespoke company-level research.

Emerging market coverage: ESG rating coverage for companies in emerging and frontier markets is materially thinner than for developed markets, for three compounding reasons: less disclosure by companies in markets with weaker mandatory reporting requirements; fewer ESG rating agency resources dedicated to non-Western market coverage; and language barriers that reduce accessibility of non-English disclosures to globally oriented rating agencies.

Real-world examples

Nike's supply chain gaps: Despite Nike being a leader in sustainability reporting for large consumer brands, investigations by NGOs have documented labor conditions in its supply chains that are inconsistent with its stated policies. This gap between policy disclosure (what Nike reports) and supply chain reality (what factory investigations find) illustrates the disconnect between ESG data availability and ESG reality for companies with complex global supply chains.

Agricultural commodity supply chain deforestation: The Tropical Forest Alliance and supply chain deforestation researchers have documented that corporate commitments to "deforestation-free" supply chains are frequently not supported by data demonstrating compliance. Many companies' "zero deforestation" pledges are verified only against a fraction of their actual supply chain, with coverage declining rapidly as supply chain complexity increases.

Scope 3 inconsistency at technology companies: Several studies have compared scope 3 emissions disclosures among large technology companies and found that companies using the same general methodology produce scope 3 figures that are difficult to compare because of different organizational boundary definitions, different handling of cloud infrastructure emissions attribution, and different product end-of-life emission assumptions.

Common mistakes

Treating rating agency imputed scores as reliable ESG data: When a company doesn't disclose a specific ESG metric, rating agencies impute a value based on industry averages or other proxies. These imputed values are flagged in data feeds but are often treated as equivalent to disclosed data in portfolio analysis. Imputed data introduces significantly more uncertainty than disclosed data.

Using ESG scores as a substitute for supply chain due diligence: ESG scores of a company's direct operations do not capture supply chain ESG risks. For companies in sectors with complex global supply chains (consumer goods, electronics, food), supply chain due diligence requires primary research beyond what ESG rating agencies provide.

Assuming European companies have better ESG data: European companies have stronger regulatory disclosure requirements than US and Asian companies, and in aggregate report more ESG data. But "more data" is not the same as "accurate data" — the European data advantage is primarily about disclosure volume, not necessarily verification quality.

FAQ

How does CSRD change the ESG data landscape?

CSRD requires EU-listed and large non-EU companies operating in the EU to disclose sustainability information using ESRS standards, with mandatory limited assurance initially escalating to reasonable assurance over time. This will materially improve ESG data availability and consistency for companies in scope — reducing disclosure gaps and improving comparability. However, CSRD covers only a subset of global companies, and its global impact depends on multinational companies applying CSRD standards beyond their EU operations.

How do ESG rating agencies handle missing data?

Rating agencies use several approaches to handle missing data: (1) score based only on available data, with lower confidence; (2) impute from industry averages or peer group data; (3) apply a "non-disclosure penalty" that reduces scores for missing data; (4) leave the metric blank and reweight other available metrics. Providers generally prefer approaches 1–3 over 4, to maintain coverage. The imputation approach and its accuracy vary by provider.

Are there open-source ESG data alternatives?

Several initiatives provide ESG data under open-access frameworks: CDP (Carbon Disclosure Project) provides company climate data from its annual questionnaire responses on a semi-public basis. The LSEG (formerly Refinitiv) ESG Open Access program provides some ESG data at no cost for academic research. The SFDR's Principal Adverse Impact (PAI) disclosures provide open-access ESG data for EU fund portfolios. Open-source ESG data remains limited compared to commercial data offerings.

Will AI improve ESG data quality?

AI and natural language processing are being applied to extract ESG data from unstructured sources — news articles, court filings, NGO reports — that are not captured in corporate sustainability reports. This can improve coverage (more companies assessed) and timeliness (faster detection of controversies). AI does not solve the verification problem — extracted unstructured data is not audited — but it can expand the information set and improve the detection of ESG risks that companies do not self-disclose.

How should investors communicate about ESG data limitations?

Institutional investors reporting portfolio ESG metrics should be transparent about data coverage gaps and imputation: what percentage of portfolio holdings have disclosed data versus imputed values, what the confidence level is for specific metrics, and what significant sources of data uncertainty exist. Presenting portfolio carbon footprints or ESG scores as precise measurements without acknowledging data quality limitations overstates the reliability of the underlying data.

Summary

The ESG data gap problem reflects a fundamental infrastructure deficit: ESG analysis requires data that is largely voluntary, unverified, inconsistently measured, and incomplete for large portions of the investable universe. The most significant gaps are scope 3 emissions, supply chain social conditions, and small-cap and emerging market coverage. ESG rating agencies fill gaps through imputation and estimation, but imputed data is substantially less reliable than disclosed data. Regulatory developments — CSRD, ISSB, and expanding national disclosure requirements — are the primary mechanism for improvement, though they address disclosed-but-unverified data before addressing the deeper verification challenge. Investors should treat ESG data as useful but imprecise signal, with explicit acknowledgment of its data quality limitations in portfolio reporting and decision-making.

Next

Controversy Adjustments