ESG Investing Performance: Does It Outperform or Underperform?
Does ESG Investing Outperform or Underperform?
The ESG investing performance debate is one of the most contested questions in finance. ESG proponents cite hundreds of academic studies showing neutral to positive performance; ESG critics cite the theoretical costs of constraint and studies showing underperformance; practitioners report wildly different outcomes across strategy types, time periods, and market conditions. The honest answer — rarely given because it disappoints both advocates and skeptics — is that ESG investing performance is highly heterogeneous: the performance outcome depends critically on which ESG strategy is being evaluated, over which time period, in which market conditions, with what benchmark, and using which performance methodology. There is no single ESG return. There are many ESG returns, varying systematically by strategy design.
The ESG performance debate concerns whether incorporating environmental, social, and governance factors into investment decisions improves, reduces, or leaves unchanged risk-adjusted financial returns — with evidence suggesting the answer depends heavily on the specific ESG strategy, time period, market conditions, and methodological choices applied.
Key Takeaways
- The academic meta-analysis evidence (Friede, Busch, Bassen 2015: 2,200+ studies) finds a predominance of neutral to positive ESG-financial performance relationships — but this aggregate finding conceals enormous variation by strategy type and context.
- ESG integration (risk management approach) has the strongest theoretical justification for neutral-to-positive performance; exclusionary screening has clearer theoretical performance costs.
- The 2020–2021 period of strong ESG outperformance was largely driven by sector tilts (underweight fossil fuels, overweight tech) rather than ESG quality per se — and reversed significantly in 2022 when energy prices surged.
- Comparing ESG fund performance requires matching benchmark, factor exposure, and time period — otherwise sector and factor differences masquerade as ESG effects.
- The most defensible conclusion for long-term investors: well-implemented ESG integration need not cost meaningful risk-adjusted return, while providing ESG risk management benefits and portfolio alignment with investor values.
Why the Performance Debate Is Difficult
Before examining evidence, understanding why performance measurement is hard:
Definition heterogeneity: ESG investing encompasses exclusionary screening, ESG integration, best-in-class selection, thematic investing, impact investing, and shareholder engagement. These strategies have fundamentally different return characteristics. Averaging their performance produces a meaningless composite.
Benchmark problem: ESG funds are often benchmarked against conventional indices that include the companies they exclude. When ESG funds exclude oil majors, their relative performance tracks the oil price as much as ESG quality. This benchmark mismatch attributes sector-rotation returns to ESG.
Factor contamination: ESG scores correlate with quality, low-volatility, and growth factors. ESG strategy performance partly reflects factor exposure rather than ESG as such. Isolating the pure ESG signal requires controlling for these factor exposures.
Time period sensitivity: ESG performance varies enormously across market regimes. The 2016–2021 period of low energy prices, tech sector dominance, and climate awareness growth favored ESG portfolios. The 2022 energy price spike and ESG political backlash reversed these advantages. Any single-period ESG performance claim carries significant time period selection bias.
Publication bias: Academic studies finding significant ESG performance effects are more likely to be published than studies finding no effect — inflating the apparent evidence for strong ESG-financial relationships.
The Academic Evidence: Meta-Analysis
The most comprehensive synthesis:
Friede, Busch, and Bassen (2015) — "ESG and Financial Performance: Aggregated Evidence from More than 2000 Empirical Studies": Found that approximately 90% of studies show neutral to positive ESG-financial performance relationships, with roughly 47.9% showing positive, 40.7% showing neutral, and only 7.1% showing negative relationships.
Limitations of meta-analysis: Aggregating 2,200+ studies covering different ESG definitions, asset classes, geographies, time periods, and methodologies produces a finding that is literally everything: ESG sometimes helps, sometimes hurts, usually doesn't matter significantly. The aggregate finding of "neutral to positive" is not very informative.
Morgan Stanley Institute for Sustainable Investing (2019): Found that sustainable equity mutual funds outperformed their traditional peer funds by median 4.3 percentage points in 2019, with lower downside risk. But 2019 was a strong ESG year — tech-heavy ESG portfolios outperformed during tech rallies.
Morningstar Research: Consistent finding that ESG funds have lower downside risk than conventional peers — lower standard deviation and shallower drawdowns in crisis periods (2008-2009, March 2020 COVID crash). But lower volatility may reflect quality and low-beta factor exposure rather than ESG per se.
Performance by Strategy Type
The evidence looks very different by strategy:
ESG integration (incorporating ESG as risk factors in fundamental analysis): Performance implications are generally neutral — analysts who incorporate ESG risks should improve risk assessment without imposing systematic return cost. Some evidence that integration improves analyst accuracy in predicting earnings surprises at high-ESG-risk companies.
Best-in-class / ESG tilts: Evidence mixed. Portfolios overweighting high-ESG-score companies have performed comparably to broad market indices over long periods — but with significant variation. Arabesque S-Ray and MSCI ESG Leaders index comparisons show roughly market-matching performance with slightly lower volatility.
Exclusionary screening: Clearest theoretical performance cost in liquid markets. Excluding sectors (fossil fuels, tobacco, defense) reduces diversification and can create significant sector tilts. In periods when excluded sectors outperform (2022 fossil fuels), exclusionary strategies underperform significantly.
Thematic ESG (clean energy, water, sustainability): High-volatility strategies with sharp sector cycles. Clean energy ETFs (ICLN, QCLN) dramatically outperformed in 2020 and underperformed in 2022–2023 — reflecting investor theme cycles more than fundamental ESG quality.
Impact investing: Intentionally accepts below-market returns in some impact strategies; aims for market or above-market returns in others. Performance comparison requires comparing within return-expectation category.
The 2022 ESG Reality Check
2022 provided the clearest evidence of ESG performance variability:
Energy sector surge: Russia's invasion of Ukraine triggered massive fossil fuel price increases. Energy stocks — systematically underweighted in ESG portfolios due to climate exclusions — were among the strongest performers in 2022. S&P 500 Energy sector returned +65.7% in 2022.
ESG fund underperformance: Most ESG equity funds meaningfully underperformed in 2022 relative to conventional benchmarks, primarily due to energy underweighting and tech overweighting (tech was among the weakest 2022 sectors).
Investor reaction: Some ESG skeptics cited 2022 underperformance as evidence of the cost of ESG. ESG proponents argued 2022 was an anomalous geopolitical event and that excluding fossil fuels remains financially rational over a longer-term energy transition horizon.
Honest interpretation: Both perspectives contain truth. ESG portfolios systematically underweight fossil fuels; fossil fuels can outperform for extended periods; this creates real performance drag in those periods. Whether the long-term energy transition thesis justifies the short-term fossil fuel underweight is a judgment call, not a certainty.
Risk-Adjusted Performance: The More Consistent Finding
The ESG performance finding that holds most consistently across studies:
ESG portfolios — particularly ESG-integrated and best-in-class strategies — exhibit lower risk (lower volatility, lower downside risk, shallower drawdowns in crisis periods) relative to conventional benchmarks. When performance is evaluated on a risk-adjusted basis (Sharpe ratio, Sortino ratio), ESG strategies often compare favorably even when absolute return is similar.
COVID crash (March 2020): ESG portfolios demonstrated meaningful downside protection compared to conventional benchmarks — an important data point for the ESG-as-risk-management thesis.
2008-2009 crisis: High-governance-score companies showed lower drawdowns in the financial crisis — consistent with governance quality as a risk indicator.
This lower-risk finding is more robust than the return finding, and arguably more relevant for long-term institutional investors whose primary concern is avoiding catastrophic drawdowns.
Common Mistakes
Treating a single time period's ESG performance as definitive. ESG performance varies with market regimes. Any single period — good (2020-2021) or bad (2022) — tells us more about that period's market dynamics than about ESG investing's long-run characteristics.
Comparing ESG funds to broad market benchmarks without adjusting for sector and factor differences. ESG funds have systematic sector tilts (underweight fossil fuels, overweight tech) and factor tilts (quality, low-vol). Without controlling for these, you are measuring sector rotation not ESG.
Conflating ESG investing performance with impact investing returns. Impact investing often accepts concessional returns for impact. Comparing impact fund returns to equity market benchmarks is a category error.
Related Concepts
Summary
The ESG investing performance debate produces genuinely mixed evidence because ESG investing is not a single strategy — it encompasses integration, best-in-class selection, exclusionary screening, thematic, and impact approaches with fundamentally different performance characteristics. Academic meta-analysis finds predominantly neutral-to-positive ESG-financial relationships, but this aggregate finding conceals enormous strategy and time-period variation. The most defensible findings: ESG integration need not impose meaningful risk-adjusted return cost; exclusionary screening creates systematic sector concentration; ESG strategies tend to show lower downside risk than conventional benchmarks in crisis periods. The 2022 fossil fuel surge provided a concrete demonstration of ESG performance variability, while COVID-2020 demonstrated ESG's downside protection thesis. Investors should evaluate ESG performance with appropriate benchmark, factor controls, and multi-period perspective rather than relying on single-period outcomes.