Latency Tier
The latency tier of a trader refers to the speed at which market information reaches them and orders are executed. Latency is measured in microseconds (μs), milliseconds (ms), and seconds. High-frequency traders operate in the 10–1,000 microsecond range; institutions typically experience 1–100 millisecond latencies; retail investors may see second-level delays. Reducing latency by even microseconds can mean the difference between profit and loss.
This entry is about speed tiers in trading. For the infrastructure enabling it, see colocation; for the data feeds supporting it, see direct market data feed.
Latency components
Total latency has multiple components:
- Market latency: Time for the exchange to execute an order (typically 1–10 microseconds for orders that can be matched immediately).
- Network latency: Time for data to travel from the exchange to the trader’s system (typically 0.1–10 milliseconds).
- Processing latency: Time for the trader’s algorithm to process data and generate an order (typically 0.1–10 milliseconds).
- Transmission latency: Time to send the order from the trader’s system back to the exchange (typically 0.1–10 milliseconds).
Total end-to-end latency might be 1–100 milliseconds or more for most traders.
Latency tiers and trading styles
Ultra-low latency (10–100 microseconds): High-frequency traders. Infrastructure at this level is expensive (servers co-located at exchanges, specialized networking, custom code). Profits come from microsecond-level advantages.
Low latency (100–1,000 microseconds = 0.1–1 millisecond): Sophisticated institutions and algorithmic traders. Achievable with quality infrastructure and direct exchange connections.
Medium latency (1–10 milliseconds): Active traders and institutions without ultra-low-latency infrastructure. This is typical for most institutional traders.
High latency (10–1,000 milliseconds = 0.01–1 second): Retail traders using standard brokers. Brokers themselves may route orders quickly, but the retail client’s connection may add delays.
Very high latency (1–15+ seconds): Delayed data users. Free market data often has a 15-minute delay; some reports are even older.
Latency arbitrage
Latency differences create trading opportunities:
Example: A stock trades at $100 on the NYSE and $100.05 on NASDAQ. A high-frequency trader with direct feeds from both venues might:
- See the price discrepancy (1 millisecond).
- Buy on NYSE at $100.00 (2 microseconds).
- Sell on NASDAQ at $100.05 (2 microseconds).
- Profit $0.05 per share (say, 1,000 shares = $50).
A retail investor with 1-second latency cannot execute this arbitrage; by the time they see the prices, they have moved.
Controversy and fairness
Latency advantages are controversial:
Arguments in favor:
- Speed encourages market-making and liquidity.
- Technology investment is rewarded.
- Information is still ultimately available to all.
Arguments against:
- Creates information asymmetry; fast traders see prices before slow ones.
- Enables predatory strategies (front-running, spoofing).
- Disadvantages retail investors who cannot afford low-latency infrastructure.
- Contributes to flash crashes and volatility.
Infrastructure costs
Achieving lower latency requires expensive infrastructure:
- Servers and software: Custom-built systems optimized for speed. Cost: $100,000–$1,000,000+.
- Co-location: Placing servers at exchange data centers to minimize network latency. Cost: $10,000–$100,000+ per year.
- Direct feeds: Subscribing to exchange market data feeds. Cost: $1,000–$10,000+ per month.
- Network: High-quality, dedicated network connections. Cost: $10,000–$100,000+ per year.
- Total system cost: A full high-frequency trading setup: $1–$10 million or more.
This cost structure means that ultra-low-latency trading is accessible only to well-capitalized firms.
Measures of latency
Absolute latency: Total time from market event to trader decision (e.g., 5 milliseconds).
Relative latency: Latency difference between two traders or systems (e.g., “Firm A has 2ms advantage over Firm B”).
Latency distribution: Typical latencies vary; traders care about the worst-case (tail) latencies as much as average.
Regulation and latency
Regulators have discussed whether to mandate uniform latency or slow down fast traders:
- No direct regulation. The SEC has not mandated specific latency targets.
- Indirect regulation. SIP data is slower than direct feeds, but this is not a formal constraint.
- Proposed solutions. Some propose speed bumps (artificial delays to eliminate sub-microsecond advantages) or faster SIPs.
However, no major regulatory changes have materialized.
Latency and market stability
During flash crashes and volatile episodes, latency differences can exacerbate volatility:
- Fast traders see cascading bids disappearing and hit bids aggressively.
- Slow traders are still seeing outdated prices.
- By the time slow traders’ orders execute, prices have moved sharply.
This can lead to disorderly markets and execution at irrational prices.
See also
Closely related
- Colocation — enables low latency
- Direct market data feed — low-latency data source
- High-frequency trading — depends on low latency
- SIP — slower than direct feeds
- Order routing — affected by latency
Wider context
- Market microstructure — latency shapes this
- Price discovery — affected by latency differences
- Liquidity — provided by low-latency traders
- Volatility — exacerbated by latency disparities
- Fair execution — latency creates fairness concerns