Trends is free while in Beta
296%
(5y)
73%
(1y)
27%
(3mo)

About Apache Iceberg

Apache Iceberg is a high performance open table format for large analytic datasets in data lakes, enabling reliable schema evolution, hidden partitioning, and ACID like guarantees at scale.

Trend Decomposition

Trend Decomposition

Trigger: Adoption of scalable, ACID compliant table formats in data lakes to improve reliability and performance for analytics workloads.

Behavior change: Companies standardize on Iceberg as the canonical table format across data pipelines and BI tooling.

Enabler: Open specification, strong ecosystem integrations (Spark, Flink, Hive), and cloud provider support across data lake services.

Constraint removed: Fragmented table formats and brittle partitioning strategies are replaced with a unified, evolvable schema and consistent semantics.

PESTLE Analysis

PESTLE Analysis

Political: Increased emphasis on data governance and interoperability across cloud providers and open source ecosystems.

Economic: Reduced data warehouse costs via scalable data lake architectures and improved query performance lowers total cost of ownership.

Social: Greater collaboration between data teams and analytics consumers due to standardized data access patterns.

Technological: Mature open table format with strong runtime support across engines and cloud services.

Legal: Data governance and compliance requirements drive adoption of auditable, consistent data formats.

Environmental: Potential reductions in data movement and storage redundancy through more efficient querying over unified tables.

Jobs to be done framework

Jobs to be done framework

What problem does this trend help solve?

Provides reliable, scalable, and ACID like data lake tables for analytics.

What workaround existed before?

Fragmented formats and bespoke pipelines that struggled with schema evolution and partitioning.

What outcome matters most?

Speed, reliability, and cost efficient analytics at scale.

Consumer Trend canvas

Consumer Trend canvas

Basic Need: Reliable data lake table semantics for analytics.

Drivers of Change: Need for scalable, open formats; cross engine compatibility; cloud data lake growth.

Emerging Consumer Needs: Faster, more predictable analytics; easier data governance.

New Consumer Expectations: Schema evolution without downtime; strong consistency guarantees.

Inspirations / Signals: Community momentum around Iceberg; vendor and cloud support announcements.

Innovations Emerging: Enhanced streaming/batch querying, time travel, and lineage features.

Companies to watch

Associated Companies
  • Netflix - Early adopters and contributors to the Iceberg ecosystem; use Iceberg for scalable analytics.
  • Expedia Group - Uses Iceberg in data pipelines to manage large analytics datasets across brands.
  • Alibaba Cloud - Supports Iceberg within its data lake offerings and related analytics tooling.