Join us for an open discussion on Apache Iceberg in Gurgaon on 18th Jan.

Resources: Blogs, Events, and more.

A library of technical deep dives, customer case studies, events, and more on data engineering topics from our team and the community.

#Engineering

e6data raises $10m to dismantle lock-in from data intelligence vendors on analytics and AI by halving costs and increasing performance 5x

e6data's architecture diagram: where it fits in existing data stacks.
Lakehouse Days
Lakehouse Days: January 2025
Apache Iceberg: understanding the internals, performance, and future
An inside look at metadata evolution before and after compaction in Apache Iceberg
Under the Hood of Apache Iceberg: An Inside Look at the Metadata Evolution After Compaction

By Karthic Rao and Shreyas Mishra on 03 Jan 2025

The growing relevance of low-level metadata catalogs in data lakehouses
From Hidden to Hero: Low-Level Technical Metadata Catalogs’ Relevance for the Future of Lakehouses

By Vishnu Vasanth and Karthic Rao on 26 Dec 2024

Lakehouse Days
Lakehouse Days: Dec 2024
Apache Iceberg: understanding the internals, performance, and future
A surreal scene of a lakehouse surrounded by icebergs, with people actively sorting large data on wooden tables.
Enhancing Query Performance in the Apache Iceberg: A Hands-On Guide to Sorting Within Partitions

By Karthic Rao and Shreyas Mishra on 27 Nov 2024

Using links, guides, and resources to optimize lakehouse tables
The Ultimate Resource Hub for Optimizing Iceberg Tables

By Karthic Rao and Fredson Lewis on 22 Nov 2024

Using Apache Iceberg’s advanced snapshot creation and time travel features for managing heavy workloads
A Hands-On Guide to Snapshots and Time Travel in Apache Iceberg

By Karthic Rao and Fenil Jain on 15 Nov 2024

Data Analysts using Lakehouses to analyse large volumes of data
The Lakehouse Evolution: Making Object Storage the Backbone of Modern Analytics

By Karthic Rao and Vishnu Vasanth on 12 Nov 2024

Setting up and managing Hive Metastore for Lakehouses
A Comprehensive Guide for Managing Permissions in Hive Metastore for Lakehouses

By Karthic Rao on 07 Nov 2024

Shuffling data blocks on Iceberg Lakehouses
The Shuffle Series: Part 1: Understanding the Shuffle Problem in Distributed Computing

By Karthic Rao and Sweta Singh on 05 Nov 2024

Online Summit Facebook Event
Lakehouse Views: July 2024
This edition takes a deep dive into recent developments in real-time streaming architecture, focusing on Kafka, Redis, data caching mechanisms, and the governance around them.
Lakehouse Days: August 2024
Open table formats: Apache Iceberg, Delta, and Apache Hudi
Lakehouse Days: September Edition || Practice PySpark, SQL, and DSA problems with us
Events