Skip to content

Comments

Docs: Add blog post about File Format API#15380

Open
pvary wants to merge 7 commits intoapache:mainfrom
pvary:ff_api_blog
Open

Docs: Add blog post about File Format API#15380
pvary wants to merge 7 commits intoapache:mainfrom
pvary:ff_api_blog

Conversation

@pvary
Copy link
Contributor

@pvary pvary commented Feb 20, 2026

No description provided.

@github-actions github-actions bot added the docs label Feb 20, 2026
@pvary pvary requested a review from kevinjqliu February 20, 2026 12:28
Copy link
Contributor

@kevinjqliu kevinjqliu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! mostly nit comments


## Conclusion

Finalizing the File Format API marks a major milestone for Apache Iceberg. It enables clean extensibility, encourages innovation across file formats, and prepares Iceberg for the next generation of analytics and AI workloads. Work on the TCK and Vortex integration is already underway, and we are excited to see what the community builds next.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is there a tracking issues we can link here?

categories:
- announcement
---

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we should probably add a sentence about how the API will be included in the upcoming 1.11 release

---
date: 2026-02-20
title: Finalizing the Apache Iceberg File Format API
slug: apache-iceberg-file-format-api-finalization
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
slug: apache-iceberg-file-format-api-finalization
slug: apache-iceberg-file-format-api


The community welcomes all contributors. You can help by testing integrations, participating in TCK development, or experimenting with new formats.

## Conclusion
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i like this video explanation, maybe we can include it in the article somehow, https://www.youtube.com/watch?v=aERM4rjBDhY

Copy link
Contributor

@singhpk234 singhpk234 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice write up @pvary !

Comment on lines +102 to +103
### **2. Column Families**
The API enables vertically split storage layouts—column families—which support:
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we have a proposal for this already ?


The Technology Compatibility Kit (TCK) is one of the most important next steps. It will validate correctness, semantics, feature completeness, type support, and compatibility for new format implementations.

## Next Steps
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we wanna mention comet too ?


## What the File Format API Provides

The File Format API introduces a well‑defined, pluggable interface for integrating new formats into Iceberg. It allows engines to interact with formats through a standardized set of builders and metadata structures.
Copy link
Contributor

@singhpk234 singhpk234 Feb 20, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

optional :

Suggested change
The File Format API introduces a well‑defined, pluggable interface for integrating new formats into Iceberg. It allows engines to interact with formats through a standardized set of builders and metadata structures.
The File Format API introduces a well‑defined, pluggable interface for integrating new data file formats into Iceberg. It allows engines to interact with formats through a standardized set of builders and metadata structures.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i think its for both data and delete files

pvary and others added 2 commits February 21, 2026 09:57
Co-authored-by: Kevin Liu <kevinjqliu@users.noreply.github.com>
Co-authored-by: Kevin Liu <kevinjqliu@users.noreply.github.com>
@pvary pvary self-assigned this Feb 21, 2026
pvary and others added 4 commits February 21, 2026 10:07
Co-authored-by: Kevin Liu <kevinjqliu@users.noreply.github.com>
Co-authored-by: Kevin Liu <kevinjqliu@users.noreply.github.com>
Co-authored-by: Kevin Liu <kevinjqliu@users.noreply.github.com>
Co-authored-by: Prashant Singh <35593236+singhpk234@users.noreply.github.com>
Copy link
Contributor

@kevinjqliu kevinjqliu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

I noticed you accepted the change for the title to "Introducing the Apache Iceberg File Format API". Do you want to accept the change for the url slug to remove the word "finalization"?
All my other comments are nits. feel free to ignore.

Heres a local rendering:
Screenshot 2026-02-21 at 8 57 46 AM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants