New Feature
6 days ago

Dataset Versioning & Schema Diff (Hive/Delta)

ILUM now tracks version history for datasets and shows field-level diffs so you can audit changes and catch breaking schema updates quickly.
What’s new
  • Version history timeline with event labels (e.g., CREATE, DROP) and timestamps.
  • Schema change summary per version: counts of additions / removals / modifications.
  • Field-level diff chips (e.g., FIELD REMOVED product_id (long), FIELD ADDED price (long)).
  • Format & engine tags surfaced with each version (e.g., PARQUET, DELTA).
  • Compare action to diff any two versions side-by-side.
  • Works for tables registered in the Hive Metastore (including Delta Lake tables).
Why it’s useful
  • Immediate visibility into breaking changes before downstream jobs run.
  • Clean audit trail for governance reviews and incident retros.
  • Faster root-cause analysis when KPIs or pipelines regress after a change.
Where to find it
Open a dataset → Versions tab.