Give feedback and suggest new ideas for Ilum.

All entries New Improved Fixed Release Feature

Release

Version 6.3.1 Live ! 🎉😊

This release focuses on quality-of-life improvements, platform integrations, and powerful new features to help you work smarter with Apache Spark and your data lakehouse stack.

🚀 What’s New

✅ Trino now in AIO Stack

You asked, we listened, Trino is now included in the default All-in-One (AIO) Ilum deployment. Run blazing-fast distributed SQL queries across your data lake right out of the box.

✅ New Spark Session Form for Jupyter (Sparkmagic)

No more hand-crafted JSON. Create Spark sessions in Jupyter using a streamlined form:

Select cluster
Define memory & dynamic allocation
Add Python packages
Enable auto-pause on idle
This is our most user-friendly Spark session setup yet—and we’ll continue improving it!

✅ Markdown Support in SQL Editor

Turn your SQL editor into a lightweight data notebook. Add headers, notes, and explanations with Markdown for clear, documented analysis in-line with your queries.

✅ Resource Quotas & Limits for Spark Jobs

Gain fine-grained control over Spark resources:

Set CPU/memory limits per job
Apply namespace-level quotas
Prevent overuse and ensure team fairness across shared clusters

✅ Grafana in UI Modules

Monitor your workloads in real time. Grafana dashboards are now integrated into Ilum’s UI, providing insights into Spark usage, resource consumption, and system health.

✅ Extra Buckets for Spark Storage

Now you can define additional storage buckets to separate workloads, support multi-tenant setups, or expand available storage for Spark jobs.

🛠️ Under the Hood

📦 MinIO Chart Updated — Now using ^15.x version for better performance and Helm compatibility
🧪 Jupyter SessionConfig Adjustments — Minor tweaks to improve startup behavior and flexibility

This is a minor release, but it brings major improvements to everyday workflows. We’re continuously evolving Ilum to support modern data teams—from infrastructure to analysis.

Thanks for being part of the journey. More coming soon! 🚀💡

Release

5 months ago

Version 6.3.0 Live ! 🎉😊

Ilum 6.3.0 is here, a robust update designed to supercharge your data workflows and elevate your Spark experience. This release brings a host of new features, performance enhancements, and essential integrations that empower advanced data processing and governance for Spark users.

New Features in This Release

1. Introducing Gitea Integration: Enhancing Version Control Inside Ilum
We’re excited to announce that Gitea—a lightweight, self-hosted Git repository manager—is now built into Ilum. Manage version control for Jupyter notebooks, Spark job scripts, Airflow DAGs, and more, all without leaving the platform. This seamless integration simplifies collaboration for data engineers, data scientists, and DevOps teams.

2. Internal Security System Improved
Our new internal security system strengthens authorization and authentication. User data is securely stored in a dedicated database, with Role-Based Access Control (RBAC) enabling you to organize users into roles and groups, ensuring every action is properly permissioned.

3. Enhanced SQL UI with Notebook-Style Execution
Experience a more interactive SQL Editor that now supports notebook-style operations. Write and execute SQL queries in structured cells—just like in Jupyter Notebook—so you can explore and analyze big data more efficiently.

Other Improvements

Changed ilum-ui Service Type:
Due to issues with kubectl port-forward, we now expose a NodePort by default.
Security Configuration Enhancements:
Security-related settings in ilum-core have been moved from the config map to a dedicated Kubernetes Secret, enhancing the protection of sensitive data.
Ilum Submit Configurations for Spark SQL Engines:
Launch Spark SQL engines via the Ilum Web Application or JDBC endpoint with automatic configuration application from the selected cluster—eliminating manual configuration steps.
Embedded Git Repository:
With Gitea integrated as a module, Ilum now provides a built-in Git server for seamless version control.
Kafka Address Configuration for ilum-core:
Added the ability to set a dedicated Kafka address for the ilum-core pod, separate from the global Kafka configuration for Spark jobs.

Happy Data Processing! 🚀✨

Release

6 months ago

Version 6.2.1 Live ! 🎉😊

Ilum 6.2.1 is a robust update that will supercharge your data workflows and enhance your experience with Spark.

We’re excited to bring you one of our strongest releases to date, with new features, performance enhancements, and essential integrations that enable advanced data processing and governance for Spark users.

SQL API/UI Section: Making Data Easier with Ilum.
Initiate a New Era of Querying Data! Execute SQL queries directly on your cluster! Full SQL dialect support allows you to work natively with any modern open formats like Delta Lake, Apache Iceberg, and Apache Hudi. Also, you can also use multiple engines SparkSQL, Trino(Comming) and Apache Flink(Comming).

Advanced Spark Metrics Integration.
With enhanced metrics, you can analyze your spark jobs better. Check the resources, jobs, and health of running applications quickly and troubleshoot the issues quickly. Whether you're a data engineer or a developer or a scientist, you can drive efficiency with these metrics.

Auto-Scaling for Interactive Sessions.
Efficient Resource Utilization Is The Key! Inactive interactive jobs will automatically scale down to zero after inactivity, freeing up resources for other jobs. When the activity resumes, the sessions smoothly scale back up by themselves.

Automated Spark job scheduling with cron
No need to trigger jobs by hand! With our new cron-based scheduling, you can automate your Spark job executions using a familiar cron-like syntax. Our system is integrated with Kubernetes so that your jobs run on time with automated retries and notifications.

Integration with Superset.
Get better visualisation and analytics through Superset integration. You can check the insights in your data by using Analytics, and you can share them too.

Spark Memory Settings Section.
Customize Spark's settings for memory for better performance tweaks. Adjust Spark memory settings for better resource use and improved job performance.

Integration with MLflow.
Speed up your machine learning! Using MLflow Integration, manage your ML experiments, track result and help streamline model deployment within Ilum.

Prepare to take your data journey to the next level with Ilum 6.2.1 – enhancing your analytics, Spark job management, and data interactions like never before. Get the latest version and see!

Happy Data Processing! 🚀✨

Release

over 1 year ago

Version 6.1.0

Data Lineage Support Added: Enhanced data management with new lineage tracking capabilities.
Base Storage Support for HDFS/Azure/GCS: Expanded storage options to include HDFS, Azure Blob Storage, and Google Cloud Storage.
LDAP/OAuth2 Support: Strengthened security with added support for LDAP and OAuth2 authentication methods.
YARN Web UI Access: Improved monitoring and management with access to the YARN web user interface.
Prometheus Metrics in Spark Jobs: Enabled Prometheus metrics for detailed monitoring within Spark jobs.
Hadoop Config in Config Maps: Simplified configuration by allowing Hadoop settings to be passed to Spark jobs via Kubernetes config maps.
Cluster Storages Spark Config to Spark Jobs: Streamlined job configuration with the ability to pass cluster storage settings directly to Spark jobs.
Detecting Spark Driver Pod Failure: Increased reliability with new mechanisms to detect Spark driver pod failures.
Fetching Metrics from the History Server: Enhanced analytics and troubleshooting with the ability to fetch metrics from the Spark history server.
Tags for Jobs and Groups: Improved organization and management with tagging capabilities for Jobs and Groups.
Exposing YARN Jobs for Ilum's Prometheus: Expanded monitoring options by making YARN jobs visible to Ilum's Prometheus setup.
Tons of Small Improvements: Implemented numerous minor enhancements across the platform to boost performance and user experience.

Release

almost 2 years ago

Version 6.0.0

UI v3
Embedded Jupyter Notebook
Embedded Airflow
Encapsulate the spark-submit processes within separate kubernetes pod
Tons of small improvements

https://www.youtube.com/watch?v=6d27js-FNHU

Release

almost 2 years ago

Version 5.2.0

Python spark jobs support
Airflow integration
Ilum python package available in pypi
Ilum helm package available in artifacthub
Spark update to 3.4.1

Release

over 2 years ago

Version 5.1.0

Now featuring an accessible Spark UI link for actively running jobs, offering seamless monitoring and management.
Fully compatible with Ilum REST API v1.1, enabling advanced functionality.
Streamlined license attachment during Helm installation, eliminating the need for manual intervention.
Enhanced data filtering capabilities with the ability to list all existing spark cluster names.
Added a robust health check endpoint for the Ilum service, ensuring system stability and reliability.
Refined user interface with subtle yet impactful improvements, elevating the overall user experience.

Release

over 2 years ago

Version 5.0.2

Enhanced user interface with subtle refinements for an improved user experience.
Extended ilum-job-api to maintain seamless backward compatibility with Java 8, ensuring support for legacy systems.

Release

over 2 years ago

Version 5.0.1

Introduced versatile, multi-platform Docker images for ilum-core, expanding compatibility and usability across various systems.
Revitalized Bitnami Helm repository with essential updates to dependencies, ensuring seamless access and functionality.
Implemented subtle user interface enhancements for a more polished and user-friendly experience.

Release

over 2 years ago

Version 5.0 Live NOW! 🎉😊

Introducing Ilum 5.0.

We're excited to share with you one of our biggest updates yet, packed with our most popular feature request, significant performance improvements, a revamped UI, plus plenty of additional improvements, tweaks and fixes. With this update, you can expect to see an enhanced user experience and improved performance.

Watch the video!

Some of the new features:

Apache Zeppelin integration
Jupyter integration
New UI
OpenAPI support
Introduced the ability to configure gRPC communication, enabling more efficient and flexible data exchange.
Transitioned the default Spark Docker image to docker.ilum.cloud/ilum-spark:3.3.0, providing users with the latest and most optimized version.
Shifted the default communication mode to gRPC, emphasizing speed and performance.
Expanded offerings with the addition of Helm charts for Ilum Livy Proxy, Jupyter SparkMagic, and Zeppelin, streamlining deployment and management processes.
Introduced Ilum Table Format, a cutting-edge storage format designed for efficient data processing and optimized query performance, bringing innovation and enhanced capabilities to the platform.