Version 6.3.0 Live ! 🎉😊

Ilum 6.3.0 is here, a robust update designed to supercharge your data workflows and elevate your Spark experience. This release brings a host of new features, performance enhancements, and essential integrations that empower advanced data processing and governance for Spark users.

New Features in This Release
1. Introducing Gitea Integration: Enhancing Version Control Inside Ilum
We’re excited to announce that Gitea—a lightweight, self-hosted Git repository manager—is now built into Ilum. Manage version control for Jupyter notebooks, Spark job scripts, Airflow DAGs, and more, all without leaving the platform. This seamless integration simplifies collaboration for data engineers, data scientists, and DevOps teams.
2. Internal Security System Improved
Our new internal security system strengthens authorization and authentication. User data is securely stored in a dedicated database, with Role-Based Access Control (RBAC) enabling you to organize users into roles and groups, ensuring every action is properly permissioned.
3. Enhanced SQL UI with Notebook-Style Execution
Experience a more interactive SQL Editor that now supports notebook-style operations. Write and execute SQL queries in structured cells—just like in Jupyter Notebook—so you can explore and analyze big data more efficiently.

Other Improvements
  • Changed ilum-ui Service Type:
    Due to issues with kubectl port-forward, we now expose a NodePort by default.
  • Security Configuration Enhancements:
    Security-related settings in ilum-core have been moved from the config map to a dedicated Kubernetes Secret, enhancing the protection of sensitive data.
  • Ilum Submit Configurations for Spark SQL Engines:
    Launch Spark SQL engines via the Ilum Web Application or JDBC endpoint with automatic configuration application from the selected cluster—eliminating manual configuration steps.
  • Embedded Git Repository:
    With Gitea integrated as a module, Ilum now provides a built-in Git server for seamless version control.
  • Kafka Address Configuration for ilum-core:
    Added the ability to set a dedicated Kafka address for the ilum-core pod, separate from the global Kafka configuration for Spark jobs.


    Happy Data Processing! 🚀✨

Version 6.2.1 Live ! 🎉😊

Ilum 6.2.1 is a robust update that will supercharge your data workflows and enhance your experience with Spark.

We’re excited to bring you one of our strongest releases to date, with new features, performance enhancements, and essential integrations that enable advanced data processing and governance for Spark users. 

SQL API/UI Section: Making Data Easier with Ilum.
Initiate a New Era of Querying Data! Execute SQL queries directly on your cluster!  Full SQL dialect support allows you to work natively with any modern open formats like Delta Lake, Apache Iceberg, and Apache Hudi. Also, you can also use multiple engines SparkSQL, Trino(Comming) and Apache Flink(Comming).

Advanced Spark Metrics Integration.
With enhanced metrics, you can analyze your spark jobs better. Check the resources, jobs, and health of running applications quickly and troubleshoot the issues quickly. Whether you're a data engineer or a developer or a scientist, you can drive efficiency with these metrics.

Auto-Scaling for Interactive Sessions.
Efficient Resource Utilization Is The Key! Inactive interactive jobs will automatically scale down to zero after inactivity, freeing up resources for other jobs. When the activity resumes, the sessions smoothly scale back up by themselves.

Automated Spark job scheduling with cron
No need to trigger jobs by hand! With our new cron-based scheduling, you can automate your Spark job executions using a familiar cron-like syntax. Our system is integrated with Kubernetes so that your jobs run on time with automated retries and notifications.

Integration with Superset.
Get better visualisation and analytics through Superset integration. You can check the insights in your data by using Analytics, and you can share them too.

Spark Memory Settings Section.
Customize Spark's settings for memory for better performance tweaks. Adjust Spark memory settings for better resource use and improved job performance.

Integration with MLflow.
Speed up your machine learning! Using MLflow Integration, manage your ML experiments, track result and help streamline model deployment within Ilum.

Prepare to take your data journey to the next level with Ilum 6.2.1 – enhancing your analytics, Spark job management, and data interactions like never before. Get the latest version and see!

Happy Data Processing! 🚀✨

Version 6.1.0

  1. Data Lineage Support Added: Enhanced data management with new lineage tracking capabilities.
  2. Base Storage Support for HDFS/Azure/GCS: Expanded storage options to include HDFS, Azure Blob Storage, and Google Cloud Storage.
  3. LDAP/OAuth2 Support: Strengthened security with added support for LDAP and OAuth2 authentication methods.
  4. YARN Web UI Access: Improved monitoring and management with access to the YARN web user interface.
  5. Prometheus Metrics in Spark Jobs: Enabled Prometheus metrics for detailed monitoring within Spark jobs.
  6. Hadoop Config in Config Maps: Simplified configuration by allowing Hadoop settings to be passed to Spark jobs via Kubernetes config maps.
  7. Cluster Storages Spark Config to Spark Jobs: Streamlined job configuration with the ability to pass cluster storage settings directly to Spark jobs.
  8. Detecting Spark Driver Pod Failure: Increased reliability with new mechanisms to detect Spark driver pod failures.
  9. Fetching Metrics from the History Server: Enhanced analytics and troubleshooting with the ability to fetch metrics from the Spark history server.
  10. Tags for Jobs and Groups: Improved organization and management with tagging capabilities for Jobs and Groups.
  11. Exposing YARN Jobs for Ilum's Prometheus: Expanded monitoring options by making YARN jobs visible to Ilum's Prometheus setup.
  12. Tons of Small Improvements: Implemented numerous minor enhancements across the platform to boost performance and user experience.

Version 6.0.0

Version 5.2.0


Version 5.1.0


Version 5.0.2

  • Enhanced user interface with subtle refinements for an improved user experience.
  • Extended ilum-job-api to maintain seamless backward compatibility with Java 8, ensuring support for legacy systems.

Version 5.0.1

  • Introduced versatile, multi-platform Docker images for ilum-core, expanding compatibility and usability across various systems.
  • Revitalized Bitnami Helm repository with essential updates to dependencies, ensuring seamless access and functionality.
  • Implemented subtle user interface enhancements for a more polished and user-friendly experience.

Version 5.0 Live NOW! 🎉😊

Introducing Ilum 5.0.

We're excited to share with you one of our biggest updates yet, packed with our most popular feature request, significant performance improvements, a revamped UI, plus plenty of additional improvements, tweaks and fixes. With this update, you can expect to see an enhanced user experience and improved performance.

Watch the video!

Some of the new features:
  • Apache Zeppelin integration
  • Jupyter integration
  • New UI
  • OpenAPI support
  • Introduced the ability to configure gRPC communication, enabling more efficient and flexible data exchange.
  • Transitioned the default Spark Docker image to docker.ilum.cloud/ilum-spark:3.3.0, providing users with the latest and most optimized version.
  • Shifted the default communication mode to gRPC, emphasizing speed and performance.
  • Expanded offerings with the addition of Helm charts for Ilum Livy Proxy, Jupyter SparkMagic, and Zeppelin, streamlining deployment and management processes.
  • Introduced Ilum Table Format, a cutting-edge storage format designed for efficient data processing and optimized query performance, bringing innovation and enhanced capabilities to the platform.
Powered by FeedBear