Рет қаралды 98
Apache Spark is the de-facto standard for large-scale data processing. Data security is of paramount importance when running Spark in an enterprise environment. In this talk, we present a comprehensive data security solution for a Spark-based batch processing service on Apache Ranger.
By integrating Ranger with Spark, administrators can define and enforce data authorization policies based on context and operational needs. Ranger supports various authorization methods such as role-based access control and attribute-based access control. Ranger Hive Authorizers enables the administrator to easily enforce fine-grained access control for Hive tables. Moreover, Ranger also provides great flexibility by allowing custom data resources and access types.Ranger can also be integrated with open source Batch Processing Gateway project to provide authorization of various operations at the queue level, offering superior control for admins to prevent unintended usage of compute resources.
Slides: apachecon.com/...