Apache Hadoop 3.0 now generally available on Microsoft Azure HDInsight

 

As of today, Apache Hadoop 3.0 is generally available in Microsoft Azure HDInsight. The release brings many security and performance improvements; we’d like to highlight four of the most important:

 

Security

 

For those concerned with GDPR or other privacy compliance requirements in big data applications, Apache Hive 3.0 includes two key security improvements: ACID transactions are on by default, and developers can build traditional database applications on data lakes.

 

Developers can now encrypt Azure Managed Disks using their own encryption keys, thanks to Bring Your Own Key support for Apache Kafka.

 

Performance

 

Apache HBase 2.0 improves the frequency of data flushing in remote cloud storage, while Apache Phoenix 5.0 improves query visibility by capturing information about queries run against a cluster.

 

.NET, Python, and Java SDK availability is expanded; developers can now use whatever language they prefer when managing clusters.

 

To read more about what Hadoop 3.0 brings to Azure HDInsight, including additional broad ecosystem improvements, check out the Microsoft Azure blog.