At BUILD 2015, Microsoft announced Azure Data Lake, a new cloud-based repository for big data analytics workloads. Today, Microsoft is expanding the service with Azure Data Lake Store, Azure Data Lake Analytics and Azure HDInsight. These three additions are intended to make Azure Data Lake more accessible and easier to use.
- Azure Data Lake Store, previously announced as Azure Data Lake, will be available in preview later this year. The Data Lake Store provides a single repository where you can easily capture data of any size, type and speed without forcing changes to your application as data scales. In the store, data can be securely shared for collaboration and is accessible for processing and analytics from HDFS applications and tools.
- Azure Data Lake Analytics, a new service built on Apache YARN that dynamically scales so you can focus on your business goals, not on distributed infrastructure. This service will be available in preview later this year and includes U-SQL, a language that unifies the benefits of SQL with the expressive power of user code. U-SQL’s scalable distributed query capability enables you to efficiently analyze data in the store and across SQL Servers in Azure, Azure SQL Database and Azure SQL Data Warehouse.
- Azure HDInsight, Microsoft's fully managed Apache Hadoop cluster service with a broad range of open source analytics engines including Hive, Spark, HBase and Storm. Today, Microsoft is announcing general availability of managed clusters on Linux with an industry-leading 99.9% uptime SLA. HDInsight will be able to take advantage of capabilities in the Data Lake Store for increased throughput, scale and security.
These announcements and more will be covered at Microsoft's AzureCon 2015. The event will be available for viewing online and will also offer live virtual Q&A sessions, technical sessions and more. AzureCon will start tomorrow at 9am PST.