The SQL Server and Azure SQL Connector for Apache Spark is now available on GitHub

Hamza Jawad, Neowin (@HamzaJawad98) · Jun 22, 2020 15:00 EDT

Last week, Microsoft announced the availability of the latest cumulative update (CU5) for SQL Server 2019, which focused on expanding the capabilities offered through Big Data Clusters. Alongside the other changes in that update, the Apache Spark Connector for Azure SQL and SQL Server was revealed to have been open-sourced under the Apache 2.0 license. Now, the connector has been made available on GitHub in the form of a V1 release.

Based on the Spark DataSourceV1 API and the SQL Server Bulk API, the Apache Spark Connector enables the use of transactional data in big data analytics. It lets Spark jobs use both on-premises and in-cloud SQL databases as a source of input data or as an output data sink. It can also run up to 15 times faster than the default JDBC connector, depending on the scenario.
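To illustrate the source and sink roles described above, here is a minimal PySpark-style sketch. It assumes the connector is already on the Spark classpath and registers under the format name given in the project's README, `com.microsoft.sqlserver.jdbc.spark`. The helper names (`connector_options`, `read_table`, `write_table`) and all connection values are hypothetical and used only for illustration.

```python
# Format name under which the connector registers with Spark
# (taken from the project's README; verify against the release you install).
CONNECTOR_FORMAT = "com.microsoft.sqlserver.jdbc.spark"


def connector_options(host, port, database, table, user, password):
    """Build the option map for one table. All values are placeholders."""
    return {
        "url": f"jdbc:sqlserver://{host}:{port};databaseName={database}",
        "dbtable": table,
        "user": user,
        "password": password,
    }


def read_table(spark, options):
    """Use a SQL database as the input source of a Spark job.

    `spark` is expected to be an existing SparkSession.
    """
    return spark.read.format(CONNECTOR_FORMAT).options(**options).load()


def write_table(df, options, mode="append"):
    """Use a SQL database as the output sink of a Spark job.

    `df` is expected to be a Spark DataFrame.
    """
    df.write.format(CONNECTOR_FORMAT).mode(mode).options(**options).save()
```

In a real job, something like `write_table(df, connector_options("localhost", 1433, "sales", "dbo.orders", "spark_user", "secret"))` would append the DataFrame's rows to the named table. The same option map can be passed to `read_table` to load that table back.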

The connector offers a number of other features as well, the most notable of which are:

Support for all Spark bindings (Scala, Python, R).

Basic authentication and Active Directory (AD) keytab support.

Reordered DataFrame write support.

Reliable connector support for single instance.

Further updates to the connector are already in the pipeline. The project and all its associated files are available on GitHub.
