Microsoft Data Professionals' Forum

19 December 2023

By slowder on November 3, 2023; updated December 14, 2023

Transitioning your T-SQL skills to Spark SQL

This presentation is a crash course covering the basics of Spark SQL for the Microsoft SQL Server T-SQL developer.

Azure Databricks is a managed service which provides the latest versions of Apache Spark based upon open source libraries. Spin up clusters and build quickly in a fully managed environment with the global scale and availability of Microsoft Azure.

The Adventure Works database is provided as raw delimited files to transform. We will go over reading and writing files in popular file formats using PySpark, a Python-based wrapper for the Scala API. The real power of PySpark is the ability to read a file into a DataFrame and expose the contents of the file as a temporary view during processing. Optionally, the raw data files can be presented as tables in the Hive catalog. Once this abstraction is in place, all the SQL skills that you have built up over the years can be used to transform the views and tables in the Hive catalog into refined data in the data lake.
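As a rough illustration of the read, abstract, and transform flow described above, here is a minimal sketch and not the presenter's actual code. It assumes a Databricks notebook, where a `spark` session is already defined. The helper names, the comma delimiter, and the header row are all assumptions made for the example.

```python
# Minimal sketch of the flow: raw delimited file -> DataFrame -> temp view -> SQL -> data lake.
# The helper names below are hypothetical; only the PySpark calls are real API.

def register_csv_as_view(spark, path, view_name, sep=","):
    """Read a delimited file into a DataFrame and expose it as a temporary view."""
    df = (
        spark.read.format("csv")
        .option("header", "true")       # assumption: the first row holds column names
        .option("sep", sep)             # assumption: delimiter is passed in by the caller
        .option("inferSchema", "true")  # let Spark guess column types
        .load(path)
    )
    # After this call the file contents can be queried by name with Spark SQL.
    df.createOrReplaceTempView(view_name)
    return df


def refine_to_parquet(spark, query, out_path):
    """Run a Spark SQL query and write the result to the data lake as Parquet."""
    refined = spark.sql(query)
    refined.write.mode("overwrite").format("parquet").save(out_path)
    return refined
```

In a notebook, usage might look like `register_csv_as_view(spark, "/mnt/raw/orders.csv", "v_orders")` followed by `refine_to_parquet(spark, "SELECT CustomerID, COUNT(*) AS order_count FROM v_orders GROUP BY CustomerID ORDER BY order_count DESC LIMIT 10", "/mnt/refined/top_customers")`. The paths, view name, and column names are placeholders. Note that Spark SQL uses `LIMIT 10` where T-SQL would use `SELECT TOP 10`.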

This month we’re going remote. Join us on MS Teams at 6 pm EST.

John Miner

John Miner is a Senior Data Architect at Insight Digital Innovation, helping corporations solve their business needs with various data platform solutions.

He has over thirty years of data processing experience, and his architecture expertise encompasses all phases of the software project life cycle, including design, development, implementation, and maintenance of systems.

His credentials include undergraduate and graduate degrees in Computer Science from the University of Rhode Island. Also, he has earned certificates from Microsoft for Database Administration (MCDBA), System Administration (MCSA), Data Management & Analytics (MCSE) and Data Science (MPP).

John has been recognized with the Microsoft MVP award seven times for his outstanding contributions to the Data Platform community.

When he is not busy talking to local user groups or writing blog entries on new technology, he spends time with his wife and daughter enjoying outdoor activities. Some of John’s hobbies include woodworking projects, crafting a good beer, and playing a game of chess.
