🗺️Guide: Synapse Spark ✨ – Getting Started

Pls feel free to share with others and spread the word.

Step 1 watch this video

Step 2 skim through these slides for more context:

  1. Coming soon…

The rest is all hands-on stuff – if you get stuck at any point lmk.

Step 3 do this learning path:

Perform data engineering with Azure Synapse Apache Spark Pools – Learn | Microsoft Docs

Step 4 do these 4 Spark tutorials

Tutorial: Create Apache Spark job definition in Synapse Studio – Azure Synapse Analytics | Microsoft Docs

Step 5 do the ML tutorials.

At the very least, I strongly recommend doing this one. Visualize data with Apache Spark – Azure Synapse Analytics | Microsoft Docs

Step 6 do the notebooks in the knowledge center. Note: I’d stick to Scala and PySpark to keep it simple.

Open a synapse workspace >  Home > Knowledge center > Browse gallery > click Notebooks (Go wild!)

These are very popular:

  1. Boston house price prediction
  2. Getting Started with Delta Lake

By this point you’d be a Synapse Spark rockstar, ready to migrate any ADB notebook.

Additional Resources:

  1. https://github.com/Azure/Test-Drive-Azure-Synapse-with-a-1-click-POC
  2. https://techcommunity.microsoft.com/t5/azure-synapse-analytics/azure-synapse-analytics-october-update/ba-p/2875372
  3. https://docs.microsoft.com/en-us/azure/synapse-analytics/spark/microsoft-spark-utilities
  4. https://github.com/Azure-Samples/Synapse/tree/main/Notebooks

Thanks,

Hiram

One thought on “🗺️Guide: Synapse Spark ✨ – Getting Started

Leave a Reply

Please log in using one of these methods to post your comment:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.