You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This repo contains my learnings and practices Zepplin notebooks on Spark using Scala. All the notebooks in the repo can be used as template code for most of the ML algorithms and can be built upon it for more complex problems.
A collection of small projects exploring PySpark features and functionality including packages and modules, algorithms, and general data science techniques.
Pyspark serves as a Python interface to Apache Spark, enabling the execution of Python and SQL-like instructions for the manipulation and analysis of data within a distributed processing framework.