In this repo, I build a LogisticRegression prediction model with Dask and PySpark and initialize an AWS EMR cluster to run the entire pipeline.
Updated May 30, 2021 - Python
Asynchronous API using Dask and AWS Fargate
Collection of machine learning algorithms ...
NY City Taxi Analysis using Dask
A demo of Dask, a library for Big Data processing in Python
A project using the National Library of Medicine's Semantic Medline Database to create a graphical-relational database.
Script for configuring and installing the requirements of a Dask Distributed worker
Distributed solution for Traveling Salesman Problem using Dask.distributed and OR-Tools
dask-ecs-lib is a Python library that effortlessly spins up a Dask cluster on AWS ECS using Fargate, allowing you to seamlessly execute and parallelize your functions.
Testing PyCaret, Fugue, and Dask
User documentation website for the Sulis tier 2 HPC service. Built using Jekyll.
Testing access performance of Sentinel-1 RTC metadata catalogs
Code for fetching, sampling, and analysis of NYC taxi data from TLC and Uber for 2009-2018
A custom dask remote jobqueue for HTCondor.
Preserve all necessary runtime data of a Dask client in order to "replay" and analyze the performance and behavior of the client after the fact
Python 3 tools for distributed analysis and visualisation of big climate data on HPC systems.