ssulca/dipdata

dipdata

Diplodatos 2020

Diplomatura en Ciencia de Datos, Aprendizaje Automático y sus Aplicaciones (Diploma in Data Science, Machine Learning and Its Applications)


Table of contents

├── analisisvis
│   ├── data
│   └── notebooks
├── curacion
│   ├── input
│   ├── notebooks
│   ├── output
│   └── queries
├── distributed
│   ├── bin
│   ├── conf
│   ├── docs
│   ├── inputs
│   └── notebooks
├── doc
├── introml
│   └── notebooks
├── scripts
├── supervised
│   └── notebooks
└── unsupervised
    └── notebooks

Data visualization

Data analysis and visualization of The Human Freedom Index 2019

Introduction to Machine Learning

Linear Regression on Boston Housing Dataset

Cross-validated grid search on a loan dataset, where the TARGET label is 1 if the client defaulted on the loan and 0 if the loan was repaid
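As a rough illustration of what a cross-validated grid search does, here is a minimal standard-library sketch. It is not the code from the notebooks: the k-nearest-neighbours classifier, the single numeric feature, the toy rows in `TOY_LOANS`, and all function names are assumptions made for the example. Only the label convention (1 = defaulted, 0 = repaid) comes from the description above.

```python
from statistics import mean

# Hypothetical toy data: (feature value, TARGET), 1 = defaulted, 0 = repaid.
TOY_LOANS = [
    (0.10, 0), (0.90, 1), (0.20, 0), (0.80, 1),
    (0.30, 0), (0.70, 1), (0.35, 0), (0.65, 1),
    (0.60, 0), (0.40, 1), (0.15, 0), (0.85, 1),
]


def knn_predict(train, x, k):
    """Majority vote among the k training rows whose feature is nearest to x."""
    nearest = sorted(train, key=lambda row: abs(row[0] - x))[:k]
    votes = sum(label for _, label in nearest)
    return 1 if 2 * votes > len(nearest) else 0


def cv_accuracy(data, k, n_folds):
    """Mean accuracy over n_folds folds; fold f holds rows f, f + n_folds, ..."""
    scores = []
    for fold in range(n_folds):
        test = data[fold::n_folds]
        train = [row for i, row in enumerate(data) if i % n_folds != fold]
        hits = sum(knn_predict(train, x, k) == y for x, y in test)
        scores.append(hits / len(test))
    return mean(scores)


def grid_search(data, grid, n_folds=3):
    """Score every candidate k by cross-validation and return the best one."""
    results = {k: cv_accuracy(data, k, n_folds) for k in grid}
    best_k = max(results, key=results.get)
    return best_k, results


if __name__ == "__main__":
    best_k, results = grid_search(TOY_LOANS, grid=[1, 3, 5])
    print("best k:", best_k)
    print("mean accuracy per k:", results)
```

Each candidate value is scored only on rows held out from fitting, so the selected hyperparameter reflects out-of-sample accuracy rather than fit to the training rows. In practice a library routine such as scikit-learn's `GridSearchCV` would typically replace the hand-written loops.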

Supervised Learning

Prediction competition on Walmart Recruiting: Trip Type Classification, held in Diplodatos 2020

Unsupervised Learning

FIFA 20 complete player dataset (Kaggle)

Clustering using

Distributed Computing

Distributed computing on big data using PySpark and GraphFrames

License

Distributed under the MIT License. See LICENSE for more information.

Acknowledgments