Large-scale data processing @ Datascience WRUST

Large-scale data processing course page

View the Project on GitHub riomus/lsdp

Laboratories

Grading

L0 - Introduction

L1 - Bash, Docker, Python parallelization

L2 - Scraper implementation (Celery, InfluxDB, Grafana)

L3 - Text embedding and database ingest (MongoDB, Redash)

L4 - Spark

L5 - K8s, Helm

L6 - Model serving

Lectures

W0 - Introduction

W1 - Basic notation, definitions

W2 - Programming languages

W3 - Platforms, tools

W4 - Spark

W5 - GBM

W6 - Large-scale ML