The main business question was to get insights in the performance of 35.000 sensors in the electricity grid, generating TBs of time series data.
The challenge was to:
The end results are continuously monitored in a hosted dashboard that is refreshed automatically on a daily basis.
Project lead and stakeholder management.
Python, PySpark SQL, AWS EMR, Apache Airflow, Docker, Gitlab CI/CD, PowerBI.
Forecasting electricity load of congestion points in the electricity grid by combining data from sensors and weather forecasts.
Read morePredicting costly excavation damages and delivering the results automatically to an operational prevention team.
Read moreDeveloping a big data solution including a dashboard for analyzing high volumes of noisy sensory data.
Read moreRecently, I have been invited to speak about Data Science developments. I also published scientific papers about my work.
Read more