AWS Online

Data Platforms – 3 nov

November 3, 2021 / 11:15 am - 5:30 pm

Data Tools and Technology Get to grips with the latest tools for data engineering and data observability: dbt, Coiled, and SODA. Learn from the people who are closely involved in the development of the products that are rapidly becoming the tools of choice for leading enterprises. Midway through the session, you will have the opportunity to learn more about TensorFlow Extended running on Airflow during the workshop. Then it's time to dive into SODA, a smart observability tool. After which, Juan Manuel Perafan will share some of the nightmares anyone who has ever developed a dashboard definitely wants to avoid. This packed session ends with an introduction to Coiled.

skip to content

Schedule

11:15 am

dbt Vision and Developments

Host

Renald Buter Chief Operations GoDataDriven

Guests

Jeremy Cohen Product Manager dbt Labs
1:30 pm

ML Ops workshop: Tensorflow Extended running on Airflow

Key concepts of TensorFlow Extended (TFX) and develop the skills to run TFX workflows on Apache Airflow In this workshop, we’ll dive into TFX, a tool built for consistent and reliable deployment of TensorFlow-based models to production. In practice, this means that TFX allows you to follow MLOps best practices with model versioning, data validation, metadata management, performance monitoring, serving and more. For this session, we’ll explore the key concepts of TFX and teach you how to run TFX workflows on Airflow. Use the orchestration system like Apache Airflow or Kubeflow to execute workflows as directed acyclic graphs (DAGs) of tasks. At the end of this workshop, you will explore the key concepts of TFX and teach you how to run TFX workflows on Airflow, hosted on the cloud. It will demonstrate the end-to-end workflow and steps how to analyze, validate and transform data, train a model, analyze and serve it.

Host

Roman Ivanov Machine Learning Engineer at GoDataDriven

Guests

Julian de Ruiter Machine Learning Engineer at GoDataDriven
3:15 pm

SODA - Smart Data Observability

Host

Renald Buter Chief Operations GoDataDriven

Guests

Cor Zuurmond Data Demystifier & Machine Learning Engineer GoDataDriven
Maarten Masschelein Co-Founder Soda Soda
4:15 pm

Dashboarding nightmares - Juan

Host

Renald Buter Chief Operations GoDataDriven

Guests

Juan Manuel Perafan Analytics Engineer
5:00 pm

COILED: BURST TO THE CLOUD WITH DATA SCIENCE AND ML WORKFLOWS

In this session, Matt Rocklin, CEO of Coiled, explains how they help data scientists use Python for ambitious problems, scaling to the cloud for computing power, ease, and speed—all tuned for the needs of teams and enterprises. Coiled is built on Dask, a free and open-source library that helps scale your data science workflows and provides a complete framework for distributed computing in Python.

Host

Renald Buter Chief Operations GoDataDriven

Guests

Matt Rocklin CEO at Coiled