The free Data Engineering Zoomcamp course starts today
https://github.com/DataTalksClub/data-engineering-zoomcamp
The course covers both the theory and the practice(!) of the following technologies:
Google Cloud Platform (GCP): Cloud-based auto-scaling platform by Google
Google Cloud Storage (GCS): Data Lake
BigQuery: Data Warehouse
Terraform: Infrastructure-as-Code (IaC), to create project infra on Google Cloud Platform
Docker: Containerized environment for resources such as Postgres
SQL: Data Analysis & Exploration
Airflow: Pipeline Orchestration tool
dbt: Data Transformation tool
Spark: Distributed Processing
Kafka: Streaming