Close Confidently orchestrate your data pipelines with Apache Airflow by applying industry best practices and scalable strategiesKey FeaturesUnderstand the steps for migrating from Airflow 1.x to 2.x and explore the new features and improvements in version 2.xLearn Apache Airflow workflow authoring through real-world use casesUncover strategies to operationalize your Airflow instance and pipelines for resilient operations and high throughputPurchase of the print or Kindle book includes a free PDF eBookBook DescriptionData professionals face the monumental task of managing complex data pipelines, orchestrating workflows across diverse systems, and ensuring scalable, reliable data processing. This definitive guide to mastering Apache Airflow, written by experts in engineering, data strategy, and problem-solving across tech, financial, and life sciences industries, is your key to overcoming these challenges. It covers everything from the basics of Airflow and its core components to advanced topics such as custom plugin development, multi-tenancy, and cloud deployment. Starting with an introduction to data orchestration and the significant updates in Apache Airflow 2.0, this book takes you through the essentials of DAG authoring, managing Airflow components, and connecting to external data sources. Through real-world use cases, you’ll gain practical insights into implementing ETL pipelines and machine learning workflows in your environment. You’ll also learn how to deploy Airflow in cloud environments, tackle operational considerations for scaling, and apply best practices for CI/CD and monitoring. By the end of this book, you’ll be proficient in operating and using Apache Airflow, authoring high-quality workflows in Python for your specific use cases, and making informed decisions crucial for production-ready implementation.What you will learnExplore the new features and improvements in Apache Airflow 2.0Design and build data pipelines using DAGsImplement ETL pipelines, ML workflows, and other advanced use casesDevelop and deploy custom plugins and UI extensionsDeploy and manage Apache Airflow in cloud environments such as AWS, GCP, and AzureDescribe a path for the scaling of your environment over timeApply best practices for monitoring and maintaining AirflowWho this book is forThis book is for data engineers, developers, IT professionals, and data scientists who want to optimize workflow orchestration with Apache Airflow. It's perfect for those who recognize Airflow’s potential and want to avoid common implementation pitfalls. Whether you’re new to data, an experienced professional, or a manager seeking insights, this guide will support you. A functional understanding of Python, some business experience, and basic DevOps skills are helpful. While prior experience with Airflow is not re

Apache Airflow Best Practices

QRcode

A practical guide to orchestrating data workflow with Apache Airflow

Confidently orchestrate your data pipelines with Apache Airflow by applying industry best practices and scalable strategiesKey FeaturesUnderstand the steps for migrating from Airflow 1.x to 2.x and explore the new features and improvements in version 2.xLearn Apache Airflow workflow authoring throug

Voir toute la description...

Auteur(s): Intorf, DylanStorey, DylanDoorn, Kendrick Van

Editeur: Packt Publishing

Année de Publication: 2024

pages: 188

Langue: Anglais

ISBN: 978-1-80512-375-0

eISBN: 978-1-80512-933-2

Confidently orchestrate your data pipelines with Apache Airflow by applying industry best practices and scalable strategiesKey FeaturesUnderstand the steps for migrating from Airflow 1.x to 2.x and explore the new features and improvements in version 2.xLearn Apache Airflow workflow authoring throug

Voir toute la description...

Découvrez aussi...