I'm searching for the best way to run/debug tasks and DAGs in my IDE. I see that there are two ways of doing this: I can run the `airflow test` command in debug mode for a particular DAG and, optionally, a task; or I can use the `DebugExecutor` and run a particular DAG. Both ways seem to require that the Airflow database is up and running and that all pools are configured (probably queues as well). My questions are:

Does `airflow test` use the `DebugExecutor` under the hood? What is the main difference between these two?
The `DebugExecutor` runs a full DAG Run, so you can test trigger rules. The `airflow test` command runs only one task.

This is even clearer in Airflow 2.0, where there are separate commands:

`airflow dags test` - starts one DAG Run with the `DebugExecutor`.
`airflow tasks test` - starts one task.

Does `airflow test` use the `DebugExecutor` under the hood?

No. If you use the `DebugExecutor`, the full scheduler logic runs. If you use the `airflow tasks test` command, only the code that would be executed by a worker is executed.
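For reference, the two Airflow 2.0 commands can be invoked like this; the DAG id, task id, and execution date below are placeholders you would replace with your own:

```shell
# Run a full DAG Run with the DebugExecutor (trigger rules are evaluated):
airflow dags test example_dag 2021-01-01

# Run a single task instance, without going through the scheduler:
airflow tasks test example_dag example_task 2021-01-01
```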
Is there a way to run/debug DAGs and tasks without a running Airflow database and without creating the dependent pools and queues?

You can load a DAG with `DagBag` and then call the `execute` method of a task:
from airflow.models.dagbag import DagBag

# Load the DAG file, then execute a single task with an empty context.
dag_file_path = "/home/test-user/dags/dag-file.py"
dagbag = DagBag(dag_folder=dag_file_path)
dagbag.dags['test-dag-id'].task_dict['task-id'].execute({})
Great, thanks! So this way of running tasks is similar to what we would use for unit tests.
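Building on that comment: if a task wraps a plain Python callable (as with a `PythonOperator`), the callable itself can be unit-tested with no Airflow machinery at all. A minimal sketch, where `transform` is a hypothetical function standing in for a real task's `python_callable`:

```python
# Hypothetical task logic: in a real DAG this function would be
# passed to a PythonOperator as its python_callable.
def transform(records):
    """Drop empty records and upper-case the rest."""
    return [r.upper() for r in records if r]


def test_transform():
    # Plain assertion, runnable under pytest or as a script -
    # no database, pools, or executor required.
    assert transform(["a", "", "b"]) == ["A", "B"]
    assert transform([]) == []


test_transform()
```

Tests like this cover the task's business logic; the `DagBag`/`execute` approach above is what you would reach for when the behavior you want to exercise depends on operator wiring rather than the callable alone.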