This incident type involves the crash of the Apache Airflow scheduler on a Linux server. Apache Airflow is an open-source platform to programmatically author, schedule, and monitor workflows. The scheduler manages the execution of workflows and their tasks. When the scheduler crashes, it can cause workflow failures and disruptions in the overall system. This type of incident requires investigation and troubleshooting to identify the root cause of the crash and implement a solution to prevent it from recurring.
Parameters
Debug
Check if Apache Airflow scheduler is currently running
Check the status of the Apache Airflow scheduler service
Check if the Airflow configuration file is valid
Check the scheduler logs for any errors
Check the system logs for any relevant errors
Check the disk space usage on the server
Check the memory usage on the server
Check the CPU usage on the server
Check if there are any other processes consuming too much resources
Insufficient server resources (memory, disk space, CPU) causing the scheduler to crash.
Repair
Restart the Apache airflow scheduler service to see if it resolves the issue.
Learn more
Related Runbooks
Check out these related runbooks to help you debug and resolve similar issues.