⚙️ Configure Project Paths and Options¶
Runtime settings are centralized in YAML files, defining project paths, data scope, and feature flags per pipeline.
📂 Configuration Files¶
All configuration settings are defined per pipeline. Edit the file corresponding to your use case:
- 📡 OpenSky 2025:
scripts/opensky/config.yaml - ✈️ QAR (Private):
scripts/qar/config.yaml
📝 Configuration Structure Example¶
This example shows the primary fields in the OpenSky configuration.
scripts/opensky/config.yaml
paths:
data_dir: "/path/to/data"
download_dir: "downloaded_parquet"
preprocess_dir: "preprocessed_parquet"
# ... (more directories)
era5_cache_dir: "era5_cache"
era5_features:
- u_component_of_wind
- v_component_of_wind
- temperature
typecodes:
- A320
- A20N
# ...
bada:
bada_4_2_dir: "/path/to/BADA/4.2.1"
🔑 Key Parameters and Best Practices¶
| Section | Parameter | Type | Best Practice / Description |
|---|---|---|---|
| Paths | data_dir |
Path | Crucial: Keep this path absolute. All subfolders (download_dir, models_dir, etc.) are resolved relative to this root. |
| Paths | era5_cache_dir |
Path | Path for local cache of meteorological fields. Setting this prevents re-downloading large files. |
| Scope | typecodes |
List | Single Source: Adjust aircraft type scope here, not by modifying pipeline scripts. |
| BADA | bada_4_2_dir |
Path | Set this only if you plan to run baseline evaluation (07_bada_prediction.py). |
| ERA5 | era5_features |
List | Defines the specific meteorological fields (wind components, temperature) to be used as exogenous inputs. |
QAR Pipeline Variations
The QAR configuration is minimal. It typically only retains essential paths (data_dir, predicted_dir, models_dir), typecodes, and options for parallel processing (computing.default_cpu_count).
Directory Existence
Ensure your main directories exist before running data downloads or preprocessing scripts.
🚀 Next Steps¶
- Create an Architecture: Now that paths are configured, learn how to build the model's core components.