Dataset Information
The Path4Med project focuses on developing a data pipeline for modeling the groundwater level in irrigated areas, specifically in the Tejo Vulnerable Zone (ZV Tejo) in Portugal, including Quinta da Cholda, Golegã. The datasets, sourced from the Sistema Nacional de Informação de Recursos Hídricos (SNIRH), include historical data on key variables such as precipitation, aquifer depth, nitrate concentration, and electrical conductivity. These variables are critical for assessing water and soil quality in the Mediterranean Agro-Hydro-System and supporting sustainable water management. Due to the large size of the files (up to 65MB), they are hosted on Google Drive for easy access.
Download Datasets
Metadata of study variables
Variable | Temporal MinMax | Recording Frequency | Points | Units | Zone |
---|---|---|---|---|---|
Precipitation | 1979-10-01 to 2025-03-15 | Daily | 70 | mm | ZVT and surrounding area |
Piezometry | 1974-10-01 to 2025-01-16 | Undetermined | 73 | m | ZVT_aquifers |
Nitrates | 2000-03-01 to 2023-10-25 | Undetermined | 112 | mg/L | ZVT_aquifers |
Electrical Conductivity | 2000-03-01 to 2023-10-25 | Undetermined | 103 | µS/cm | ZVT_aquifers |
Database Centralization
The Path4Med project adopted the CRISP-DM methodology to structure
the data pipeline, with a focus on the Data Preparation phase to
consolidate, clean, and organize historical data. We centralized
the datasets into a relational database using DBeaver and
PostgreSQL. This database includes tables such as, metadata
tables:
info_qualidade
, info_piezo
,
info_meteorologia
, and value tables:
condut_tejo_loc
, nitrato_tejo_loc
, and
precipitacao_tejo_loc
, which store data on water
quality, meteorological conditions, and aquifer characteristics.
This setup ensures data quality and supports predictive modeling
of groundwater levels in the ZV Tejo.

Fig 1. Screenshot of the Path4Med database schema in DBeaver, showing the relational structure of water quality, meteorological, and aquifer data.
Visualization and Mapping in QGIS
Using QGIS, the Path4Med project mapped monitoring points across Portugal Continental, focusing on the Tejo Vulnerable Zone and its surroundings, including Quinta da Cholda, Golegã. The map visualizes key variables such as aquifer depth and nitrate concentration. This geospatial analysis aids in evaluating the effectiveness of agronomical practices for mitigating pollution, supporting sustainable water management in the region.


Fig 2. Map extracted from QGIS showing monitoring points above ZVT.