ASABE Logo

Article Request Page ASABE Journal Article

Extreme Learning Machine Predicts  High-Frequency Stream flow and  Nitrate-N Concentrations in a  Karst Agricultural Watershed

Timothy McGill2, William Isaac Ford1,*


Published in Journal of the ASABE 67(2): 305-319 (doi: 10.13031/ja.15747). Copyright 2024 American Society of Agricultural and Biological Engineers.


1    Biosystems and Agricultural Engineering, University of Kentucky, Kentucky, USA.

2    Geosyntec Consultants, Jacksonville, Florida, USA.

*    Correspondence: bill.ford@uky.edu

Submitted for review on 24 July 2023 as manuscript number NRES 15747; approved for publication as a Research Article and as part of the “Digital Water: Computing Tools, Technologies, and Trends” Collection by Associate Editor Dr. Debabrata Sahoo and Community Editor Dr. Kati Migliaccio of the Natural Resources & Environmental Systems Community of ASABE on 8 December 2023.

Highlights

Abstract. Efforts to reduce nitrogen contributions from karst agroecosystems have had variable success, in part due to an incomplete understanding of nitrogen source, fate, and transport dynamics in karst watersheds. Recent advancements in environmental sensors and data-driven artificial intelligence models may be useful in improving our understanding of system behavior and the linkages between soil hydrologic processes and karst nitrate loading dynamics. We collected 35 months of high-resolution streamflow, nitrate-N concentration, soil moisture and temperature (from 10-100 cm depths), and meteorological data in a karst agricultural watershed in the Inner-Bluegrass region of Central Kentucky. Two-layer extreme learning machine (TELM) models were developed to predict nitrate-N concentrations and flow rates as a function of meteorological and soil parameter inputs. Results suggest tight linkages between soil moisture gradients at different depths and nitrate-N concentrations at the watershed outlet. TELM modeling results supported visual observations from the high-frequency data and suggest that inclusion of both soil moisture and temperature parameters at all soil depths improved predictions of both flow rate and nitrate-N concentration (with optimal NSE values of 0.93 and 0.94, respectively, when all inputs were considered). Hysteresis analysis suggested that inclusion of the deepest soil layer (100 cm) was necessary to predict hysteresis observed during storm events. The findings of the study highlight the importance of variable activation of matrix waters in preferential flows throughout events and seasons and its subsequent impacts on nitrate-N concentrations. Results suggest that management models should incorporate vertical variability in soil hydrology to accurately characterize nitrate source and transport dynamics. Further, the results of hysteresis analysis underscore the importance of inclusion of hysteresis indices, in addition to typical model evaluation statistics, to ensure accurate representation of nutrient flow pathways.

Keywords.Extreme learning machine, Karst agroecosystem, Nitrate, Water resources.

The hydrologic and biogeochemical drivers of nitrate loading from karst agricultural watersheds remain poorly understood despite the recognized impact of these landscapes on eutrophication of receiving waters (Weary and Doctor, 2014; Robertson and Saad, 2021). Recent studies have demonstrated that nitrogen transiently stored in the vadose zone is often the primary source of nitrate in watersheds without high rates of chemical fertilizer, manure, or sewage inputs (e.g., Husic et al., 2019; Chang et al., 2022). Nevertheless, current process-based and conceptual modeling frameworks in karst agroecosystems have limitations in simulating nitrate loading dynamics due to an oversimplification of soil hydrological and biogeochemical processes and to the high parametric uncertainty associated with equifinality (Hartmann, 2016; Husic et al., 2019). A need exists to advance our understanding of the impact of soil hydrologic and biogeochemical drivers on nitrate loadings in karst landscapes to improve management tools. Recent influxes in high-frequency sensing data have led researchers to increasingly utilize AI-based modeling approaches as predictive and inference tools in water quality assessment, yet their utility for predicting nitrate dynamics in karst agroecosystems remains unclear and is the focus of this study.

Karst landscapes are characterized by dissolution conduits and caves in the subsurface bedrock that have high hydrologic connectivity to surface and soil water. Karst soils that develop on carbonate rock are fine textured with a high clay content (Hartmann et al., 2014). As a result, transmission of waters to the subsurface through the soil matrix (micro porosity) is often low, and subsurface connectivity to dissolution conduits generally occurs through preferential flow paths (e.g., sinkholes and swallets). However, studies have demonstrated that hydrological and biogeochemical processes in the soil matrix are the most important drivers of nitrate concentration dynamics to subsurface springs (Husic et al., 2019). More recently, preferential flow dynamics from soil have been identified as a primary factor impacting nitrate delivery to subsurface springs in cultivated landscapes (Yan et al., 2023), cumulatively suggesting that variable connectivity of soil matrix waters to subsurface conduits via preferential flow paths is an important driver. Preferential flows are now recognized to be comprised of both waters stored in the bulk soil matrix prior to the event (old-water) and event water in the form of precipitation or irrigation inputs (new-water) (Nazari et al., 2022). Exchange between the bulk soil matrix and macropore domains is bi-directional depending on antecedent conditions (Klaus et al., 2013; Germer and Braun, 2015; Chen et al., 2021). During precipitation events with low antecedent moisture, preferential flows from overland flow may be transported directly to the subsurface or may infiltrate into soils; however, under saturated soil conditions, pre-event matrix water can be mobilized to groundwater springs via preferential flow paths and can occur at variable depths throughout the soil column (Klaus et al., 2013; Ford et al., 2018; Wang et al., 2018). Variable connectivity of different soil layers during storm events is likely important for predicting nitrate concentrations and loadings in karst agroecosystems.

Additionally, vertical and temporal variability in soil biochemical processes influence the nitrate composition of porewaters that are transiently stored in the soil matrix. For example, coupled evapotranspiration and mineralization of soil organic matter can result in nitrate buildup in porewaters between flushing events, leading to seasonal variations in nitrate concentrations in both karst and non-karst agroecosystems (Donner et al., 2004; Yevenes and Mannaerts, 2011; Tian et al., 2015; Yang et al., 2017; Seybold et al., 2019; Yue et al., 2019). As a result, studies have demonstrated asymmetric profiles of nitrate-N concentrations with depth in the soil column (Igbal and Krothe, 1995; Green et al., 2018). Nevertheless, existing conceptual and physically based models simulating karst hydrology and water quality often neglect vertical soil profile variability (e.g., Hartmann et al., 2014; Husic et al., 2019), due in part to limited databases that capture nitrate dynamics at high frequencies.

Advances in sensing technology over the last decade have enabled the collection of in situ hydrologic and water quality data at sub-hourly frequencies via UV spectroscopy-based sensors (Pellerin et al., 2013; Carey et al., 2014; Jensen and Ford, 2019). These high-frequency datasets have enabled analysis of concentration versus discharge patterns at the event-scale, which can inform source and pathway connectivity dynamics (Blaen et al., 2017; Liu et al., 2021). In karst agroecosystems, watershed nitrate concentrations during storm events initially become diluted with low nitrate concentrations from new-water on the rising limb of the hydrograph, increase following peak flows as soil and epikarst connectivity of old-water increases, and then decrease towards pre-event levels as phreatic connectivity becomes prominent during the receding limb of the hydrograph (Baran et al., 2008; Rusjan et al., 2008; Buda and DeWalle, 2009; Ford et al., 2019; Husic et al., 2019). Quantitative hysteresis index (HI) values have been used to evaluate the direction and magnitude of hysteretic behavior and therefore may be useful in numerical model evaluations to ensure models can accurately capture nitrate source and pathway variability (Liu et al., 2021; Mehdi et al., 2021; Husic et al., 2023). Given the variability in connectivity of waters throughout the soil profile during events and variable nitrate concentrations in pore waters with depth, we hypothesize that the hysteretic behavior of watershed nitrate concentrations is tightly linked with the hysteretic behavior of soil hydrology.

Given the limitations of current numerical models to simulate karst hydrology and water quality, data-driven artificial intelligence models may be useful in informing important processes that impact nitrate concentrations. The influx of large databases associated with in situ sensor advancements has led to an increase in AI applications in hydrologic and water quality studies in recent years (Krishnan et al., 2022). Several studies have applied Extreme Learning Machines (ELMs), which are feedforward neural networks that achieve similar or better generalization performance, scalability, and faster learning times than other AI-modeling approaches such as support vector machines (Huang et al., 2012). Recent studies have demonstrated the effectiveness of ELM models to predict stream and river flow rates (Deo and Sahin, 2016; Yaseen et al., 2016; Roushanger et al., 2017; Rezaie-Balf and Kisi, 2017; Yaseen et al., 2019), as well as ground and surface water quality (Barzegar et al., 2018a,b; Fijani et al., 2019). Therefore, ELM models may also be effective at simulating nitrate loading dynamics in karst agroecosystems.

The objective of this study was to assess the impact of soil hydrologic connectivity dynamics on nitrate loadings in karst agroecosystems, utilizing an AI-based modeling approach to inform important processes. To meet the overarching objective, the specific objectives were to: (1) collect and analyze a novel high-frequency in situ dataset of surface water nitrate-N concentrations and flow rates and compare with high-frequency soil sensing data in a karst agricultural watershed, (2) assess the utility of ELM models to predict nitrate-N concentrations and flow rates in the karst agroecosystem and quantify the relative importance of meteorological, soil moisture, and soil temperature inputs on model response variables, and (3) assess the utility of ELM models to account for storm-event hysteresis impacting nitrate-N concentration and loadings.

Study Site

To meet the objectives of this study, data collection and analysis efforts were conducted in the Camden Creek watershed in Kentucky, U.S.A. (fig. 1). This watershed has a drainage area of 1,069 ha (including surface and spring drainage areas) and drains much of the University of Kentucky 600 ha C. Oran Little Research Center (LRC). The watershed is located within the Inner-Bluegrass physiographic region of central Kentucky, which is characterized by a temperate Midwestern United States climate (Ford et al., 2019). Over the past decade, the region has received an average annual rainfall of 1322 mm with an average temperature of 13.14°C. The surface tributaries in the watershed are shallow and emanate from springs. Streams flow over limestone bedrock and are mostly unshaded through grazed pasture with riparian vegetation, with low streambed sediment storage on the exposed bedrock (Ford et al., 2019).

The watershed was selected for study because: (1) extensive hydrologic and water quality research has been conducted in the watershed (Ford et al., 2019; Bunnell et al., 2020; Radcliffe et al., 2021), (2) a U.S. Climate Reference Network meteorological and soil gauging station is located on the research farm, (3) the watershed is impacted by complex heterogeneity in karst maturity with flow pathways ranging from seeps activated during high-intensity rainfalls to large regional perennial springs (Reed et al., 2010), and (4) the land use is typical of agriculture across the state of Kentucky, as well as much of the food producing Midwest encompassing cropland, animal husbandry operations including beef, swine and sheep facilities, and pasture lands for horse and cattle. Inorganic nitrogen application rates and the volume of manure slurry applied to the fields were provided by farm managers. Based on reported fertilization rates, organic fertilizer application (injected liquid swine manure) occurred on approximately 4% of the watershed area, while inorganic fertilizer application (primarily 32-0-0 urea ammonium nitrate and 46-0-0 urea) occurred on approximately 16% of the watershed area. In our preliminary analysis, we considered application rate and timing as model input variables but found them to be insensitive predictors, likely because of the limited drainage area impacted by fertilization (McGill, 2022). We therefore excluded fertilization as an input variable for this study.

Figure 1. (a) University of Kentucky ARC property lines in Kentucky, USA, noting fields, basin watershed boundaries, and locations of the ST1 surface water monitoring site and the NOAA weather station. (b) Aerial image of the NOAA meteorological and soil moisture sensing station. (c) Sensors, housing, and v-notch weir located at the watershed outlet.

Methodology

Data Collection and Analysis

Data collection efforts included surface water, meteorological, and soil monitoring. Surface water measurements were collected at the watershed outlet (ST1) and included flow depth and nitrate-nitrite-N concentrations obtained from in situ water quality sondes that were validated via grab sample analysis. Meteorological and soil variables, including precipitation, wind speed, air temperature, solar radiation, soil moisture, and temperature at depths of 10, 20, 50, and 100 cm were obtained from the National Centers of Environmental Information’s climate reference network, which is operated through the National Oceanic and Atmospheric Administration (NOAA).

Surface Water Measurements

Volumetric flow rates (Q) at the watershed outlet were calculated based on flow depth over a 120° V-notch with an invert 0.5 feet from the channel bed with a YSI EXO2 sonde (Bunnell et al., 2020). The YSI EXO2 Sonde required monthly calibration, battery replacement, and general maintenance consistent with manufacturer specifications. The sonde’s pressure measurements were collected at 15-minute intervals, beginning on 29 August 2018 through 4 August 2021 (1070 days), with some periodic gaps in the record due to sensor failures and research restrictions associated with the Covid-19 pandemic. Flow depths measured with the sonde were compared with manual flow depth measurements to ensure accuracy.

High-frequency nitrate/nitrite-N measurements at the watershed outlet were obtained using a SUNA V2 sensor manufactured by Sea-Bird Scientific. Data were collected from 7 September 2018 to 4 August 2021 at 15-minute intervals. Like flow rate, data gaps existed due to sensor errors, restrictions to research from Covid-19 precautions, and data that failed QC protocols. The longest period of missing data occurred from 20 November 2019 until 20 July 2020. Reference spectrum updates were completed every four to twelve weeks, although most updates were carried out every four to five weeks. The sensor was checked for debris or other possible fouling during site visits. The SUNA was powered externally, and a data logger was used to store and offload data. The software used to communicate with the SUNA V2 was Sea-Bird Scientific’s UCI 2.0.3 version software. Grab samples were used to validate SUNA measurements and ensure flagged values were accurate. At least two samples were collected each month and delivered to the Kentucky Geological Survey for analysis of nitrate-N with EPA method 9056A-Determination of Inorganic Anions by Ion Chromatography (EPA, 2007). Concentrations of nitrite-N are generally more than an order of magnitude lower than nitrate-N at the site, and hence were not measured explicitly. Linear regression was performed between measured nitrate-N values and SUNA measurements to ensure the accuracy and precision of in situ measurements.

Meteorological and Soil Moisture Data

Soil and meteorological data were obtained from the Midwestern Regional Climate Center cli-MATE database and the U.S. Climate Reference Network (Diamond et al., 2013) and collected by the automated weather observation system located on the LRC property (ID:63838, latitude 38.0944 N, longitude -84.7464 W, fig. 1). This station contains a Geonor T-200B series precipitation gauge and three sets of HydraProbe II sensors using SDI-12 connections mounted 120° off from each other (fig. 1), based on NOAA standards (Geonor, 2016; NOAA, 2021). The station is in a pasture field, containing Bluegrass-Maury and McAfee silt loam-soil along a hillslope that is characteristic of the broader Camden Creek watershed. Soil moisture and temperature sensors were located at 10, 20, 50, and 100-cm depths. Data were collected hourly and interpolated to 15-minute results using linear interpolation to align with nutrient and flow rate measurement frequency. Meteorological variables including precipitation, 1.5m windspeed, air and surface temperature, relative humidity, and solar radiation data were obtained at 5-minute intervals and were aggregated to 15-minute intervals. The meteorological and soil data were obtained from 29 August 2018 to 4 August 2021. While we feel the meteorological and soil data were characteristic of the watershed, we note that the soil and meteorological data was limited to this single monitoring station.

There were several periods with missing or erroneous soil data. The largest data gap was associated with the 100-cm data from 29 August 2018 to 30 June 2019. Following quality control analysis, 100 cm soil depth data were only analyzed from July 2020 to August 2021. Further, during manual inspection of the 50 cm soil data, periodic short-term erroneous increases in soil moisture data were removed and interpolated based on measurements before and after the erroneous readings. Data gaps also existed from 31 December 2018 until 12 January 2019 for air temperature, precipitation, surface temperature, and 1.5 m wind speed.

Modeling Nitrate-N Concentrations in  Karst Agroecosystem Watershed

Model Scenarios and Approach

Data-driven AI modeling was performed to assess: (1) ‘How does representation of soil hydrologic variability impact watershed nitrate-N concentration and flow predictions in karst watersheds?’, and (2) ‘What degree of vertical discretization of the soil profile is needed to represent within-event hysteresis dynamics?’. The response variable in the model was nitrate-N concentrations or flow rate at the watershed outlet. Variables used as inputs for the nitrate-N concentration and flow rate predictions were based on available long-term meteorological and soil data, which were perceived to impact hydrologic and N cycles in the karst agroecosystem watershed. Model inputs included precipitation, air temperature, ground surface temperature, soil moisture and soil temperature at depths of 10, 20, 50, and 100 cm, solar radiation, wind speed at 1.5 m, and relative humidity. We ran three model scenarios to address our research questions. Scenario 1 included all meteorological parameters and soil moisture and temperature variables from 10 to 50 cm for the full 3-year monitoring period. Scenario 2 included only meteorological parameters for the full 3-year monitoring period. Scenario 3 included all meteorological parameters and soil moisture parameters from 10 to 100 cm for the one-year period in July 2020 to August 2021, in which all datasets were available.

We selected a two-hidden-layer ELM (TELM) model to predict the nitrate-N concentration and flow rate response variables at the watershed outlet. For each of the three modeling scenarios, the input and response variable datasets were randomly divided into fifths, in which four of the divided sets were used to train the model in batches and the last fifth was retained for model validation. Each scenario was trained at least three times to account for training equifinality. We used the Rectified Linear Unit activation function (Agostinelli et al., 2014; Banerjee et al., 2020; Jin et al., 2015). We tested the sensitivity of the number of hidden layers and the number of neurons in each hidden layer and found that 50 neurons in each hidden layer provided the lowest mean squared (MSE) for outputs. To check for TELM model overfitting due to redundant inputs, a variable pruning analysis (Bishop, 1995) was performed, and the results showed a limited impact on model fit and calibration statistics. An optimized tensor computation package known as PyTorch was employed to assist in model generation in Python (Steppa and Holch, 2019).

Statistical Evaluation of Model Performance

The output of the TELM model for each scenario was compared using statistical and visual model evaluation approaches. Timeseries of measured and modeled data were plotted to visualize goodness-of-fit both globally and at finer-resolution time scales. Nash-Sutcliffe Efficiency (NSE) was used as a quantitative model evaluation statistic (McCuen et al., 2006; Moriasi et al., 2007) and was calculated as follows:

        (1)

where

= ith observation of nitrate-N concentration or flow rate

= ith value of nitrate-N concentration or flow rate from the model

ymean = mean of the observed nitrate-N concentration or flow rate for the entire modeling period

N = total number of observations.

Integrated Gradients Analysis of Input Variables

Integrated gradients (IG) analysis was performed to determine the impact of input variables on model training. Gradients of the output with respect to the model input are a natural analog of the model coefficients for a deep network, meaning they are a good starting point for defining the attribution, or relevance, of inputs to the output (Sundararajan et al., 2017). The IG algorithm within the Captum module recognizes these gradients within the trained model and integrates them across the entire sample to determine the average importance each variable carried during the training process (see Sundararajan et al., 2017 for full explanation). By using these integrated gradients, each variable’s perceived effect on the system can be represented. Values were reported as a fraction and sum to one. This attribution function was run three times for each model run to ensure an accurate estimate was made due to the use of the Riemann Sum to determine the IG. Once generated from each model run, the IG were averaged, giving a composite IG for each run. The composite IGs from each run were then averaged to give an IG for each scenario.

Concentration-Discharge Hysteresis Analysis

To evaluate the ability of the TELM model to predict nitrate hysteresis, we performed quantitative hysteresis index (HI) and qualitative concentration-discharge plot analysis (Lloyd et al., 2016). A small, medium, and large (except for summer) event was obtained from each season, and we compared the observed hysteresis using the measured nitrate-N concentration and observed discharge with the modeled hysteresis using the predicted nitrate-N concentration and observed discharge. We operationally defined event size based on peak flow rate (small < 0.3 cms, medium 0.3 to 0.9 cms, and large > 0.9 cms). The seasons were defined as winter (21 December to 19 March), spring (20 March to 20 June), summer (21 June to 21 September), and fall (22 September to 20 December). In total, eleven storm events were selected for analysis (table 2). The hysteresis index was determined by first normalizing the flow and concentration data for each storm event as follows:

        (2)

where

Qi = measured flow at the ith time step

Qmin = minimum flow measured during the event period

Qmax = maximum flow measured during the event period.

        (3)

where

Ci = nitrate-N concentration at time step i

Cmin = minimum nitrate-N concentration during the event period

Cmax = maximum nitrate-N concentration during the event period.

With data normalized, the HI can be determined by calculating the difference in normalized concentration values on rising and falling limb at 5% and averaging intervals throughout the event.

        (4)

where

k = index representing percentile of peak flow in 5% increments

HIevent = hysteresis index for each event

Ck,RL = normalized concentration at kthpercentile of flow on the rising limb

Ck,FL = normalized concentration at kthpercentile of flow on the falling limb

M = number of flow intervals analyzed.

Results and Discussion

Surface Water and Soil Data Results

Results showed that the SUNA V2 in situ measurements produce accurate and unbiased results under low and high flow conditions relative to laboratory analysis of the 80 grab samples (figs. 2 and 3a). The best-fit linear regression equation between laboratory and SUNA V2 measurements had a slope of 0.96 and an intercept of 0.07 (R2 = 0.98) (fig. 2).

Figure 2. Linear regression comparing laboratory analysis of grab samples (Laboratory Nitrate-N concentration) and the SUNA V2 Nitrate-N measurements. Linear regression results and coefficient of determination are included.

Timeseries of the nitrate-N and discharge data highlighted the impacts of flow dynamics and seasonality on nitrate-N concentrations (fig. 3). On average, summer concentrations were least, often dropping below 2 mg/L, coinciding with lower flow rates for the season. Conversely, during elevated baseflow conditions of late fall and winter, nitrate-N concentrations were greater, often exceeding 4 mg/L. Regarding storm events (figs. 3b-e), a dilution in the nitrate-N concentration was commonly observed on the rising limb of the hydrograph, although this dilution is much more prominent in the winter months, reflecting the gradients observed in pre-event concentrations between seasons. On the falling limb of the hydrograph, nitrate-N concentrations increased and often met or exceeded pre-event levels.

Findings from our study site are reflective of dynamics commonly observed for contaminants in karst and agricultural watersheds, excluding findings from heavily fertilized agricultural landscapes (Liu et al., 2007; Baran et al., 2008; Buda and DeWalle, 2009; Blaen et al., 2017; Husic et al., 2019; Yue et al., 2019; Jackson and Polk, 2020). Storm event hysteresis patterns reflect hydrologic connectivity pathways that have been commonly described. Lui et al. (2007) and Jackson and Polk (2020) showed similar seasonal trends and storm event dilution and recharge characteristics in karst study sites when analyzing specific conductance and dissolved inorganic carbon. Likewise, numerous studies have highlighted increased nitrate-N concentration and loading during the wet season, which has been attributed to enhanced hydrologic connectivity of nitrate rich pore waters in the soil to the watershed outlet (Basu et al., 2010; Yang et al., 2017; Husic et al., 2019; Yue et al., 2019). Contrary to findings from heavily fertilized agricultural fields and watersheds (e.g., Rusjan et al., 2008; Kennedy et al., 2012; Ford et al., 2018), we did not find flushing of nitrate-rich waters during the rising limb or peak of the hydrograph, which is demonstrated quantitatively in our hysteresis analysis (table 4; fig. 6).Collectively, the findings suggest the study site is a representative testbed to evaluate the ability of machine learning algorithms to represent hydrologic connectivity processes of N sources, although its inability to reflect fluvial fertilizer losses is a limitation.

Comparison of the nitrate-N measurements with soil moisture data illustrates that the magnitude and variable response of soil moisture layers with depth correspond with nitrate-N concentration dynamics at the watershed outlet (fig. 4). The measured nitrate-N concentrations were typically low during periods of low soil moisture, and concentrations were elevated during periods of high soil moisture (fig. 4a). Generally, greater changes in soil moisture levels corresponded with greater shifts in nitrate-N concentrations. In events during seasonal transitions from dry to wet periods, particularly summer to late fall (fig. 4b), the variable soil moisture responses within different layers of the soil profile corresponded with a variable response in nitrate-N concentration fluctuations. When the shallower depths of the soil profile (10 and 20-cm depths) had increasing soil moisture contents without a response in deeper soil layers (e.g., 25 to 26 August 2019), modest increases in nitrate-N concentration were observed (1.26 to 1.86 mg/L). Conversely, as deeper soil layers (e.g., 50-cm) had increasing soil moisture content, larger shifts were observed in nitrate-N concentrations. For instance, the 6 October 2019 event impacted the 50-cm soil depth with an increase in volumetric water content from 0.13 to 0.19, and a subsequent increase in nitrate-N concentration from 0.36 to 2.53 mg/L was measured (fig. 4b). We also observed a lag in response time between the layers of the soil profile when all measured layers are dynamic during an event (fig. 4c). This shows the deeper layers begin to activate and reach their peak moisture content later in the event than the shallower layers, with the 100-cm layers showing activation following all other layer’s peak activation. Given the variable response of nitrate dynamics to the variable soil depths and the variable timing of soil moisture response during storm events, we anticipate that representation of soil moisture variability would be an important component to accurately represent event-hysteresis dynamics.

Figure 3. (a) Timeseries for measured nitrate-N concentrations and discharge at the watershed outlet from 29 August 2018 through 4 August 2021. Storm events and subsequent recessions are provided for typical events in winter on (b) 10 to 11 February 2021, and (c) 23 to 24 January 2019. Storm events and subsequent recessions are also provided for typical events in summer on (d) 2 to 3 September 2020 and (e) 21 to 22 July 2019.

TELM Modeling of Flow Rate and  Nitrate-N Concentrations

Comparison of the predicted nitrate-N concentration for Scenarios 1 and 2 highlighted the importance of accounting for soil moisture variability in predicting dynamics in the karst agroecosystem watershed (table 1; figs. 5a-c). For the nitrate-N concentration models, the best fit models for Scenario 1 and 2 had NSE averaging 0.89 and 0.38, respectively. While the criterion is not available for sub-daily NSE values, the daily criterion for N modeling (Moriasi et al., 2015) suggests our results would fall in the “Very good” and “Satisfactory” categories for Scenario 1 and 2, respectively. Visually, the results show the TELM models in both Scenario 1 and 2 represented the seasonality of the data, with an average increase during the wetter winter months and an average decrease during the summer months (figs. 5a and 5b). Where the models differed was in predicting variability in nitrate-N concentrations at shorter timescales, particularly when there were major shifts in both the nitrate-N concentrations and soil moisture contents in deeper soil layers (e.g., October 2020). Results of within-event variability are compared for Scenarios 1 and 3 using the hysteresis analysis (table 4; figure 6).

Figure 4. (a) Measured nitrate-N concentrations are plotted against the volumetric water content measured at four different depths within the soil profile (10, 20, 50, and 100-cm) throughout the monitoring period of 29 August 2018 through 4 August 2021. (b) Period from 7 August to 20 November 2019 is emphasized to demonstrate varied temporal response of soil moisture with depth and the associated impacts on nitrate-N concentrations. (c) Rain event on 25 to 26 January 2021 showing the activation of the various layers within the soil profile and the time lag associated with this activation during a rain event.

Similarly, flow rate modeling improved when including soil moisture data but highlighted the importance of representing the 100 cm soil layer dynamics to accurately predict hydrographs (table 1; figs. 5d and 5e). Best fit models for Scenario 1 and Scenario 2 had NSE values of 0.65 and 0.16, respectively, which would fall in the “Satisfactory” and “Not Satisfactory” categories for daily flow model evaluations outlined in Moriasi et al. (2015). Both models captured baseflow variability seasonally; however, Scenario 2 was not able to capture event-scale hydrograph dynamics (figs. 5d and 5e). Scenario 1 improved capturing general trends of hydrographs during events but often missed peak flow predictions. When including 100 cm soil depth data in Scenario 3, visual and statistical results improved markedly over Scenario 1, with NSE values increasing to 0.93 and an improved representation of peak flows and hydrograph recessions (fig. 5f).

Comparing results from the integrated gradients analysis supports the importance of both soil moisture and temperature for predicting stream nitrate-N concentrations and flow rates (table 3). The soil moisture and temperatures at various depths within the soil profile received the highest attribution scores for both nitrate-N concentration and flow rate predictions. For nitrate-N modeling, soil parameters had attribution scores summing to 0.82 (0.39 for soil moisture, 0.43 for soil temperature) and 0.80 (0.42 for soil moisture and 0.38 for soil temperature) for Scenarios 1 and 3, respectively. Similarly, soil parameters had attribution scores summing to 0.81 (0.52 soil moisture and 0.29 soil temperature) and 0.80 (0.41 soil moisture and 0.39 soil temperature) for Scenarios 1 and 3 of the flow rate models. This finding suggests similar importance of both soil moisture and temperature in both hydrologic and nitrate-N concentration simulations and marginal contributions <0.2 of all other variables. Surprisingly, precipitation was the least important parameter in modeling, likely reflecting the lag time between precipitation occurrences and the subsequent hydrologic response at the watershed outlet.

Table 1. Summary of the variables included in each set of model training scenarios for both nitrate-N and discharge. Each scenario was trained three times to develop an average. The highlighted cells indicate which variables were included in each set of runs. Average Nash Sutcliffe Efficiency results for both nitrate-N concentrations and discharge are provided for each scenario.
VariablesScenario 1
(Aug 2018- Aug 2021)
Scenario 2
(Aug 2018- Aug 2021)
Scenario 3
(Aug 2020- Aug 2021)
Atmospheric variablesPrecipitation, wind speed, solar radiation,  air temp, surface temp, relative humidity
Soil Profile10 cm
20 cm
50 cm
100 cm
Nash Sutcliffe Efficiency (NSE)Nitrate-N Model0.890.380.94
Flow rate Model0.650.160.93
Table 2. Summary of the rain events chosen to perform the hysteresis analysis. Three events of varying sizes were chosen from each season to evaluate the TELM model’s ability to capture storm event dynamics. Captured events during the summer months in our monitoring duration did not meet the criteria for a large event.
SeasonSizeStart DateEnd DateMax Q
(cms)
SummerSmall29 July 20209 August 20200.18
Med3 July 201913 July 20190.40
LargeNoneNoneNone
FallSmall19 December 202023 December 20200.14
Med28 October 202010 November 20200.36
Large30 October 20197 November 20191.85
WinterSmall10 February 202121 February 20210.28
Med30 January 202110 February 20210.84
Large27 February 202114 March 20212.76
SpringSmall23 April 202129 April 20210.08
Med2 May 202122 May 20210.76
Large6 June 202113 June 20211.53

The results of the model evaluation and integrated gradients analysis suggest the importance of soil moisture and temperature dynamics to regulate both hydrologic and biochemical processes impacting watershed nitrate loadings. Regarding nitrate-N concentrations, soil temperature and moisture were expected to be important predictors of nitrate-N concentrations, given that they are well recognized to impact a variety of N transformations such as mineralization, nitrification, and denitrification (De Neve et al., 2003; Guntinas et al., 2012; Miller and Geisseler, 2018). However, the sensitivity of flow rate predictions to both moisture and temperature of the different soil layers also likely reflects the ability of the model to capture variable hydrologic connectivity throughout an event. The importance of soil moisture variables likely reflects that stored matrix water becomes hydrologically connected to the watershed outlet via matrix-macropore interaction, conveying the water and constituents to subsurface conduits and springs (Donner et al., 2004; Yevenes and Mannaerts, 2011; Tian et al., 2015; Husic et al., 2019).

The importance of soil temperature parameters in flow modeling is less intuitive. One possible explanation is that soil temperature variability reflects infiltration of influent rainfall and preferential flows, which will impact the extent of transport to the monitored surface channel and hence enable improved understanding of water attenuation by the soil layers. Similarly, for nitrate-N concentrations, soil moisture temperature is a non-conservative tracer (e.g., Chi et al., 2020) and will reflect the extent of mixing of pre-event matrix waters and new water from the event. Therefore, it may have aided in accounting for the time-varying concentration dynamics of the pore waters that are then transported to preferential flow paths during events. While the exact reasons for the sensitivity of these variables remain uncertain, and are outside the scope of our study, our results highlight the importance of representing soil profile variability when simulating karst watershed hydrology and nitrate-N concentrations and provide avenues for future investigation.

Hysteresis Analysis

Qualitative concentration versus discharge plots for measured and modeled (Scenario 1 and 3) nitrate-N concentrations demonstrated improvements when including the 100 cm soil depth measurements (fig. 6). Scenario 3 displayed an improved representation of measured hysteresis dynamics in most events. As an example, for the event from 20 to 21 December, Scenario 1 (fig. 6e) predicts a clockwise hysteresis pattern with concentration increases early in the event, leveling off during most of the rising limb, and slowly decreasing during the falling limb of the hydrograph. Scenario 3 more closely tracks the observed trends for most of the rising limb. It also predicts increased concentrations on the falling limb, resulting in an anti-clockwise looping pattern. Further, for the event from 4 to 5 May (fig. 6j), both scenarios predicted anti-clockwise hysteresis patterns, and the rising limbs of both scenario predictions showcased similar magnitudes and trends. However, Scenario 3 more accurately predicted concentrations on the falling limb, which aligns with the lag-time associated with deeper portions of the soil profile.

(c)
Figure 5. Timeseries of the measured vs. predicted nitrate-N concentrations (a-c) and discharge (d-f) for Scenario 1 (a,d), Scenario 2 (b,e), and Scenario 3 (c,f). Note Scenario 3 has a different timeframe (29 July 2020 through 4 August 2021) from Scenarios 1 and 2.

The hysteresis index provided further quantitative evidence that Scenario 3 improved the simulation of within-event dynamics as compared to Scenario 1 (table 4). For the eleven observed events, all hysteresis indices were negative, indicating anti-clockwise patterns, and fell between -0.20 and -0.63 (average HI = -0.39). Regarding model results, Scenario 1 predicted that the small events in fall and spring had positive HI values, indicating clockwise hysteresis patterns, and the nine other events had negative HI values, ranging from -0.52 to 0.32 (table 4a). For the events simulated in Scenario 3, the HI values were all negative and fell between -0.57 and -0.19, with an average of -0.33 (table 4b). As a result, the difference between the observed HI and predicted HI for scenario 1 was 0.29, and it was 0.08 for scenario 3 on average. These findings indicate that inclusion of the deepest soil layer (100 cm) was important in capturing the counterclockwise hysteresis behavior across all events and reflects the delay in saturation for the deepest soil layer observed later in the precipitation event (fig. 4c). Collectively, these findings underscore the importance of including vertical soil variability dynamics in accurately representing within-event nitrate-N concentration dynamics.

Implications for Modeling Nitrate  Dynamics in Karst Agroecosystems

The availability of novel datasets, including soil moisture, hydrology, and nitrate measurements enabled a highly accurate TELM model of flow and nitrate-N concentrations in a heterogenous karst agroecosystem in this study, but the model may have limitations in broad-scale applicability given that long-term soil moisture and temperature data at different depths are scarcely available. In the absence of soil moisture data, recent innovations have suggested time lags between input and response variables can be represented using transformations and decomposition of input and response variable signals, including approaches such as wavelet transforms (Barzegar et al., 2018a), particle swarm optimization (Zhu et al., 2020), or empirical mode decomposition (Prasad et al., 2018). For instance, Prasad et al. (2018) employed a hybrid ELM model integrated with ensemble empirical mode decomposition to forecast upper and lower layer soil moisture using meteorological data (solar radiation, precipitation, minimum and maximum daily temperatures) along with continental parameter maps (albedo, soil characteristics, and seasonality of vegetation), reporting an R2 of 0.97 between the observed and predicted soil moisture. An adaptation to the ELM model that incorporates one of these methods could prove to be an effective tool in developing models that can accurately represent and predict nitrate exports when soil data are lacking or sparse.

Figure 6. Concentration vs. discharge hysteresis loops for observed and predicted results for the eleven rain events (a-k) selected for the hysteresis comparison (table 2). Note that only nine events occurred during the period simulated for Scenario 3.

Regarding process-based and conceptual models, the findings of this study underscore the importance of inclusion of vertical soil variability when quantifying nitrate exports. Existing modeling approaches for water quality simulations in karst utilize reservoir-style models that often lump the entire soil matrix or the soil matrix and epikarst into a single reservoir (e.g., Husic et al., 2019). Our findings suggest that root zone, and deeper soil moisture dynamics (and subsequent matrix-macropore exchange), should be more robustly represented given the gradients observed in nitrate-N concentrations between these zones throughout the year and within events. One approach to accomplishing this is coupling process-based models of soil hydrologic and biochemical processes at the watershed-scale with reservoir models often used to represent epikarst and phreatic zones. Some studies have applied modified versions of the Soil Water Assessment Tool (SWAT) (e.g., Amin et al., 2017) but have had varying success due to high epistemic and parametric uncertainties. Alternatively, better discretization of the soil layer in parsimonious reservoir models (e.g., surface, root zone, and deeper layer) may be sufficient, would significantly reduce parameterization requirements over models such as SWAT, and will be an important area for further investigation.

The results of the hysteresis analysis illustrate the importance of integrating quantitative hysteresis indices into model evaluation to ensure accurate representation of event-scale dynamics and reduce prediction uncertainty. Previous studies suggested the importance of incorporating hysteresis metrics into numerical model evaluation (Liu et al., 2021). A recent study by Husic et al. (2023) integrated nitrate hysteresis metrics into a cascading linear reservoir model simulating flow and nitrate at a daily timestep in a mature karst watershed. The authors found that including hysteresis indices as a model evaluation metric reduced uncertainty associated with both model parameterization and nitrate loading. Our results build on the findings of their study, highlighting the importance of soil profile variability on nitrate hysteresis dynamics, particularly when anti-clockwise loops are

Table 3. Results of the integrated gradients analysis showing average attribution values assigned to each of the variables used in the tested scenarios. Each set represents the averaged results from at least three runs of the specified scenario for the nitrate-N concentration and discharge models.
Attributions for Nitrate Models from IG analysisAttributions for Flow rate Models from IG analysis
Scenario 1Avg Soil Mstr 50-cm0.23Avg Soil Mstr 50-cm0.33
20-cm Temp0.2150-cm Temp0.14
50-cm Temp0.17Avg Soil Mstr 10-cm0.11
Avg Soil Mstr 10-cm0.0820-cm Temp0.10
Air Temp0.07Avg Soil Mstr 20-cm0.09
Avg Soil Mstr 20-cm0.07Air Temp0.08
Sur Temp0.0610-cm Temp0.06
10-cm Temp0.06Sur Temp0.04
Avg Rel Hum0.03Avg Rel Hum0.04
1.5m Wind Speed0.011.5m Wind Speed0.02
Sol Rad<0.01Sol Rad0.01
Precip<0.01Precip<0.01
Scenario 2Sur Temp0.49Sur Temp0.52
Avg Rel Hum0.241.5m Wind Speed0.22
Air Temp0.17Avg Rel Hum0.14
Sol Rad0.08Air Temp0.08
1.5m Wind Speed0.02Sol Rad0.04
Precip<0.01Precip<0.01
Scenario 3Avg Soil Mstr 20-cm0.16Avg Soil Mstr 50-cm0.20
20-cm Temp0.1320-cm Temp0.19
Avg Soil Mstr 10-cm0.12Avg Soil Mstr 100-cm0.12
100-cm Temp0.1150-cm Temp0.10
10-cm Temp0.1010-cm Temp0.09
Avg Soil Mstr 100-cm0.08Sur Temp0.07
Avg Soil Mstr 50-cm0.06Avg Soil Mstr 20-cm0.06
Sur Temp0.06Rel Hum0.05
Air Temp0.05Air Temp0.04
Avg Rel Hum0.05Avg Soil Mstr 10-cm0.03
50-cm Temp0.03100-cm Temp0.03
Sol Rad0.02Sol Rad0.02
1.5m Wind Speed0.021.5m Wind Speed0.01
Precip<0.01Precip<0.01

Table 4. The results for the hysteresis index (HI) analysis for Scenario 1 and Scenario 3 included TELM model predicted HI, observed HI, and the difference in hysteresis index between predicted and observed HI.
SeasonSizeStart DateEnd DateMax Q
(cms)
Predicted
HI
Observed
HI
Difference
Rain Event
Hysteresis
Scenario 1
SummerSmall29 July 20209 August 20200.18-0.12-0.630.51
Med3 July 201913 July 20190.40-0.05-0.380.34
LargeNoneNoneNoneNoneNoneNone
FallSmall19 December 202023 December 20200.140.32-0.550.87
Med28 October 202010 November 20200.36-0.52-0.260.26
Large30 October 20197 November 20191.85-0.20-0.20<0.01
WinterSmall10 February 202121 February 20210.28-0.15-0.230.07
Med30 January 202110 February 20210.84-0.19-0.410.22
Large27 February 202114 March 20212.76-0.01-0.310.30
SpringSmall23 April 202129 April 20210.080.12-0.330.45
Med2 May 202122 May 20210.76-0.16-0.400.24
Large6 June 202113 June 20211.53-0.16-0.590.43
Average-0.10-0.390.29
Rain Event
Hysteresis
Scenario 3
SummerSmall29 July 20209 August 20200.18-0.26-0.630.37
Med3 July 201913 July 20190.40NoneNoneNone
LargeNoneNoneNoneNoneNoneNone
FallSmall19 December 202023 December 20200.14-0.46-0.550.09
Med28 October 202010 November 20200.36-0.57-0.260.31
Large30 October 20197 November 20191.85NoneNoneNone
WinterSmall10 February 202121 February 20210.28-0.31-0.230.08
Med30 January 202110 February 20210.84-0.40-0.410.01
Large27 February 202114 March 20212.76-0.35-0.310.04
SpringSmall23 April 202129 April 20210.08-0.21-0.330.12
Med2 May 202122 May 20210.76-0.25-0.400.14
Large6 June 202113 June 20211.53-0.19-0.590.41
Average-0.33-0.410.08

prominent. Our findings illustrate that strong NSE values can be achieved with or without 100 cm soil moisture data; however, errors in HI values were three-fold greater when the 100 cm data were excluded, despite both modeling approaches generally capturing anti-clockwise hysteresis patterns. The synthesis of studies that integrate hysteresis metrics into model evaluation will be important to developing criteria for HI indices.

Conclusions

TELM models were developed in this study to assess the impact of soil hydrological processes on flow rate and nitrate-N concentrations exported from a heterogenous karst agroecosystem watershed. The conclusions are summarized as follows:

  1. A novel high-frequency dataset of streamflow, nitrate-N concentration, and soil moisture (at depths throughout and below the root zone) highlighted tight connectivity between soil moisture dynamics and nitrate-N concentrations spanning event to seasonal timescales.
  2. A TELM model successfully predicted both flow rate and nitrate exports from these complex systems because the time lag associated with the soil conditions was represented in model training.
  3. The inclusion of soil moisture and temperature data well below the effective root zone was important to accurately capture the storm event hysteresis dynamics observed in the exported nitrate signals.
  4. The findings have implications for revising physically based and conceptual modeling frameworks as well as model evaluation procedures for karst hydrology and water quality modeling studies.

Acknowledgments

We thank the three anonymous reviewers and the Associate Editor for their valuable comments and suggestions. We gratefully acknowledge funding support for this research from the National Science Foundation (NSF-1632888) and support from the USDA National Institute of Food and Agriculture, Multistate Project S-1089. The authors also thank the Department of Biosystems and Agricultural Engineering for partial funding support for the graduate research assistant. The authors thank Alex Fogle and the graduate and undergraduate research assistants at the University of Kentucky who assisted with sensor installation, maintenance, data collection, and analysis.

References

Agostinelli, F., Hoffman, M., Sadowski, P., & Baldi, P. (2014). Learning activation functions to improve deep neural networks. arXiv preprint arXiv: 1412.6830. https://doi.org/10.48550/arXiv.1412.6830

Amin, M. G., Veith, T. L., Collick, A. S., Karsten, H. D., & Buda, A. R. (2017). Simulating hydrological and nonpoint source pollution processes in a karst watershed: A variable source area hydrology model evaluation. Agric. Water Manag., 180, part B, 212-223. https://doi.org/10.1016/j.agwat.2016.07.011

Banerjee, C., Mukherjee, T., & Pasiliao, E. (2020). Feature representations using the reflected rectified linear unit (RReLU) activation. Big Data Min. Anal., 3(2), 102-120. https://doi.org/10.26599/BDMA.2019.9020024

Baran, N., Lepiller, M., & Mouvet, C. (2008). Agricultural diffuse pollution in a chalk aquifer (Trois Fontaines, France): Influence of pesticide properties and hydrodynamic constraints. J. Hydrol., 358(1), 56-69. https://doi.org/10.1016/j.jhydrol.2008.05.031

Barzegar, R., Asghari Moghaddam, A., Adamowski, J., & Ozga-Zielinski, B. (2018a). Multi-step water quality forecasting using a boosting ensemble multi-wavelet extreme learning machine model. Stoch. Environ. Res. Risk Assess., 32(3), 799-813. https://doi.org/10.1007/s00477-017-1394-z

Barzegar, R., Moghaddam, A. A., Deo, R., Fijani, E., & Tziritis, E. (2018b). Mapping groundwater contamination risk of multiple aquifers using multi-model ensemble of machine learning algorithms. Sci. Total Environ., 621, 697-712. https://doi.org/10.1016/j.scitotenv.2017.11.185

Basu, N. B., Destouni, G., Jawitz, J. W., Thompson, S. E., Loukinova, N. V., Darracq, A.,... Rao, P. S. (2010). Nutrient loads exported from managed catchments reveal emergent biogeochemical stationarity. Geophys. Res. Lett., 37(23). https://doi.org/10.1029/2010GL045168

Bishop, C. M. (1995). Training with noise is equivalent to Tikhonov regularization. Neural Comput., 7(1), 108-116. https://doi.org/10.1162/neco.1995.7.1.108

Blaen, P. J., Khamis, K., Lloyd, C., Comer-Warner, S., Ciocca, F., Thomas, R. M.,... Krause, S. (2017). High-frequency monitoring of catchment nutrient exports reveals highly variable storm event responses and dynamic source zone activation. J. Geophys. Res. Biogeosci., 122(9), 2265-2281. https://doi.org/10.1002/2017JG003904

Buda, A. R., & DeWalle, D. R. (2009). Dynamics of stream nitrate sources and flow pathways during stormflows on urban, forest and agricultural watersheds in central Pennsylvania, USA. Hydrol. Process., 23(23), 3292-3305. https://doi.org/10.1002/hyp.7423

Bunnell, N. L., Ford, W. I., Fogle, A. W., & Taraba, J. (2020). Reach-scale model of aquatic vegetation quantifies N fate in a bedrock-controlled karst agroecosystem stream. Water, 12(9), 2458. https://doi.org/10.3390/w12092458

Carey, R. O., Wollheim, W. M., Mulukutla, G. K., & Mineau, M. M. (2014). Characterizing storm-event nitrate fluxes in a fifth order suburbanizing watershed using in situ sensors. Environ. Sci. Technol., 48(14), 7756-7765. https://doi.org/10.1021/es500252j

Chang, L., Ming, X., Groves, C., Ham, B., Wei, C., & Yang, P. (2022). Nitrate fate and decadal shift impacted by land use change in a rural karst basin as revealed by dual nitrate isotopes. Environ. Pollut., 299, 118822. https://doi.org/10.1016/j.envpol.2022.118822

Chen, W., Li, M., Nie, J., Li, D., & Xie, Z. (2021). Experimental study on macropore flow effects in unsaturated soil on subsurface drainage and soil desalination. Irrig. Drain., 70(4), 832-844. https://doi.org/10.1002/ird.2579

Chi, G., Xing, L., Xing, X., Li, C., & Dong, F. (2020). Seepage characteristics of karst water system using temperature tracer technique. Earth Space Sci., 7(8), e2019EA000712. https://doi.org/10.1029/2019EA000712

De Neve, S., Hartmann, R., & Hofman, G. (2003). Temperature effects on N mineralization: changes in soil solution composition and determination of temperature coefficients by TDR. Eur. J. Soil Sci., 54(1), 49-62. https://doi.org/10.1046/j.1365-2389.2003.00521.x

Deo, R. C., & Sahin, M. (2016). An extreme learning machine model for the simulation of monthly mean streamflow water level in eastern Queensland. Environ. Monit. Assess., 188(2), 90. https://doi.org/10.1007/s10661-016-5094-9

Diamond, H. J., Karl, T. R., Palecki, M. A., Baker, C. B., Bell, J. E., Leeper, R. D.,... Thorne, P. W. (2013). U.S. Climate reference network after one decade of operations: Status and assessment. Bull. Amer. Meteor. Soc., 94(4), 485-498. https://doi.org/10.1175/BAMS-D-12-00170.1

Donner, S. D., Kucharik, C. J., & Foley, J. A. (2004). Impact of changing land use practices on nitrate export by the Mississippi River. Global Biogeochem. Cycles, 18(1). https://doi.org/10.1029/2003GB002093

EPA. (2007). SW-846 Test Method 9056A: Determination of inorganic anions by ion chromatography. Environmental Protection Agency. Retrieved from https://www.epa.gov/hw-sw846/sw-846-test-method-9056a-determination-inorganic-anions-ion-chromatography

Fijani, E., Barzegar, R., Deo, R., Tziritis, E., & Skordas, K. (2019). Design and implementation of a hybrid model based on two-layer decomposition method coupled with extreme learning machines to support real-time environmental monitoring of water quality parameters. Sci. Total Environ., 648, 839-853. https://doi.org/10.1016/j.scitotenv.2018.08.221

Ford, W. I., Husic, A., Fogle, A., & Taraba, J. (2019). Long-term assessment of nutrient flow pathway dynamics and in-stream fate in a temperate karst agroecosystem watershed. Hydrol. Process., 33(11), 1610-1628. https://doi.org/10.1002/hyp.13427

Ford, W., Williams, M. R., Young, M. B., King, K. W., & Fischer, E. (2018). Assessing intra-event phosphorus dynamics in drainage water using phosphate stable oxygen isotopes. Trans. ASABE, 61(4), 1379-1392. https://doi.org/10.13031/trans.12804

Geonor. (2016). T-200B Series precipitation gauge. Rev 10.10.

https://irp.cdn-website.com/05996b94/files/uploaded/T-200B-US-Manual.-rev-10.10-20150128.pdf

Germer, K., & Braun, J. (2015). Macropore — Matrix water flow interaction around a vertical macropore embedded in fine sand — Laboratory investigations. Vadose Zone J., 14(7). https://doi.org/10.2136/vzj2014.03.0030

Green, C. T., Liao, L., Nolan, B. T., Juckem, P. F., Shope, C. L., Tesoriero, A. J., & Jurgens, B. C. (2018). Regional variability of nitrate fluxes in the unsaturated zone and groundwater, Wisconsin, USA. Water Resour. Res., 54(1), 301-322. https://doi.org/10.1002/2017WR022012

Guntiñas, M. E., Leirós, M. C., Trasar-Cepeda, C., & Gil-Sotres, F. (2012). Effects of moisture and temperature on net soil nitrogen mineralization: A laboratory study. Eur. J. Soil Biol., 48, 73-80. https://doi.org/10.1016/j.ejsobi.2011.07.015

Hartmann, A. (2016). Putting the cat in the box: Why our models should consider subsurface heterogeneity at all scales. WIREs Water, 3(4), 478-486. https://doi.org/10.1002/wat2.1146

Hartmann, A., Goldscheider, N., Wagener, T., Lange, J., & Weiler, M. (2014). Karst water resources in a changing world: Review of hydrological modeling approaches. Rev. Geophys., 52(3), 218-242. https://doi.org/10.1002/2013RG000443

Huang, G., Zhou, H., Ding, X., & Zhang, R. (2012). Extreme learning machine for regression and multiclass classification. IEEE Trans. Syst. Man. Cybern. B Cybern., 42(2), 513-529. https://doi.org/10.1109/TSMCB.2011.2168604

Husic, A., Fox, J. F., Clare, E., Mahoney, T., & Zarnaghsh, A. (2023). Nitrate hysteresis as a tool for revealing storm-event dynamics and improving water quality model performance. Water Resour. Res., 59(1), e2022WR033180. https://doi.org/10.1029/2022WR033180

Husic, A., Fox, J., Adams, E., Ford, W., Agouridis, C., Currens, J., & Backus, J. (2019). Nitrate pathways, processes, and timing in an agricultural karst system: Development and application of a numerical model. Water Resour. Res., 55(3), 2079-2103. https://doi.org/10.1029/2018WR023703

Igbal, M. Z., & Krothe, N. C. (1995). Infiltration mechanisms related to agricultural waste transport through the soil mantle to karst aquifers of southern Indiana, USA. J. Hydrol., 164(1), 171-192. https://doi.org/10.1016/0022-1694(94)02573-T

Jackson, L., & Polk, J. S. (2020). Seasonal d13CDIC sourcing and geochemical flux in telogenetic epikarst of south-central Kentucky. Earth Surf. Process. Landf., 45(4), 785-799. https://doi.org/10.1002/esp.4768

Jensen, A. K., & Ford, W. I. (2019). Quantifying nitrate dynamics of a confluence floodplain wetland in a disturbed Appalachian watershed: High-resolution sensing and modeling. Trans. ASABE, 62(6), 1545-1565. https://doi.org/10.13031/trans.13278

Jin, X., Xu, C., Feng, J., Wei, Y., Xiong, J., & Yan, S. (2015). Deep learning with S-shaped rectified linear activation units. arXiv preprint arXiv: 1512.07030. https://doi.org/10.48550/arXiv.1512.07030

Kennedy, C. D., Bataille, C., Liu, Z., Ale, S., VanDeVelde, J., Roswell, C. R.,... Bowen, G. J. (2012). Dynamics of nitrate and chloride during storm events in agricultural catchments with different subsurface drainage intensity (Indiana, USA). J. Hydrol., 466-467, 1-10. https://doi.org/10.1016/j.jhydrol.2012.05.002

Klaus, J., Zehe, E., Elsner, M., Külls, C., & McDonnell, J. J. (2013). Macropore flow of old water revisited: Experimental insights from a tile-drained hillslope. Hydrol. Earth Syst. Sci., 17(1), 103-118. https://doi.org/10.5194/hess-17-103-2013

Krishnan, S. R., Nallakaruppan, M. K., Chengoden, R., Koppu, S., Iyapparaja, M., Sadhasivam, J., & Sethuraman, S. (2022). Smart water resource management using artificial intelligence - A review. Sustainability, 14(20), 13384. https://doi.org/10.3390/su142013384

Liu, W., Birgand, F., Tian, S., & Chen, C. (2021). Event-scale hysteresis metrics to reveal processes and mechanisms controlling constituent export from watersheds: A review. Water Res., 200, 117254. https://doi.org/10.1016/j.watres.2021.117254

Liu, Z., Li, Q., Sun, H., & Wang, J. (2007). Seasonal, diurnal and storm-scale hydrochemical variations of typical epikarst springs in subtropical karst areas of SW China: Soil CO2 and dilution effects. J. Hydrol., 337(1), 207-223. https://doi.org/10.1016/j.jhydrol.2007.01.034

Lloyd, C. E., Freer, J. E., Johnes, P. J., & Collins, A. L. (2016). Technical Note: Testing an improved index for analysing storm discharge-concentration hysteresis. Hydrol. Earth Syst. Sci., 20(2), 625-632. https://doi.org/10.5194/hess-20-625-2016

McCuen, R. H., Knight, Z., & Cutter, A. G. (2006). Evaluation of the Nash-Sutcliffe efficiency index. J. Hydrol. Eng., 11(6), 597-602. https://doi.org/10.1061/(ASCE)1084-0699(2006)11:6(597)

McGill, T. (2022). Assessing machine learning utility in predicting hydrologic and nitrate dynamics in karst agroecosystems. MS thesis. University of Kentucky, Biosystems and Agricultural Engineering. Retrieved from https://uknowledge.uky.edu/bae_etds/90

Mehdi, B., Schürz, C., Grath, B., & Schulz, K. (2021). Storm event impacts on in-stream nitrate concentration and discharge dynamics: A comparison of high resolution in-situ measured data with model simulations. Sci. Total Environ., 755, 143406. https://doi.org/10.1016/j.scitotenv.2020.143406

Miller, K. S., & Geisseler, D. (2018). Temperature sensitivity of nitrogen mineralization in agricultural soils. Biol. Fertil. Soils, 54(7), 853-860. https://doi.org/10.1007/s00374-018-1309-2

Moriasi, D. N., Arnold, J. G., Van Liew, M. W., Bingner, R. L., Harmel, R. D., & Veith, T. L. (2007). Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans. ASABE, 50(3), 885-900. https://doi.org/10.13031/2013.23153

Moriasi, D. N., Gitau, M. W., Pai, N., & Daggupati, P. (2015). Hydrologic and water quality models: Performance measures and evaluation criteria. Trans. ASABE, 58(6), 1763-1785. https://doi.org/10.13031/trans.58.10715

Nazari, S., Ford, W. I., & King, K. W. (2022). Impact of flow pathway and source water connectivity on subsurface sediment and particulate phosphorus dynamics in tile-drained agroecosystems. Agric. Water Manag., 269, 107641. https://doi.org/10.1016/j.agwat.2022.107641

NOAA. (2021). Instruments. https://www.ncdc.noaa.gov/crn/instruments.html

Pellerin, B. A., Bergamaschi, B. A., Downing, B. D., Saraceno, J. F., Garrett, J. D., & Olsen, L. D. (2013). Optical techniques for the determination of nitrate in environmental waters: Guidelines for instrument selection, operation, deployment, maintenance, quality assurance, and data reporting. Techniques and Methods 1-D5. 48. USGS. 10.3133/tm1D5

Prasad, R., Deo, R. C., Li, Y., & Maraseni, T. (2018). Soil moisture forecasting by a hybrid machine learning technique: ELM integrated with ensemble empirical mode decomposition. Geoderma, 330, 136-161. https://doi.org/10.1016/j.geoderma.2018.05.035

Radcliff, C., Ford, W. I., Nazari, S., & Shepard, C. (2021). Impact of water source dynamics on dissolved reactive phosphorus loadings in heterogenous karst agroecosystems with phosphatic limestones. Hydrol. Process., 35(11), e14422. https://doi.org/10.1002/hyp.14422

Reed, T. M., McFarland, J. T., Fryar, A. E., Fogle, A. W., & Taraba, J. L. (2010). Sediment discharges during storm flow from proximal urban and rural karst springs, central Kentucky, USA. J. Hydrol., 383(3), 280-290. https://doi.org/10.1016/j.jhydrol.2009.12.043

Rezaie-Balf, M., & Kisi, O. (2017). New formulation for forecasting streamflow: Evolutionary polynomial regression vs. extreme learning machine. Hydrol. Res., 49(3), 939-953. https://doi.org/10.2166/nh.2017.283

Robertson, D. M., & Saad, D. A. (2021). Nitrogen and phosphorus sources and delivery from the Mississippi/Atchafalaya River Basin: An update using 2012 SPARROW models. JAWRA, 57(3), 406-429. https://doi.org/10.1111/1752-1688.12905

Roushangar, K., Alizadeh, F., & Nourani, V. (2017). Improving capability of conceptual modeling of watershed rainfall-runoff using hybrid wavelet-extreme learning machine approach. J. Hydroinf., 20(1), 69-87. https://doi.org/10.2166/hydro.2017.011

Rusjan, S., Brilly, M., & Mikoš, M. (2008). Flushing of nitrate from a forested watershed: An insight into hydrological nitrate mobilization mechanisms through seasonal high-frequency stream nitrate dynamics. J. Hydrol., 354(1), 187-202. https://doi.org/10.1016/j.jhydrol.2008.03.009

Seybold, E., Gold, A. J., Inamdar, S. P., Adair, C., Bowden, W. B., Vaughan, M. C.,... Schroth, A. W. (2019). Influence of land use and hydrologic variability on seasonal dissolved organic carbon and nitrate export: insights from a multi-year regional analysis for the northeastern USA. Biogeochemistry, 146(1), 31-49. https://doi.org/10.1007/s10533-019-00609-x

Steppa, C., & Holch, T. L. (2019). HexagDLy — Processing hexagonally sampled data with CNNs in PyTorch. SoftwareX, 9, 193-198. https://doi.org/10.1016/j.softx.2019.02.010

Sundararajan, M., Taly, A., & Yan, Q. (2017). Axiomatic attribution for deep networks. arXiv preprint arXiv: 1703.01365. https://doi.org/10.48550/arXiv.1703.01365

Tian, S., Youssef, M. A., Richards, R. P., Liu, J., Baker, D. B., & Liu, Y. (2015). Different seasonality of nitrate export from an agricultural watershed and an urbanized watershed in Midwestern USA. J. Hydrol., 541, Part B, 1375-1384. https://doi.org/10.1016/j.jhydrol.2016.08.042

Wang, F., Chen, H., Lian, J., Fu, Z., & Nie, Y. (2018). Preferential flow in different soil architectures of a small karst catchment. Vadose Zone J., 17(1), 180107. https://doi.org/10.2136/vzj2018.05.0107

Weary, D. J., & Doctor, D. H. (2014). Karst in the United States: A digital map compilation and database. Open-File Report 2014-1156. 27. USGS. https://doi.org/10.3133/ofr20141156

Yan, Y., Yang, Y., & Dai, Q. (2023). Effects of preferential flow on soil nutrient transport in karst slopes after recultivation. Environ. Res. Lett., 18(3), 034012. https://doi.org/10.1088/1748-9326/acb8cc

Yang, S., Wang, Y., Liu, R., Zhang, A., & Yang, Z. (2017). Effect of nitrate leaching caused by swine manure application in fields of the Yellow River irrigation zone of Ningxia, China. Sci. Rep., 7(1), 13693. https://doi.org/10.1038/s41598-017-12953-9

Yaseen, Z. M., Jaafar, O., Deo, R. C., Kisi, O., Adamowski, J., Quilty, J., & El-Shafie, A. (2016). Stream-flow forecasting using extreme learning machines: A case study in a semi-arid region in Iraq. J. Hydrol., 542, 603-614. https://doi.org/10.1016/j.jhydrol.2016.09.035

Yaseen, Z. M., Sulaiman, S. O., Deo, R. C., & Chau, K.-W. (2019). An enhanced extreme learning machine model for river flow forecasting: State-of-the-art, practical applications in water resource engineering area and future research direction. J. Hydrol., 569, 387-408. https://doi.org/10.1016/j.jhydrol.2018.11.069

Yevenes, M. A., & Mannaerts, C. M. (2011). Seasonal and land use impacts on the nitrate budget and export of a mesoscale catchment in Southern Portugal. Agric. Water Manag., 102(1), 54-65. https://doi.org/10.1016/j.agwat.2011.10.006

Yue, F.-J., Waldron, S., Li, S.-L., Wang, Z.-J., Zeng, J., Xu, S.,... Oliver, D. M. (2019). Land use interacts with changes in catchment hydrology to generate chronic nitrate pollution in karst waters and strong seasonality in excess nitrate export. Sci. Total Environ., 696, 134062. https://doi.org/10.1016/j.scitotenv.2019.134062

Zhu, B., Feng, Y., Gong, D., Jiang, S., Zhao, L., & Cui, N. (2020). Hybrid particle swarm optimization with extreme learning machine for daily reference evapotranspiration prediction from limited climatic data. Comput. Electron. Agric., 173, 105430. https://doi.org/10.1016/j.compag.2020.105430