Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Automated calibration requires the use of an objective function to direct the optimisation process. The Source calibration tool implements single objective optimisation, which reduces the comparison between the observed and modelled data during the calibration period to a single number to be optimised (multiple objective optimisation is also available, see Multi-objective optimisation - Insight - SRG for information).

Source implements five different basic types of objective function:

...

Overview information on the four optimisation techniques in Source is available in Vaze et al. (2011).  Further information is in textbooks and papers, particularly for the genetic algorithm and uniform random sampling[DB1] .  Publications on the shuffled complex evolution method include papers by Duan et al. (1992) and Sorooshian et al. (1993).  Publications on the Rosenbrock method include the paper by Rosenbrock (1960).

...

Availability

Provided with Source.

Implementation

 

Some of these objective functions can be combined to create composite objective functions. For composite objective functions, the user is often able to enter a weight that determines the relative contribution of each objective function component to the 

 

  1. Nash Sutcliffe Coefficient of Efficiency (NSE) of Daily Flows
  2. Minimise Absolute Bias between Observed and Modelled Flows (calculated using daily flows)
  3. Match to NSE of Daily Flows but Penalise Biased Solutions
  4. Match to NSE of Monthly Flows
  5. Match to NSE of Monthly Flows but Penalise Biased Solutions
  6. Combined Match to NSE and Match to Flow Duration Curve (Daily)
  7. Combined Match to NSE and Match to Logarithm of Flow Duration Curve (Daily)
  8. Combined Match to NSE of Logarithms of Daily Flows with Bias Penalty
  9. Combined Bias, Daily Flows and Daily Exceedance (Flow Duration) Curve (SDEB)

Further information on the first seven of these objective functions is available in Vaze et al. (2011), Section 6. Guidance on model calibration is available in many publications, including various eWater Best Modelling Practice Guidelines (Black et al., 2011; Vaze et al., 2011; Black and Podger, 2012; and Lerat, 2012).

Structure & processes

Background

As the optimisation techniques and statistical measures of calibration performance used in Source are well established, they are not re-described here. However, as the objective functions used in the optimisation techniques have been customised for Source, further information on these follows and as many of them rely on the Nash Sutcliffe Coefficient of Efficiency (NSE), its formulation is restated below.

The choice of any particular objective function will depend on the intended application. Each of the pre-defined objective functions are formulated to put emphasis (reproduce as closely as possible) on different flow characteristics (Vaze et al, 2011).

The discussion below assumes that the objective functions are being applied to streamflow data but they can be applied to any time series data.

...

Minimise a combination of the bias, daily Flows and daily exceedance (flow duration) curve

 

Missing Data

It is common for observed time series of hydrological processes to contain missing values. Also, the observed and modelled time series may have different start and end dates. The Source calibration tool calculates the objective function values using only data from those time steps for which both observed and modelled data is available.

The descriptions of the objective function equations assume that the observed and modelled data has been filtered to include only:

  • data from within the calibration period, and
  • data for time steps with complete data pairs.

Nash Sutcliffe Coefficient of Efficiency (NSE)

NSE Daily

Application of this objective function involves maximising the NSE (i.e. getting it as close to 1.0 as possible).  The calculation of the NSE is in accordance with Nash and Sutcliffe (1970) and uses observed and modelled daily flow data for all days within the calibration period for which observed daily flow data, including zero flow values (i.e. cease to flow), is available.

The NSE tends to produce solutions that match high and moderate flows very well but often will produce poor fits to low flows. It will also tend to favour solutions that provide a good match to the timing and shape of runoff events (Vaze et al., 2011).

The traditional formula for the NSE is:

...

Image Removed

where:

Qobsi    is the observed flow on day i,

Qsimi    is the modelled flow on day i,

N           is the number of days

Alternatively, the NSE may be written as:

...

Image Removed

This formulation obviates the necessity to calculate the average of the observed flows before evaluating the denominator in the traditional version.

NSE Log Daily

This objective function uses the same equation as for the NSE of daily flows (equation (1)), but applies it to log transformed data:

...

Image Removed

where c is a small constant equal to the maximum of 1 ML and the 10th percentile of the observed flow.

NSE Monthly

This objective function uses the same equation as for the NSE of daily flows (equation (1)), but applies it to monthly rather than daily data:

  • If the model is run on a daily time step, Qobsbecomes the sum of the observed flows for month i and Qsimi becomes the sum of the modelled flow for month i. The NSE calculation ignores observed and modelled data for all months where there are one or more days of missing data in the observed flow series. 
  • If the model is run on a monthly time step, then the monthly values are unchanged.

The NSE of monthly flows can be useful for initial calibration because it tends to find solutions that will match the overall movement of water through the conceptual stores in the rainfall-runoff model, without being influenced by the timing of individual runoff events (Vaze et al., 2011).

Flow Duration 

Flow Duration

The flow duration objective function sorts the observed and modelled data values in increasing order and then calculates the NSE of the sorted data.

Log Flow Duration

This objective function calculates the flow duration objective function using the log transformed flows in Equation (3).

Absolute Bias

This objective function will produce a match on the overall volume of flow generated but often will produce a poor fit to the timing of flows (Vaze et al., 2011). It has the following form:

...

Image Removed

Background

The optimisation techniques and statistical measures of calibration performance used in Source are well established and are not described in detail here. Guidance on model calibration is available in many publications, including various eWater Best Modelling Practice Guidelines (Black et al., 2011; Vaze et al., 2011; Black and Podger, 2012; and Lerat, 2012).

The choice of an appropriate objective function for calibration depends on the intended application of the model. Different objective functions are designed with the intention of emphasizing the fit of modelled flow to different aspects of the observed hydrograph (Vaze et al., 2011). The objective functions available in Source are listed in the table below, including useful references for further information. 

Objective Function NameDescriptionReference
NSE DailyMaximise the NSE of daily flowsVaze et al. (2011), Section 6
NSE MonthlyMaximise the NSE of monthly flowsVaze et al. (2011), Section 6
NSE Log DailyMaximise the NSE of the logarithm of daily flows 
Absolute BiasMinimise the Absolute value of the relative biasVaze et al. (2011), Section 6

NSE Daily & Bias PenaltyMaximise the NSE of daily flows and bias penaltyVaze et al. (2011), Section 6
NSE Log Daily & Bias PenaltyMaximise the NSE of the logarithm of daily flows and bias penalty
NSE Monthly & Bias PenaltyMaximise the NSE of monthly flows and bias penaltyVaze et al. (2011), Section 6
NSE Daily & Flow DurationMaximise the NSE of daily flows and the NSE of the flow durationVaze et al. (2011), Section 6
NSE Daily & Log Flow DurationMaximise the NSE of daily flows and the NSE of the flow duration of log flowsVaze et al. (2011), Section 6
Square-root Daily, Exceedance and Bias

Minimise a combination of the bias, daily Flows and daily exceedance (flow duration) curve

Lerat et al., 2013

Since some of the objective functions have been customised for use in Source, the objective function equations are defined in the following sub-sections. The discussion assumes that the objectives are being applied to streamflow data, but the equations are applicable to any time series data.

Missing Data

It is common for observed time series of hydrological processes to contain missing values. Also, the observed and modelled time series may have different start and end dates. The Source calibration tool calculates the objective function values using only data from those time steps for which both observed and modelled data is available.

The descriptions of the objective function equations assume that the observed and modelled data has been filtered to include only:

  • data from within the calibration period, and
  • data for time steps with complete data pairs.

Nash Sutcliffe Coefficient of Efficiency (NSE)

NSE Daily

Application of this objective function involves maximising the NSE (i.e. getting it as close to 1.0 as possible). The calculation of the NSE is in accordance with Nash and Sutcliffe (1970) and uses observed and modelled daily flow data for all days within the calibration period for which observed daily flow data, including zero flow values (i.e. cease to flow), is available.

The NSE tends to produce solutions that match high and moderate flows very well but often will produce poor fits to low flows. It will also tend to favour solutions that provide a good match to the timing and shape of runoff events (Vaze et al., 2011).

The traditional formula for the NSE is:

Equation 1

Image Added

where:

Qobsi    is the observed flow on day i,

Qsimi    is the modelled flow on day i,

N           is the number of days

An alternative, but equivalent, formulation of the NSE is:

Equation 2

Image Added

This formulation obviates the necessity to calculate the average of the observed flows before evaluating the denominator in the traditional version.

NSE Log Daily

This objective function uses the same equation as for the NSE of daily flows (equation (1)), but applies it to log transformed data:

Equation 3
Image Added

where c is a small constant equal to the maximum of 1 ML and the 10th percentile of the observed flows. The use of this constant is intended to de-emphasise very small flows, which tend to be unreliable, and overcome the problem of trying to take logarithms of zero flows.

NSE Monthly

This objective function uses the same equation as for the NSE of daily flows (equation (1)), but applies it to monthly rather than daily data:

  • If the model is run on a daily time step, Qobsbecomes the sum of the observed flows for month i and Qsimi becomes the sum of the modelled flow for month i. The NSE calculation ignores observed and modelled data for all months where there are one or more days of missing data in the observed flow series. 
  • If the model is run on a monthly time step, then the monthly values are unchanged.

The NSE of monthly flows can be useful for initial calibration because it tends to find solutions that will match the overall movement of water through the conceptual stores in the rainfall-runoff model, without being influenced by the timing of individual runoff events (Vaze et al., 2011).

Flow Duration 

Flow Duration

The flow duration objective function sorts the observed and modelled data values in increasing order and then calculates the NSE of the sorted data.

Log Flow Duration

This objective function uses log transformed flows, as described in Equation (3). The log transformed observed and modelled flows are sorted increasing order and the NSE is calculated on the sorted data.

Absolute Bias

This objective function will produce a match on the overall volume of flow generated but often will produce a poor fit to the timing of flows (Vaze et al., 2011). It has the following form:

Equation 4

Image Added

The evaluation of this objective function uses observed and modelled daily flow data for all days within the calibration period for which observed daily flow data, including zero flow values, is available.

Bias Penalty

The bias penalty objective function is described in Viney et al. (2009). The equation is given by:

Equation 5

Image Added

where is the absolute value of the relative bias, as defined in equation (4).

In Source, the Bias Penalty is always used in combination with other objective functions and is not available on its own.

Combinations of the NSE, Flow Duration and Bias Penalty Objective Functions

NSE Daily and Bias Penalty

This objective function is a weighted combination of the daily NSE and the bias penalty developed by Viney et al. (2009), and the aim is to maximise its value. 

Equation 6
NSE Daily & Bias Penalty = NSE Daily - Bias Penalty

where 

NSE Daily is defined in equation (1)

Bias Penalty is defined in equation (5)

This formulation makes sure that the models are calibrated predominantly to optimise NSE while ensuring a low bias in the total streamflow. It avoids solutions that produce biased estimates of overall runoff, which can produce marginal improvements in low flow performance over the NSE objective function. However, NSE-Bias will still be strongly influenced by moderate and high flows and by the timing of runoff events, which can still often result in poor fits to low flows (Vaze et al., 2011). 


The evaluation of this objective function uses observed and modelled daily flow data for all days within the calibration period for which observed daily flow data, including zero flow values, is available.

NSE Log Daily & Bias Penalty

The bias penalty This objective function is described in Viney et al. (2009). The equation is given by:

...

Image Removed

where is the absolute value of the relative bias, as defined in equation (3).

In Source, the Bias Penalty is always used in combination with other objective functions and is not available on its own.

Combinations of the NSE, Flow Duration and Bias Penalty Objective Functions

NSE Daily and Bias Penalty

...

given by:

Equation 7

NSE Log Daily & Bias Penalty = NSE Log Daily – Bias Penalty 

where 

NSE Log Daily is defined in equations (1) and (3)

Bias Penalty is defined in equation (5)

This objective function captures the model’s ability to fit the shape of the observed daily flow hydrograph, with an emphasis on mid-range to low flows (in contrast to the NSE Daily which tends to put an emphasis on medium to high flows), while ensuring a low bias in the total streamflow.

NSE Monthly and Bias Penalty

This objective function is the weighted combination of the monthly NSE and the bias penalty (Viney et al., 2009), and the aim is to find its maximum value:

Equation 48
NSE Daily Monthly & Bias Penalty = NSE Daily Monthly - Bias Penalty

where 

NSE Daily Monthly is defined in equation ()above

Bias Penalty is defined in equation (5)

This formulation makes sure that the models are calibrated predominantly to optimise NSE while ensuring a low bias in the total streamflow. It avoids solutions that produce biased estimates of overall runoff, which can produce marginal improvements in low flow performance over the NSE objective function. However, NSE-Bias will still be strongly influenced by moderate and high flows and by the timing of runoff events, which can still often result in poor fits to low flows (Vaze et al, 2011).The evaluation of this objective function uses observed and modelled daily flow data for all days within the calibration period for which observed daily flow data, including zero flow values, is available.

NSE Monthly and Bias Penalty

This objective function is the weighted combination of the monthly NSE and a logarithmic function of bias (Viney et al, 2009), and the aim is to find its maximum value.  The equation used is the same as for the case “Match to NSE of Daily Flows but Penalise Biased Solutions” above.  The NSE and Bias calculations ignore observed and modelled data for all months where there are one or more days of missing data in the observed flow series.

6. Combined Match to NSE and Match to Flow Duration Curve (Daily)

For this case the aim is to maximise the objective function, where:

...

where:

A is

NSE Daily and Flow Duration

For this case, the aim is to maximise the objective function given by:

Equation 9NSE Daily & Flow Duration = a * NSE Daily + (1 - a) * Flow Duration

where:

a is user-defined weighting factor (0  a ≤ 1); and

NSE Daily is defined in equation (1)

Flow Duration is defined above

NSE Daily and Log Flow Duration

For this case the aim is to maximise the objective function, where:

Equation 10NSE Daily & Log Flow Duration = a * NSE Daily + (1 - a) * Log Flow Duration

where:

is user-defined weighting factor ( a ≤ 1); and

NSE Daily is defined in equation (1)

Flow Duration is defined above

Combined Bias, Daily Flows and Daily Exceedance (Flow Duration) Curve (SDEB)

This objective function is based on the function introduced by Coron et al. (2012) and has been successfully applied in a number of projects (e.g. Lerat et al., 2013).  It has the following equation:

Equation 11
Image Added

where:

α is a weighting factor whose value can be set by the modeller (0 A 0 ≤ α ≤ 1); and

NSE daily FDC is calculated using ranked value pairs of Qobsi and Qsimi.

This objective function and the following objective function are hybrids that compromise between the fit to the timing of high and moderate flows from the NSE component and the fit to the shape of the whole flow duration curve (FDC).  The NSE-logFDC (below) will produce the closer fit to low flows (Vaze et al, 2011).

7. Combined Match to NSE and Match to Logarithm of Flow Duration Curve (Daily)

For this case the aim is to maximise the objective function, where:

Equation 7Objective function = A * NSE daily flows + (1 - A) * NSE log10(daily FDC)

where:

...

NSE log10(daily FDC) is calculated using ranked value pairs of log10(Qobsi+c) and log10(Qsimi+c).

c is the maximum of 1 ML and the 10th percentile of the observed flows.  The use of this constant is intended to de-emphasise very small flows, which tend to be unreliable, and overcome the problem of trying to take logarithms of zero flows.

8.  NSE Log Daily & Bias Penalty Objective Function

This objective function is given by:

...

NSE(logarithms of daily flows) is calculated using value pairs of  ln(Qobsi+c) and ln(Qsimi+c), where B and v are defined in the same way as above.  The Bias Penalty is based on Viney et al (2009) and is: 

...

Image Removed

This objective function captures the model’s ability to fit the shape of the observed daily flow hydrograph, with an emphasis on mid-range to low flows (in contrast to the arithmetic form of the NSE which tends to put an emphasis on medium to high flows), while ensuring a low bias in the total streamflow.

 

Combined Bias, Daily Flows and Daily Exceedance (Flow Duration) Curve (SDEB)

This objective function is based on the function introduced by Coron et al. (2012) and has been successfully applied in a number of projects (e.g. Lerat et al., 2013).  It has the following equation:

...

Image Removed

where:

α is a weighting factor whose value can be set by the modeller (0 ≤ α ≤ 1).

RQobsk is the k’th ranked observed flow of a total of N ranked flows,

RQsimk is the k’th ranked modelled flow of a total of N ranked flows, and

Other terms are as defined previously.

As explained by Lerat et al. (2013), this function combines three terms: (i) the sum of squared errors on power transform of flow, (ii) the same sum on sorted flow values and (iii) the relative simulation bias.

The coefficient α and the power transform are used to balance the three terms within the objective function.  The weighting factor α is used to reduce the impact of the timing errors on the objective function. This type of error can have a significant effect on the first term in the equation, where a slight misalignment of observed and simulated peak flow timing can result in large amplitude errors. Conversely, the second term is based on sorted flow values, which remain unaffected by timing errors. By way of example, Lerat et al. (2013) in their study of the Flinders and Gilbert Rivers in Northern Australia used values of α of 0.1 for the Flinders calibration and 1.0 for the Gilbert calibration.

Using values of power transform of less than 1 has the effect of reducing the weight of the errors in high flows, where the flow data are known to be less accurate.  Lerat et al (2013) found that a power transform of ½ led to the best compromise between high and low flow performance in their project. This value has been adopted in Source.

Data

Input data

Details on data to be input by the modeller are provided in the Source User Guide.  Requirements for data series inputs to the various objective functions are included in the descriptions of each objective function, above.

Parameters or settings

Modellers have the option of selecting one optimisation technique, multiple optimisation techniques (in parallel), or combinations two optimisation techniques (in series), or not using optimisation.  Modellers can also select which objective function they wish to use.  The other parameters the modeller can input are described in the following table:

...

Weighting factor for the objective function in case 9

...

Objective Function

...

.

RQobsk is the k’th ranked observed flow of a total of N ranked flows,

RQsimk is the k’th ranked modelled flow of a total of N ranked flows, and

Other terms are as defined previously.

As explained by Lerat et al. (2013), this function combines three terms: (i) the sum of squared errors on power transform of flow, (ii) the same sum on sorted flow values and (iii) the relative simulation bias.

The coefficient α and the power transform are used to balance the three terms within the objective function.  The weighting factor α is used to reduce the impact of the timing errors on the objective function. This type of error can have a significant effect on the first term in the equation, where a slight misalignment of observed and simulated peak flow timing can result in large amplitude errors. Conversely, the second term is based on sorted flow values, which remain unaffected by timing errors. By way of example, Lerat et al. (2013) in their study of the Flinders and Gilbert Rivers in Northern Australia used values of α of 0.1 for the Flinders calibration and 1.0 for the Gilbert calibration.

Using values of power transform of less than 1 has the effect of reducing the weight of the errors in high flows, where the flow data are known to be less accurate.  Lerat et al. (2013) found that a power transform of ½ led to the best compromise between high and low flow performance in their project. This value has been adopted in Source.

Data

Input data

Details on data to be input by the modeller are provided in the Source User Guide. Requirements for data series inputs to the various objective functions are included in the descriptions of each objective function, above.

Parameters or settings

Modellers have the option of selecting one optimisation technique, two optimisation techniques (in series), or manual optimisation.  Modellers can also select which objective function they wish to use.  The other parameters the modeller can input are described in the following table:

Objective Function
ParameterParameter DescriptionUnitsDefaultRange
NSE Daily & Flow DurationaWeight on NSE in the combined objectiveDimensionless0.50 ≤ α ≤ 1
NSE Daily & Log Flow DurationaWeight on NSE in the combined objectiveDimensionless0.50 ≤ α ≤ 1

Output data

Outputs include results of the evaluation of the selected objective function and other calibration performance statistics. 

...

Viney, N.R., Perraud, J-M., Vaze, J., Chiew F.H.S., Post, D.A. and Yang, A. (2009). The usefulness of bias constraints in model calibration for regionalisation to ungauged catchments.  In: 18th World IMACS Congress and MODSIM09 International Congress on Modelling and Simulation, July 2009, Cairns: Modelling and Simulation Society of Australian and New Zealand and International Association for Mathematics and Computers in Simulation: 3421-3427.