pySWATPlus is a python package for running and calibrating default or custom SWAT+ projects with Python.
With this package and by providing an existing SWAT+ model, modelers can do the following:
- Acces the TxtInOut folder used by SWAT+ and navigate through all its files in order to read, modify and write them.
- Calibrate the different SWAT+ input parameters in order to optimize the output through the Pymoo.
pySWATPlus is open source software released by ICRA. It is available for download on PyPI.
pySWATPlus can be installed via PyPI and requires additional packages to be installed first for its proper functioning. These are the commands required for installing the necessary packages:
pip install pandas
pip install numpy
pip install pymoo
pip install tqdm
pip install dask
To use this package, a Python version above 3.6 is required.
After all the requirements are met the package can be installed through the following command:
pip install -i https://test.pypi.org/simple/ pySWATPlus
https://github.com/icra/pySWATPlus/tree/main/examples
The package consists of three main features:
TxtinoutReader
FileReader
SWATProblem
This feature inicializes a TxtinoutReader
class instance that allows users to work with SWAT model data. It requires a path to the SWAT model folder as shown in the following example
from pySWATPlus.TxtinoutReader import TxtinoutReader
reader = TxtinoutReader(txtinout_folder_path)
Returns the path of the used TxtInOut folder
reader.root_folder
The path to the main SWAT executable file
reader.swat_exe_path
It allows the user to modify the begining and end year in the time.sim
file.
It takes two parameters:
beginning
(int): specifies the begining yearend
(int): specifies the end year
reader.set_beginning_and_end_year(beginning, end)
reader.set_beginning_and_end_year(2010, 2020)
This function allows the user to modify the warmup period (years) in the "time.sim" file.
As a parameter it takes the warmup
(int) value.
reader.set_warmup(warmup)
reader.set_warmup(3)
Enable or update an object in the print.prt
file. If obj
is not a default identifier, it will be added at the end of the file.
It takes the following parameters:
obj
(str): The object name or identifierdaily
(bool): Flag for daily print freuencymonthly
(bool): Flag for monthly print frequencyyearly
(bool): Flag for yearly print frequencyavann
(bool): Flag for average annual print frequency
reader.enable_object_in_print_prt(obj='object_name', daily=True, monthly=False, yearly=False, avann=False)
reader.enable_object_in_print_prt('channel_sd', True, False, False, False)
Enable CSV print in the print.prt
file
It takes no parameters
reader.enable_csv_print()
Disable CSV print in the print.prt
file
It takes no parameters
reader.disable_csv_print()
Register a file to work with in the SWAT model.
The function takes the following parameters:
filename
(str): The name of the file to registerhas_units
(bool): Indicates if the file has units information (default is False)index
(str, optional): The name of the index column (default is None)usecols
(List[str], optional): A list of column names to read (default is None)filter_by
(Dict[str, List[str]], optional): A dictionary of column names and values (list of str) to filter by (default is an empty dictionary)
The function returns a FileReader
instance for the registered file.
file = reader.register_file('filename', has_units = False, index = 'index_name', usecols=['index_name', 'col1', 'col2', 'col3'], filter_by={'col1': ['filter_value1', 'filter_value2']})
file = reader.register_file('exco.con', has_units = False, index = "name", usecols = ['name', 'gis_id', 'area', 'hyd_typ'], filter_by={'gis_id':[5, 11, 35]})
Run the SWAT simulation with modified input parameters.
The function takes the following parameters:
params
(Dict[str, Tuple[str, List[Tuple[str, str, int]]], optional): A dictionary containing modifications to input files. Format: {filename: (id_col, [(id, col, value)])}show_output
(bool, optional): If True, print the simulation output; if False, suppress output (default is True)
The function returns the path to the directory where the simulation was executed (str)
txt_in_out_result = reader.run_swat(params = {'file_name': ('id_col', [('id', 'col', value)])}, show_output=False)
txt_in_out_result = reader.run_swat(params = {'plants.plt': ('name', [('bana', 'bm_e', 45)])}, show_output=False)
For running the SWAT model without any parameter modification:
txt_in_out_result = reader.run_swat()
Copy the SWAT model files to a specified directory.
The function takes the following parameters:
-
dir (str, optional)
: The target directory where the SWAT model files will be copied. If None, a temporary folder will be created (default is None). -
overwrite
(bool, optional): If True, overwrite the content ofdir
; if False, create a new folder insidedir
(default is False).
The function returns the path to the directory where the SWAT model files were copied (str)
new_path = reader.copy_swat(dir = 'new_path', overwrite = False)
Copy the SWAT model files to a specified directory, modify input parameters, and run the simulation
The function takes the following parameters:
dir
(str): The target directory where the SWAT model files will be copiedoverwrite
(bool, optional): If True, overwrite the content ofdir
; if False, create a new folder insidedir
(default is False)params
(Dict[str, Tuple[str, List[Tuple[str, str, int]]], optional): A dictionary containing modifications to input files. Format: {filename: (id_col, [(id, col, value)])}show_output
(bool, optional): If True, print the simulation output; if False, suppress output (default is True)
The function returns the path to the directory where the SWAT simulation was executed
txt_in_out_result = reader.copy_and_run(dir="directory_path", overwrite=False, params={'file_name': [('id_col', ['id', 'col', value)])}, show_output=False)
txt_in_out_result = reader.copy_and_run(dir = "directory_path", overwrite=False, params = {'plants.plt': [('name', ['bana', 'bm_e', 45)])}, show_output=False)
Run SWAT simulations in parallel with modified input parameters.
Parameters:
params
(List[Dict[str, Tuple[str, List[Tuple[str, str, int]]]]): A list of dictionaries containing modifications to input files. Format: [{filename: (id_col, [(id, col, value)])}]n_workers
(int, optional): The number of parallel workers to use (default is 1)dir
(str, optional): The target directory where the SWAT model files will be copied (default is None)parallelization
(str, optional): The parallelization method to use ('threads' or 'processes') (default is 'threads')
The function returns a list of paths to the directories where the SWAT simulations were executed. list[str]
txt_in_out_result = reader.run_parallel_swat(params = [{'file_name': [('id_col', ['id', 'col', value)])}], n_workers = n_workers, parallelization = 'parallelization_mode')
txt_in_out_result = reader.run_parallel_swat(params = [{'plants.plt': ('name', [('bana', 'bm_e', 45)])}, {'plants.plt': ('name', [('bana', 'bm_e', 40)])}], n_workers = 2, parallelization = 'threads')
This featuer inicializes a FileReader
instance to read data from a file generated for/from SWAT.
To incicialize this class the following parameters are required:
path
(str): The path to the file.has_units
(bool): Indicates if the file has units (default is False).index
(str, optional): The name of the index column (default is None).usecols
(List[str], optional): A list of column names to read (default is None).filter_by
(Dict[str, List[str]]): A dictionary of column names and values (list of str) to filter by (default is an empty dictionary).
from pySWATPlus.FileReader import FileReader
reader = FileReader('path', has_units = False, index = 'index', usecols=['col1', 'col2', 'col3'], filter_by={'col1': 'filter'})
from pySWATPlus.FileReader import FileReader
reader = FileReader('TxtInOut\\plants.plt', has_units = False, index = 'name', usecols=['name', 'plnt_typ', 'gro_trig'], filter_by={'plnt_typ': 'perennial'})
Returns a reference to the pandas DataFrame containing the data read from the file.
reader.df
Overwrite the original file with the DataFrame.
It doesn't take any parameters and neither returns anything.
reader.overwrite_file()
This feature inicializes a SWATProblem
instance, which is used to perform optimization of the desired SWAT+ parameters by using the pymoo
library.
The SWATProblem
class takes the following parameters:
-
params
(Dict[str, Tuple[str, List[Tuple[str, str, int, int]]]]): A dictionary containing the range of values to optimize. Format: {filename: (id_col, [(id, col, upper_bound, lower_bound)])} -
function_to_evaluate
(Callable): An objective function to minimize. This function, which has to be created entirely by the user, should be responsible for adjusting the necessary values based on the calibration iteration, running SWAT, reading the results, comparing them with observations, and calculating an error measure. The function must receive a single argument, which is a dictionary that can contain any user-defined items. However, it must receive at least one item (named as indicated byparam_arg_name
), which takes a dictionary in the format {filename: (id_col, [(id, col, value)])}, representing the current calibration values. Format: function_to_evaluate(Dict[Any, Any]) -> Tuple[int, Dict[str, str]] where the first element is the error produced in the observations and the second element is a dictionary containing a user-desired identifier as the key and the location where the simulation has been saved as the value. -
param_arg_name
(str): The name of the argument withinfunction_to_evaluate
function where the current calibration parameters are expected to be passed. This parameter must be included in**kwargs
-
n_workers
(int, optional): The number of parallel workers to use (default is 1). -
parallelization
(str, optional): The parallelization method to use ('threads' or 'processes') (default is 'threads'). -
debug
(bool, optional): If True, print debug output during optimization (default is False). -
**kwargs
: Additional keyword arguments, that will be passed to thefunction_to_evaluate
.
from pySWATPlus.SWATProblem import SWATProblem
problem = SWATProblem(params = {"filename": (id_col, [(id, col, lb, up)])}, function_to_evaluate = function, param_arg_name = "name", n_workers = 2, parallelization = 'threads', debug = False, kwarg1 = arg1, kwarg2 = arg2)
This function performs the optimization by using the pymoo library.
It takes the following parameters:
problem
(pyswatplus SWATProblem): The optimization problem defined using the SWATProblem class.algorithm
(Algorithm): The optimization algorithm defined using the pymoo Algorithm class.termination
(Termination): The termination criteria for the optimization defined using the pymoo Termination class.seed
(Optional[int], optional): The random seed for reproducibility (default is None).verbose
(bool, optional): If True, print verbose output during optimization (default is False).callback
(Optional[Callable], optional): A callback function that is called after each generation (default is None).
It returns the best solution found during the process. The output format is a tuple containing the decision variables, the path to the output files, and the error (Tuple[np.ndarray, Dict[str, str], float]).
from pySWATPlus.SWATProblem import SWATProblem, minimize_pymoo
x, path, error = minimize_pymoo(self.swat_problem, algorithm, termination, seed = 1, verbose = True, callback = MyCallback())
This class serves the same purpose as SWATProblem
, with the added capability of running another model before executing SWAT+. This enables running a prior model in the same calibration process, wherein the parameters are calibrated simultaneously. For example, the prior model can modify an input file of SWAT+ before initiating SWAT+ (according to the parameters of the calibration).
The SWATProblemMultimodel
class takes the following parameters:
-
params
(Dict[str, Tuple[str, List[Tuple[str, str, int, int]]]]): A dictionary containing the range of values to optimize. Format: {filename: (id_col, [(id, col, upper_bound, lower_bound)])} -
function_to_evaluate
(Callable): An objective function to minimize. This function, which has to be created entirely by the user, should be responsible for adjusting the necessary values based on the calibration iteration, running SWAT, reading the results, comparing them with observations, and calculating an error measure. The function must receive a single argument, which is a dictionary that can contain any user-defined items. However, it must receive at least one item (named as indicated byparam_arg_name
), which takes a dictionary in the format {filename: (id_col, [(id, col, value)])}, representing the current calibration values. Format: function_to_evaluate(Dict[Any, Any]) -> Tuple[int, Dict[str, str]] where the first element is the error produced in the observations and the second element is a dictionary containing a user-desired identifier as the key and the location where the simulation has been saved as the value. -
param_arg_name
(str): The name of the argument withino
function_to_evaluatefunction where the current calibration parameters are expected to be passed. This parameter must be included in
**kwargs``` -
n_workers
(int, optional): The number of parallel workers to use (default is 1). -
parallelization
(str, optional): The parallelization method to use ('threads' or 'processes') (default is 'threads'). -
ub_prior
(List[int], optional): Upper bounds list of calibrated parameters of the prior model. Default is None. -
lb_prior
(List[int], optional): Lower bounds list of calibrated parameters of the prior model. Default is None. -
function_to_evaluate_prior
(Callable, optional): Prior function to be used for modifying parameters before SWAT+ simulation. Must take the name indicated byargs_function_to_evaluate_prior
as a mandatory argument, and must be a np.ndarray, so in the source code the following is done:function_to_evaluate_prior(args_function_to_evaluate_prior = np.ndarray, ...)
. Must return a value that will be used to modify a parameter in the kwargs dictionary. Default is None. -
args_function_to_evaluate_prior
(Dict[str, Any], optional): Additional arguments for function_to_evaluate_prior.args_function_to_evaluate_prior
does not have to be included here. This dictionary will be unpacked and passed as keyword arguments ofargs_function_to_evaluate_prior
. -
debug
(bool, optional): If True, print debug output during optimization (default is False). -
param_arg_name_to_modificate_by_prior_function
(str, optional): Parameter modified in kwargs by the return offunction_to_evaluate_prior
, so in the source code the following is done:kwargs[param_arg_name_to_modificate_by_prior_function] = function_to_evaluate_prior(...)
-
**kwargs
: Additional keyword arguments, that will alse be passed to thefunction_to_evaluate
.
from pySWATPlus.SWATProblem import SWATProblem
problem = SWATProblem(params = {"filename": (id_col, [(id, col, lb, up)])}, function_to_evaluate = function, param_arg_name = "name", n_workers = 2, parallelization = 'threads', ub_prior = [ub_param1, ub_param2, ub_param3], lb_prior = [lb_param1, lb_param2, lb_param3], args_function_to_evaluate_prior = 'name2', function_to_evaluate_prior = function_to_evaluate_prior, param_arg_name_to_modificate_by_prior_function = "param_name", debug=False, kwarg1 = arg1, kwarg2 = arg2)
This function performs the optimization by using the pymoo library.
It takes the following parameters:
problem
(pyswatplus SWATProblem): The optimization problem defined using the SWATProblem class.algorithm
(Algorithm): The optimization algorithm defined using the pymoo Algorithm class.termination
(Termination): The termination criteria for the optimization defined using the pymoo Termination class.seed
(Optional[int], optional): The random seed for reproducibility (default is None).verbose
(bool, optional): If True, print verbose output during optimization (default is False).callback
(Optional[Callable], optional): A callback function that is called after each generation (default is None).
It returns the best solution found during the process. The output format is a tuple containing the decision variables, the path to the output files, and the error (Tuple[np.ndarray, Dict[str, str], float]).
from pySWATPlus.SWATProblem import SWATProblem, minimize_pymoo
x, path, error = minimize_pymoo(self.swat_problem, algorithm, termination, seed = 1, verbose = True, callback = MyCallback())