Using the Virtual Ecosystem#

This page provides a brief demonstration of the Virtual Ecosystem model in operation. Once you have installed the Virtual Ecosystem, you should be able to replicate this example on your own computer using the commands below.

Example model data#

The demonstration requires an installation of the example data provided with the Virtual Ecosystem package. If you have previously attempted to run this example then the simulation will refuse to overwrite existing output files. You can either:

delete the existing example data folder and reinstall it,
create a fresh installation using a different location, or
create and use a new output directory with the existing example data folder.
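
As a minimal sketch of the third option, the helper below picks an output directory name that is not yet in use inside an installed example folder. The function name `next_free_output_dir` and the `out`, `out_2`, `out_3` naming scheme are illustrative assumptions and are not part of the Virtual Ecosystem package.

```python
from pathlib import Path


def next_free_output_dir(example_dir: Path, stem: str = "out") -> Path:
    """Return the first path named stem, stem_2, stem_3, ... that does not exist.

    Hypothetical helper: the naming scheme is an assumption for illustration.
    The directory is not created, only its path is returned.
    """
    candidate = example_dir / stem
    counter = 2
    while candidate.exists():
        candidate = example_dir / f"{stem}_{counter}"
        counter += 1
    return candidate


if __name__ == "__main__":
    # Assumes the example data was installed under /tmp/ve_example.
    print(next_free_output_dir(Path("/tmp/ve_example")))
```

The returned path can then be passed to the simulation as its output directory, leaving earlier results untouched.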

It is worth re-reading the example data page to get an overview of the directory structure and the configuration and data files.

The `ve_run` command#

You’ve already used this command to install the example data, but most of the options to the `ve_run` command are used to run the simulation. The `--help` option shows the arguments that control how a model runs:

macOS/Linux

ve_run --help

Windows (CMD)

ve_run --help

Windows (Powershell)

ve_run --help

usage: ve_run [-h] [--version] [--install-example INSTALL_EXAMPLE]
              [-o OUTPATH] [-c CLI_CONFIG] [--validate-config-only]
              [-p CLI_PATHS] [--logfile LOGFILE] [-q]
              [cfg_paths ...]

Configure and run a Virtual Ecosystem simulation.

This program sets up and runs a Virtual Ecosystem simulation. The program expects
to be provided with paths to TOML formatted configuration files for the simulation.
The configuration is modular: a directory path can be used to add all TOML
configuration files in the directory, or individual file paths can be used to select
specific combinations of configuration files. These are combined and validated and
then used to initialise and run the model.

As an alternative to providing configuration paths, the `--install-example` option
allows users to provide a location where a simple example set of datasets and
configuration files provided with the Virtual Ecosystem package can be installed.
This option will create a `ve_example` directory in the location, and users can
examine the input files and run the simulation from that directory:

`ve_run /provided/install/path/ve_example`

The output directory for simulation results is typically set in the configuration
files, but can be overwritten using the `--outpath` option. A log file path can be
provided for logging output. If this is not provided then the log will be written to
the console, but the logging is typically verbose and it is usually better to
redirect the log to a file.

When logging is redirected to a file, a short progress report is written to stdout.
By default, the command reports: the start and end of the simulation and log
location; the completion of simulation stages; and a progress bar over the time
steps of the model. The `--quiet` command can be used to incrementally mute this
output: `-q` will remove the progress bar, `-qq` just prints the start and stop and
`-qqq` mutes the report entirely.

The `--config` option can be used to override configuration settings provided in the
file or to add additional settings. This is typically used to run a set of parallel
simulations that vary configuration settings of interest around a central
configuration setup, without the need to write a specific configuration file for
each permutation.

The `--data-path` option can be used to dynamically set the location of data paths
in the configuration. A file path in the config can be set as a path marker, which
must be a string starting with a "$", for example "$CLIMATE_DATA". This option can
then be used to substitute different files into that marker for different runs:
`--data-path CLIMATE_DATA=/path/to/file.nc`.

The `--validate-config-only` flag can be used to only run the configuration
validation part of the model setup and the exit before running any models.

The resolved complete configuration will then be written to a single consolidated
config file in the output path with a default name of
`ve_full_model_configuration.toml`. This can be disabled by setting the
`core.data_output_options.save_merged_config` option to false. Note that the merged
configuration automatically converts all file paths within the merged configurations
to absolute file paths - this ties the merged configuration to the file system where
the run is executed.

positional arguments:
  cfg_paths             Paths to config files

options:
  -h, --help            show this help message and exit
  --version             show program's version number and exit
  --install-example INSTALL_EXAMPLE
                        Install the Virtual Ecosystem example data to the
                        given location
  -o, --outpath OUTPATH
                        Path for output files
  -c, --config CLI_CONFIG
                        Override configuration settings
  --validate-config-only
                        Exit after validating configuration
  -p, --data-path CLI_PATHS
                        Set data paths used for input data
  --logfile LOGFILE     A file path to use for logging a Virtual Ecosystem
                        simulation
  -q, --quiet           Quieten the default progress reporting
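
The `--data-path` substitution described in the help text lends itself to scripted runs. The sketch below only builds the argument lists for a set of runs and does not execute them. The marker name `CLIMATE_DATA` comes from the help text; the function name, the file names and the one-output-directory-per-file layout are hypothetical.

```python
from pathlib import Path


def data_path_commands(
    cfg_paths: list[str], marker: str, data_files: list[Path], out_root: Path
) -> list[list[str]]:
    """Build one ve_run argument list per data file.

    Each run substitutes a different file for the path marker and writes to
    its own output directory, named after the data file (an assumption).
    """
    commands = []
    for data_file in data_files:
        outpath = out_root / data_file.stem
        commands.append(
            [
                "ve_run",
                *cfg_paths,
                "--data-path",
                f"{marker}={data_file}",
                "--outpath",
                str(outpath),
                "--logfile",
                str(outpath / "logfile.log"),
            ]
        )
    return commands


# Hypothetical inputs: two climate files swapped into the $CLIMATE_DATA marker.
commands = data_path_commands(
    cfg_paths=["/path/to/config"],
    marker="CLIMATE_DATA",
    data_files=[Path("/path/to/wet.nc"), Path("/path/to/dry.nc")],
    out_root=Path("/path/to/runs"),
)
for command in commands:
    print(" ".join(command))
```

Each argument list could then be handed to `subprocess.run`, which keeps the configuration files identical across runs while only the substituted data file changes.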

Running the example model#

The code below runs a simulation using the example data. The command uses the command line options to set three things:

It points to the files in the config directory that should be used to configure the model. N.B. this directory contains configuration for all possible model combinations, so providing just the config directory as a path will result in an invalid configuration.
It sets the output directory used by the simulation to out. You could create a new output directory (e.g. out_test_2) and change this option to run a new simulation using the existing data.
It redirects the model logging to a file in the output directory, rather than printing it all to screen.

When the detailed logging is redirected to a file, the command generates a short progress report to show the model running. This can be made shorter or completely muted by using the -q argument: repeat the argument to remove more details (e.g. -qq or -qqq).

Warning

If the path provided for the log points to a file that already exists, the detailed logging is appended to the end of that file rather than written to a new file. We recommend creating a new log file for each simulation, as reusing files in this way can create confusion.
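
One way to follow this advice is to put a timestamp in the log file name. The sketch below is an illustration only: the helper name `unique_logfile` and the `logfile_YYYYMMDD_HHMMSS.log` pattern are assumptions, not a convention used by the Virtual Ecosystem.

```python
from datetime import datetime
from pathlib import Path
from typing import Optional


def unique_logfile(out_dir: Path, now: Optional[datetime] = None) -> Path:
    """Return a log file path in out_dir with a timestamp in its name.

    Hypothetical naming pattern: logfile_YYYYMMDD_HHMMSS.log
    """
    now = now or datetime.now()
    return out_dir / f"logfile_{now:%Y%m%d_%H%M%S}.log"


if __name__ == "__main__":
    # Assumes the example data was installed under /tmp/ve_example.
    print(unique_logfile(Path("/tmp/ve_example/out")))
```

Runs started in different seconds get distinct log files, so no run appends to an earlier log.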

In the example commands below, the ve_example folder has previously been installed under the /tmp/ directory (C:\tmp\ on Windows).

macOS/Linux

ve_run /tmp/ve_example/config/data_config.toml \
    /tmp/ve_example/config/abiotic_simple_config.toml \
    /tmp/ve_example/config/animal_config.toml \
    /tmp/ve_example/config/hydrology_config.toml \
    /tmp/ve_example/config/litter_config.toml \
    /tmp/ve_example/config/plant_config.toml \
    /tmp/ve_example/config/soil_config.toml \
    --outpath /tmp/ve_example/out \
    --logfile /tmp/ve_example/out/logfile.log

Windows (CMD)

ve_run C:\tmp\ve_example\config\data_config.toml ^
    C:\tmp\ve_example\config\abiotic_simple_config.toml ^
    C:\tmp\ve_example\config\animal_config.toml ^
    C:\tmp\ve_example\config\hydrology_config.toml ^
    C:\tmp\ve_example\config\litter_config.toml ^
    C:\tmp\ve_example\config\plant_config.toml ^
    C:\tmp\ve_example\config\soil_config.toml ^
    --outpath C:\tmp\ve_example\out ^
    --logfile C:\tmp\ve_example\out\logfile.log

Windows (Powershell)

ve_run C:\tmp\ve_example\config\data_config.toml `
    C:\tmp\ve_example\config\abiotic_simple_config.toml `
    C:\tmp\ve_example\config\animal_config.toml `
    C:\tmp\ve_example\config\hydrology_config.toml `
    C:\tmp\ve_example\config\litter_config.toml `
    C:\tmp\ve_example\config\plant_config.toml `
    C:\tmp\ve_example\config\soil_config.toml `
    --outpath C:\tmp\ve_example\out `
    --logfile C:\tmp\ve_example\out\logfile.log

* Starting Virtual Ecosystem simulation using v0.2.0.
* Logging to: ve_example/out/logfile.log
* Loading configuration
* Configuration validated
* Saved compiled configuration: ve_example/out/compiled_configuration.toml
* Built core model components
* Initial data loaded
* Models initialised: abiotic_simple, animal, hydrology, litter, plants, soil
* Initialisation data export complete.
* Starting simulation
100%|██████████████████████████████████████████| 24/24 [01:05<00:00,  2.73s/it]
* Simulation completed
Virtual Ecosystem run complete.
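
The same command can also be assembled from Python, which is convenient when runs are scripted. This is a sketch only: it builds the argument list and does not start a simulation, and the helper name `example_run_command` is hypothetical. The configuration file names and the `--outpath` and `--logfile` options are those shown on this page.

```python
from pathlib import Path

# Configuration files used by the example run on this page.
CONFIG_FILES = [
    "data_config.toml",
    "abiotic_simple_config.toml",
    "animal_config.toml",
    "hydrology_config.toml",
    "litter_config.toml",
    "plant_config.toml",
    "soil_config.toml",
]


def example_run_command(example_dir: Path, out_name: str = "out") -> list[str]:
    """Return the ve_run argument list for the example simulation."""
    config_dir = example_dir / "config"
    out_dir = example_dir / out_name
    return [
        "ve_run",
        # Name each file: the whole config directory is not a valid configuration.
        *(str(config_dir / name) for name in CONFIG_FILES),
        "--outpath",
        str(out_dir),
        "--logfile",
        str(out_dir / "logfile.log"),
    ]


# Assumes the example data was installed under /tmp/ve_example.
command = example_run_command(Path("/tmp/ve_example"))
print(command)
# To launch the run: subprocess.run(command, check=True)
```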

The log file is very long and shows the step by step process of running the model - it is primarily used for diagnosing problems with the model. You can view a sample of the contents in the dropdown below:

Partial log output

[INFO] - main - ve_run(286) - Using Virtual Ecosystem v0.2.0.
[INFO] - config_builder - _collect_config_paths(427) - Config paths resolve to 7 files
[INFO] - config_builder - _load_config_toml(450) - Config TOML loaded from ve_example/config/data_config.toml
[INFO] - config_builder - _load_config_toml(450) - Config TOML loaded from ve_example/config/abiotic_simple_config.toml
[INFO] - config_builder - _load_config_toml(450) - Config TOML loaded from ve_example/config/animal_config.toml
[INFO] - config_builder - _load_config_toml(450) - Config TOML loaded from ve_example/config/hydrology_config.toml
[INFO] - config_builder - _load_config_toml(450) - Config TOML loaded from ve_example/config/litter_config.toml
[INFO] - config_builder - _load_config_toml(450) - Config TOML loaded from ve_example/config/plant_config.toml
[INFO] - config_builder - _load_config_toml(450) - Config TOML loaded from ve_example/config/soil_config.toml
[INFO] - config_builder - _compile_data(374) - Configuration data compiled.
[INFO] - registry - _register_module(163) - Registering module: virtual_ecosystem.core
[INFO] - registry - _register_module(176) - Configuration class registered for virtual_ecosystem.core
[INFO] - registry - _register_module(163) - Registering module: virtual_ecosystem.models.abiotic_simple
[INFO] - registry - _get_model(237) - Registering model class for virtual_ecosystem.models.abiotic_simple: AbioticSimpleModel
[INFO] - registry - _register_module(176) - Configuration class registered for virtual_ecosystem.models.abiotic_simple
[INFO] - registry - _register_module(163) - Registering module: virtual_ecosystem.models.animal
[INFO] - registry - _get_model(237) - Registering model class for virtual_ecosystem.models.animal: AnimalModel
[INFO] - registry - _register_module(176) - Configuration class registered for virtual_ecosystem.models.animal
[INFO] - registry - _register_module(163) - Registering module: virtual_ecosystem.models.hydrology
[INFO] - registry - _get_model(237) - Registering model class for virtual_ecosystem.models.hydrology: HydrologyModel
--- many lines omitted ---
[INFO] - data - __setitem__(237) - Replacing data array for 'plant_pft_propagules'
[INFO] - plants_model - update_canopy_layers(864) - Updated canopy data on 1
[INFO] - data - __setitem__(237) - Replacing data array for 'shortwave_absorption'
[INFO] - data - __setitem__(237) - Replacing data array for 'plant_ammonium_uptake'
[INFO] - data - __setitem__(237) - Replacing data array for 'plant_nitrate_uptake'
[INFO] - data - __setitem__(237) - Replacing data array for 'plant_phosphorus_uptake'
[INFO] - data - __setitem__(237) - Replacing data array for 'stem_lignin'
[INFO] - data - __setitem__(237) - Replacing data array for 'senesced_leaf_lignin'
[INFO] - data - __setitem__(237) - Replacing data array for 'plant_reproductive_tissue_lignin'
[INFO] - data - __setitem__(237) - Replacing data array for 'root_lignin'
[INFO] - data - __setitem__(237) - Replacing data array for 'subcanopy_vegetation_cnp'
[INFO] - data - __setitem__(237) - Replacing data array for 'subcanopy_seedbank_cnp'
[INFO] - data - __setitem__(237) - Replacing data array for 'subcanopy_vegetation_litter_cnp'
[INFO] - data - __setitem__(237) - Replacing data array for 'subcanopy_seedbank_litter_cnp'
[INFO] - data - __setitem__(237) - Replacing data array for 'subcanopy_vegetation_litter_lignin'
[INFO] - data - __setitem__(237) - Replacing data array for 'subcanopy_seedbank_litter_lignin'
[INFO] - data - __setitem__(237) - Replacing data array for 'subcanopy_ammonium_uptake'
[INFO] - data - __setitem__(237) - Replacing data array for 'subcanopy_nitrate_uptake'
[INFO] - data - __setitem__(237) - Replacing data array for 'subcanopy_phosphorus_uptake'
[INFO] - base_model - update(439) - Updating animal model
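
Because the log is long, it can help to summarise it rather than read it line by line. The sketch below counts log entries by level and module. The line layout `[LEVEL] - module - function(line) - message` is inferred from the sample above, and it is an assumption that other levels (for example warnings or errors) follow the same layout. The function name is hypothetical.

```python
import re
from collections import Counter

# Layout inferred from the sample: [LEVEL] - module - function(line) - message
LOG_LINE = re.compile(
    r"^\[(?P<level>\w+)\] - (?P<module>\S+) - "
    r"(?P<function>\w+)\((?P<line>\d+)\) - (?P<message>.*)$"
)


def count_by_level_and_module(lines):
    """Count log entries per (level, module); lines that do not match are skipped."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.match(line.strip())
        if match:
            counts[(match["level"], match["module"])] += 1
    return counts


sample = [
    "[INFO] - main - ve_run(286) - Using Virtual Ecosystem v0.2.0.",
    "[INFO] - data - __setitem__(237) - Replacing data array for 'stem_lignin'",
    "[INFO] - data - __setitem__(237) - Replacing data array for 'root_lignin'",
    "--- many lines omitted ---",
]
for key, n in count_by_level_and_module(sample).most_common():
    print(key, n)
```

To summarise a real log, pass an open file object instead of the sample list.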

Looking at the results#

The Virtual Ecosystem writes data out to a Zarr data store. This is a relatively new open-source format that is designed for use with large multi-dimensional data. It is used in the Virtual Ecosystem because the format supports appending data at each time step, making it easy to build up a single integrated dataset during the model run. This also means that if a model crashes, then the output variables will be structured in a single datastore as time series up until the crash point.

The default output filename is model_data.zarr but this can be changed in the output configuration. Although the file suffix makes that look like a single file, the Zarr format is actually a directory structure, containing variables or groups of variables. The Virtual Ecosystem outputs data into one of three groups, depending on when the values are calculated.

The inputs group: variables that are loaded at the start of the simulation from the model data configuration. Obviously, these data are in your input files, but it can be convenient to have them packaged within a single data source alongside the model outputs.
The init group: variables that are calculated during the initialisation process of science models are written to this group. These values capture the state of the model before the model starts to iterate through time. You might not plot them as part of a time series but they can be very useful for understanding how your input data is used to set the model running.
The outputs group: variables that are calculated by the science models at each time step. The data are added to the data store at the end of each time step.
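
Since the data store is a directory rather than a single file, its layout can be inspected with standard tools. The sketch below lists the top-level directories inside a store. It assumes the store is a plain directory on local disk and that each group appears as a subdirectory; the store path combines the default file name with the example output directory and is itself an assumption, as is the helper name.

```python
from pathlib import Path


def top_level_entries(store: Path) -> list[str]:
    """Return the sorted names of the directories directly inside a Zarr store."""
    return sorted(p.name for p in store.iterdir() if p.is_dir())


# Assumed location: the default file name inside the example output directory.
store = Path("/tmp/ve_example/out/model_data.zarr")
if store.exists():
    print(top_level_entries(store))
```

Metadata files stored alongside the groups are skipped because only directories are listed.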

For more details on the variables used in the Virtual Ecosystem and to see which variables are part of the inputs, initialisation and outputs, see the variables table.

Note

The configuration for the example model exports all of the variables used in the model. If you only require some of the variables for your analyses, you can alter the output configuration to export a subset of the variables.

The sections below go into more detail on each data group. The code below uses the xarray and matplotlib Python packages to load and visualise output data, so you may need to install these packages to replicate the outputs on your own computer.

import matplotlib.pyplot as plt
import numpy as np
import xarray
