Ctrl K

ETHOS.TSAM

tsam is a python package which uses different machine learning algorithms for the aggregation of time series.

606
mentions
2
contributors
Get started
301 commitsLast commit ≈ 3 weeks ago173 stars37 forks

Description

Version Conda Version Documentation Status PyPI - License codecov badge

tsam - Time Series Aggregation Module

tsam is a python package which uses different machine learning algorithms for the aggregation of time series. The data aggregation can be performed in two freely combinable dimensions: By representing the time series by a user-defined number of typical periods or by decreasing the temporal resolution. tsam was originally designed for reducing the computational load for large-scale energy system optimization models by aggregating their input data, but is applicable for all types of time series, e.g., weather data, load data, both simultaneously or other arbitrary groups of time series.

The documentation of the tsam code can be found here.

Features

  • flexible handling of multidimensional time-series via the pandas module
  • different aggregation methods implemented (averaging, k-means, exact k-medoids, hierarchical, k-maxoids, k-medoids with contiguity), which are based on scikit-learn, or self-programmed with pyomo
  • hypertuning of aggregation parameters to find the optimal combination of the number of segments inside a period and the number of typical periods
  • novel representation methods, keeping statistical attributes, such as the distribution
  • flexible integration of extreme periods as own cluster centers
  • weighting for the case of multidimensional time-series to represent their relevance

Installation

To avoid dependency conflicts, it is recommended that you install Tsam in its own environment. You can use either uv or conda/mamba ) to manage environments and installations. Before proceeding, you must install either UV or Conda/Mamba, or both.

Quick Install with uv

The fastest way to install tsam is using uv:

uv venv tsam_env
uv pip install tsam

** Install from conda forge into a new environment** Or from conda-forge:

conda create -n tsam_env -c conda-forge tsam

conda and mamba can be used interchangeably

Development Installation

git clone https://github.com/FZJ-IEK3-VSA/tsam.git
cd tsam

Using uv (recommended)

uv venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
uv pip install -e ".[develop]"

Using conda-forge

conda env create -n tsam_env --file=requirements_dev.yml
conda activate tsam_env
pip install -e . --no-deps

Set up pre-commit hooks

pre-commit install


See [CONTRIBUTING.md](CONTRIBUTING.md) for detailed development guidelines.

### MILP Solver for k-medoids

[HiGHS](https://github.com/ERGO-Code/HiGHS) is installed by default. For better performance on large problems, commercial solvers (Gurobi, CPLEX) are recommended if you have a license


## Examples

### Basic workflow

A small example how tsam can be used is decribed as follows
```python
	import pandas as pd
	import tsam.timeseriesaggregation as tsam

Read in the time series data set with pandas

	raw = pd.read_csv('testdata.csv', index_col = 0)

Initialize an aggregation object and define the length of a single period, the number of typical periods, the number of segments in each period, the aggregation method and the representation method - here duration/distribution representation which contains the minimum and maximum value of the original time series

	aggregation = tsam.TimeSeriesAggregation(raw,
						noTypicalPeriods = 8,
						hoursPerPeriod = 24,
						segmentation = True,
						noSegments = 8,
						representationMethod = "distributionAndMinMaxRepresentation",
						distributionPeriodWise = False
						clusterMethod = 'hierarchical'
						)

Run the aggregation to typical periods

	typPeriods = aggregation.createTypicalPeriods()

Store the results as .csv file

	typPeriods.to_csv('typperiods.csv')

Detailed examples

A first example shows the capabilites of tsam as jupyter notebook.

A second example shows in more detail how to access the relevant aggregation results required for paramtrizing e.g. an optimization.

The example time series are based on a department publication and the test reference years of the DWD.

License

MIT License

Copyright (C) 2017-2025 Leander Kotzur (FZJ IEK-3), Maximilian Hoffmann (FZJ IEK-3), Peter Markewitz (FZJ IEK-3), Martin Robinius (FZJ IEK-3), Detlef Stolten (FZJ IEK-3)

You should have received a copy of the MIT License along with this program. If not, see https://opensource.org/licenses/MIT

The core developer team sits in the Institute of Energy and Climate Research - Techno-Economic Energy Systems Analysis (IEK-3) belonging to the Forschungszentrum Jülich.

Citing and further reading

If you want to use tsam in a published work, please kindly cite our latest journal articles:

If you are further interested in the impact of time series aggregation on the cost-optimal results on different energy system use cases, you can find a publication which validates the methods and describes their cababilites via the following link. A second publication introduces a method how to model state variables (e.g. the state of charge of energy storage components) between the aggregated typical periods which can be found here. Finally yet importantly the potential of time series aggregation to simplify mixed integer linear problems is investigated here.

The publications about time series aggregation for energy system optimization models published alongside the development of tsam are listed below:

Acknowledgement

This work is supported by the Helmholtz Association under the Joint Initiative "Energy System 2050 A Contribution of the Research Field Energy" and the program "Energy System Design" and within the BMWi/BMWk funded project METIS.

Keywords
Programming language
  • Python 100%
License
</>Source code
Packages
pypi.org
anaconda.org

Participating organisations

Forschungszentrum Jülich

Reference papers

Mentions

Contributors

OO
Olalekan Omoyele
Maintainer
Forschungszentrum Jülich
MH
Maximilian Hoffmann
Maintainer (former)
Forschungszentrum Jülich

Helmholtz Program-oriented Funding IV