Seasonal Decomposition of Time Series by Loess
The Seasonal-Trend-Loess (STL) algorithm decomposes a time series into seasonal, trend and residual components. The algorithm uses Loess interpolation (original paper here) to smooth the cyclic sub-series (e.g. all January values in the CO2 data shown in the example below). After removing the seasonality from the signal, the remainder is smoothed (in multiple steps) to find the trend. This process is repeated and may include robustness iterations that take advantage of the weighted-least-squares underpinnings of Loess to remove the effects of outliers. The details are described in STL: A Seasonal-Trend Decomposition Procedure Based on Loess.
stl-decomp-4j is a Java port of the original Ratfor/Fortran available from Netlib (original source here; also included as part of examples/StlPerfTest/fortran_benchmark
), extended to support local quadratic interpolation. stl-decomp-4j expects equally spaced data with no missing values, similar to the original Fortran version (and the R and Python versions, which both use the original Fortran version under the hood).
Check out the the stl-decomp-4j wiki for TODOs, etc.
Example
At a minimum, the STL algorithm requires specifying the periodicity of the data (e.g. 12 for monthly) and the width of the Loess smoother used to smooth the cyclic seasonal sub-series. In general, there are three Loess smoothers that each require three parameters: a width, a degree, and a jump. The width specifies the number of data points that the local interpolation uses to smooth each point, the degree specifies the degree of the local polynomial that is fit to the data, and the jump specifies how many points are skipped between Loess interpolations, with linear interpolation being done between these points. Because of this complexity, construction is done via Builder
objects, as shown in this simple example.
double[] values = getSomeMonthlyData(); // Monthly time-series data
SeasonalTrendLoess.Builder builder = new SeasonalTrendLoess.Builder();
SeasonalTrendLoess smoother = builder.
setPeriodLength(12). // Data has a period of 12
setSeasonalWidth(35). // Monthly data smoothed over 35 years
setNonRobust(). // Not expecting outliers, so no robustness iterations
buildSmoother(values);
SeasonalTrendLoess.Decomposition stl = smoother.decompose();
double[] seasonal = stl.getSeasonal();
double[] trend = stl.getTrend();
double[] residual = stl.getResidual();
The StlDemoRestServer example includes a copy of the Monthly CO2 Measurement Data and a simple REST server that reads this data, performs an STL decomposition on the data, and serves up the results to http://localhost:4567/stldemo
. The examples/StlDemoRestServer/index.html
file loads the data from this server and plots the resulting decomposition. The code that does the actual decomposition is identical to that shown above, resulting in the following decomposition:
Benchmarking
The StlPerfTest example times the STL running on the CO2 data mentioned above or on a time-series of 10 years of random hourly data. The same benchmark is implemented in the original Fortran here. See Performance Tests for instructions on running these tests.
Limited testing show the Java to be about half the speed of the Fortran. A comparison of the resulting decompositions is shown in the StlJavaFortranComparison notebook.
Extension to include exogenous inputs
The STL method is extended to incorporate exogenous inputs where details can be found in the StlExogenous example.
To apply exogenous inputs, build the smoother with them (double[][] exogenousinputs
with each row being one exogenous input) as shown in the example below. After calling the .decompose()
method, you can get the STL components. Note that the .getTrend()
method now extracts the combined effect of the trend and exogenous inputs.
SeasonalTrendLoess smoother = builder.buildSmoother(values, exogenousinputs);
Documentation
The implementation of the quadratic extension of LoessInterpolator
is described, mathematically, in ImplementationNotes.
TODO: JavaDoc link goes here
Build Dependencies
To include stl-decomp-4j in your maven project, add the following to your pom file:
<dependency>
<groupId>com.github.servicenow.stl4j</groupId>
<artifactId>stl-decomp-4j</artifactId>
<version>1.0.5</version>
</dependency>
The stl-decomp-4j implementation has no external dependencies.
The unit tests depend on junit
, commons-math3
and slf4j-simple
.
The examples have further dependencies on commons-cli
, opencsv
, spark-core
, and jackson-mapper-asl
.