In mathematics and statistics, a stationary process (or a strict/strictly stationary process or strong/strongly stationary process) is a stochastic process whose unconditional joint probability distribution does not change when shifted in time.^{[1]} Consequently, parameters such as mean and variance also do not change over time.
Since stationarity is an assumption underlying many statistical procedures used in time series analysis, nonstationary data are often transformed to become stationary. The most common cause of violation of stationarity is a trend in the mean, which can be due either to the presence of a unit root or of a deterministic trend. In the former case of a unit root, stochastic shocks have permanent effects, and the process is not meanreverting. In the latter case of a deterministic trend, the process is called a trendstationary process, and stochastic shocks have only transitory effects after which the variable tends toward a deterministically evolving (nonconstant) mean.
A trend stationary process is not strictly stationary, but can easily be transformed into a stationary process by removing the underlying trend, which is solely a function of time. Similarly, processes with one or more unit roots can be made stationary through differencing. An important type of nonstationary process that does not include a trendlike behavior is a cyclostationary process, which is a stochastic process that varies cyclically with time.
For many applications strictsense stationarity is too restrictive. Other forms of stationarity such as widesense stationarity or Nthorder stationarity are then employed. The definitions for different kinds of stationarity are not consistent among different authors (see Other terminology).
Strictsense stationarity
Definition
Formally, let be a stochastic process and let represent the cumulative distribution function of the unconditional (i.e., with no reference to any particular starting value) joint distribution of at times . Then, is said to be strictly stationary, strongly stationary or strictsense stationary if^{[2]}^{:p. 155}

(Eq.1) 
Since does not affect , is not a function of time.
Examples
White noise is the simplest example of a stationary process.
An example of a discretetime stationary process where the sample space is also discrete (so that the random variable may take one of N possible values) is a Bernoulli scheme. Other examples of a discretetime stationary process with continuous sample space include some autoregressive and moving average processes which are both subsets of the autoregressive moving average model. Models with a nontrivial autoregressive component may be either stationary or nonstationary, depending on the parameter values, and important nonstationary special cases are where unit roots exist in the model.
Example 1
Let be any scalar random variable, and define a timeseries , by
Then is a stationary time series, for which realisations consist of a series of constant values, with a different constant value for each realisation. A law of large numbers does not apply on this case, as the limiting value of an average from a single realisation takes the random value determined by , rather than taking the expected value of .
The time average of does not converge since the process is not ergodic.
Example 2
As a further example of a stationary process for which any single realisation has an apparently noisefree structure, let have a uniform distribution on and define the time series by
Then is strictly stationary.
Example 3
Keep in mind that a white noise is not necessarily strictly stationary. Let be a random variable uniformly distributed in the interval and define the time series
It can be shown that:
, , and .
So is a white noise, however it is not strictly stationary.
Nthorder stationarity
In Eq.2, the distribution of samples of the stochastic process must be equal to the distribution of the samples shifted in time for all . Nthorder stationarity is a weaker form of stationarity where this is only requested for all up to a certain order . A random process is said to be Nthorder stationary if:^{[2]}^{:p. 152}

(Eq.2) 
Weak or widesense stationarity
Definition
A weaker form of stationarity commonly employed in signal processing is known as weaksense stationarity, widesense stationarity (WSS), or covariance stationarity. WSS random processes only require that 1st moment (i.e. the mean) and autocovariance do not vary with respect to time and that the 2nd moment is finite for all times. Any strictly stationary process which has a finite mean and a covariance is also WSS.^{[3]}^{:p. 299}
So, a continuous time random process which is WSS has the following restrictions on its mean function and autocovariance function :

(Eq.3) 
The first property implies that the mean function must be constant. The second property implies that the covariance function depends only on the difference between and and only needs to be indexed by one variable rather than two variables.^{[2]}^{:p. 159} Thus, instead of writing,
the notation is often abbreviated by the substitution :
This also implies that the autocorrelation depends only on , that is
The third property says that the second moments must be finite for any time .
Motivation
The main advantage of widesense stationarity is that it places the timeseries in the context of Hilbert spaces. Let H be the Hilbert space generated by {x(t)} (that is, the closure of the set of all linear combinations of these random variables in the Hilbert space of all squareintegrable random variables on the given probability space). By the positive definiteness of the autocovariance function, it follows from Bochner's theorem that there exists a positive measure on the real line such that H is isomorphic to the Hilbert subspace of L^{2}(μ) generated by {e^{−2πiξ⋅t}}. This then gives the following Fouriertype decomposition for a continuous time stationary stochastic process: there exists a stochastic process with orthogonal increments such that, for all
where the integral on the righthand side is interpreted in a suitable (Riemann) sense. The same result holds for a discretetime stationary process, with the spectral measure now defined on the unit circle.
When processing WSS random signals with linear, timeinvariant (LTI) filters, it is helpful to think of the correlation function as a linear operator. Since it is a circulant operator (depends only on the difference between the two arguments), its eigenfunctions are the Fourier complex exponentials. Additionally, since the eigenfunctions of LTI operators are also complex exponentials, LTI processing of WSS random signals is highly tractable—all computations can be performed in the frequency domain. Thus, the WSS assumption is widely employed in signal processing algorithms.
Definition for complex stochastic process
In the case where is a complex stochastic process the autocovariance function is defined as and, in addition to the requirements in Eq.3, it is required that the pseudoautocovariance function depends only on the time lag. In formulas, is WSS, if

(Eq.4) 
Joint stationarity
The concept of stationarity may be extended to two stochastic processes.
Joint strictsense stationarity
Two stochastic processes and are called jointly strictsense stationary if their joint cumulative distribution remains unchanged under time shifts, i.e. if

(Eq.5) 
Joint (M + N)thorder stationarity
Two random processes and is said to be jointly (M + N)thorder stationary if:^{[2]}^{:p. 159}

(Eq.6) 
Joint weak or widesense stationarity
Two stochastic processes and are called jointly widesense stationary if they are both widesense stationary and their crosscovariance function depends only on the time difference . This may be summarized as follows:

(Eq.7) 
Relation between types of stationarity
 If a stochastic process is Nthorder stationary, then it is also Mthorder stationary for all .
 If a stochastic process is second order stationary () and has finite second moments, then it is also widesense stationary.^{[2]}^{:p. 159}
 If a stochastic process is widesense stationary, it is not necessarily secondorder stationary.^{[2]}^{:p. 159}
 If a stochastic process is strictsense stationary and has finite second moments, it is widesense stationary.^{[3]}^{:p. 299}
 If two stochastic processes are jointly (M + N)thorder stationary, this does not guarantee that the individual processes are Mth respectively Nthorder stationary.^{[2]}^{:p. 159}
Other terminology
The terminology used for types of stationarity other than strict stationarity can be rather mixed. Some examples follow.
 Priestley uses stationary up to order m if conditions similar to those given here for wide sense stationarity apply relating to moments up to order m.^{[4]}^{[5]} Thus wide sense stationarity would be equivalent to "stationary to order 2", which is different from the definition of secondorder stationarity given here.
 Honarkhah and Caers also use the assumption of stationarity in the context of multiplepoint geostatistics, where higher npoint statistics are assumed to be stationary in the spatial domain.^{[6]}
 Tahmasebi and Sahimi have presented an adaptive Shannonbased methodology that can be used for modeling of any nonstationary systems.^{[7]}
Differencing
One way to make some time series stationary is to compute the differences between consecutive observations. This is known as differencing. Differencing can help stabilize the mean of a time series by removing changes in the level of a time series, and so eliminating trend and seasonality.
Transformations such as logarithms can help to stabilize the variance of a time series.
One of the ways for identifying nonstationary times series is the ACF plot. For a stationary time series, the ACF will drop to zero relatively quickly, while the ACF of nonstationary data decreases slowly.^{[8]}
See also
 Lévy process
 Stationary ergodic process
 Wiener–Khinchin theorem
 Ergodicity
 Statistical regularity
 Autocorrelation
 Whittle likelihood
References
 ^ Gagniuc, Paul A. (2017). Markov Chains: From Theory to Implementation and Experimentation. USA, NJ: John Wiley & Sons. pp. 1–256. ISBN 9781119387558.
 ^ ^{a} ^{b} ^{c} ^{d} ^{e} ^{f} ^{g} Park,Kun Il (2018). Fundamentals of Probability and Stochastic Processes with Applications to Communications. Springer. ISBN 9783319680743.
 ^ ^{a} ^{b} Ionut Florescu (7 November 2014). Probability and Stochastic Processes. John Wiley & Sons. ISBN 9781118593202.
 ^ Priestley, M. B. (1981). Spectral Analysis and Time Series. Academic Press. ISBN 0125649223.
 ^ Priestley, M. B. (1988). Nonlinear and Nonstationary Time Series Analysis. Academic Press. ISBN 0125649118.
 ^ Honarkhah, M.; Caers, J. (2010). "Stochastic Simulation of Patterns Using DistanceBased Pattern Modeling". Mathematical Geosciences. 42 (5): 487–517. doi:10.1007/s1100401092767.
 ^ Tahmasebi, P.; Sahimi, M. (2015). "Reconstruction of nonstationary disordered materials and media: Watershed transform and crosscorrelation function" (PDF). Physical Review E. 91 (3): 032401. doi:10.1103/PhysRevE.91.032401. PMID 25871117.
 ^ "8.1 Stationarity and differencing  OTexts". www.otexts.org. Retrieved 20160518.
Further reading
 Enders, Walter (2010). Applied Econometric Time Series (Third ed.). New York: Wiley. pp. 53–57. ISBN 9780470505397.
 Jestrovic, I.; Coyle, J. L.; Sejdic, E (2015). "The effects of increased fluid viscosity on stationary characteristics of EEG signal in healthy adults". Brain Research. 1589: 45–53. doi:10.1016/j.brainres.2014.09.035. PMC 4253861. PMID 25245522.
 Hyndman, Athanasopoulos (2013). Forecasting: Principles and Practice. Otexts. https://www.otexts.org/fpp/8/1