The power spectrum of a time series describes the distribution of power into frequency components composing that signal.^{[1]} According to Fourier analysis, any physical signal can be decomposed into a number of discrete frequencies, or a spectrum of frequencies over a continuous range. The statistical average of a certain signal or sort of signal (including noise) as analyzed in terms of its frequency content, is called its spectrum.
When the energy of the signal is concentrated around a finite time interval, especially if its total energy is finite, one may compute the energy spectral density. More commonly used is the power spectral density (or simply power spectrum), which applies to signals existing over all time, or over a time period large enough (especially in relation to the duration of a measurement) that it could as well have been over an infinite time interval. The power spectral density (PSD) then refers to the spectral energy distribution that would be found per unit time, since the total energy of such a signal over all time would generally be infinite. Summation or integration of the spectral components yields the total power (for a physical process) or variance (in a statistical process), identical to what would be obtained by integrating over the time domain, as dictated by Parseval's theorem.^{[2]}
The spectrum of a physical process often contains essential information about the nature of . For instance, the pitch and timbre of a musical instrument are immediately determined from a spectral analysis. The color of a light source is determined by the spectrum of the electromagnetic wave's electric field as it fluctuates at an extremely high frequency. Obtaining a spectrum from time series such as these involves the Fourier transform, and generalizations based on Fourier analysis. In many cases the time domain is not specifically employed in practice, such as when a dispersive prism is used to obtain a spectrum of light in a spectrograph, or when a sound is perceived through its effect on the auditory receptors of the inner ear, each of which is sensitive to a particular frequency.
However this article concentrates on situations in which the time series is known (at least in a statistical sense) or directly measured (such as by a microphone sampled by a computer). The power spectrum is important in statistical signal processing and in the statistical study of stochastic processes, as well as in many other branches of physics and engineering. Typically the process is a function of time, but one can similarly discuss data in the spatial domain being decomposed in terms of spatial frequency.^{[3]}
Explanation
Any signal that can be represented as a variable that varies in time has a corresponding frequency spectrum. This includes familiar entities such as visible light (perceived as color), musical notes (perceived as pitch), radio/TV (specified by their frequency, or sometimes wavelength) and even the regular rotation of the earth. When these signals are viewed in the form of a frequency spectrum, certain aspects of the received signals or the underlying processes producing them are revealed. In some cases the frequency spectrum may include a distinct peak corresponding to a sine wave component. And additionally there may be peaks corresponding to harmonics of a fundamental peak, indicating a periodic signal which is not simply sinusoidal. Or a continuous spectrum may show narrow frequency intervals which are strongly enhanced corresponding to resonances, or frequency intervals containing almost zero power as would be produced by a notch filter.
In physics, the signal might be a wave, such as an electromagnetic wave, an acoustic wave, or the vibration of a mechanism. The power spectral density (PSD) of the signal describes the power present in the signal as a function of frequency, per unit frequency. Power spectral density is commonly expressed in watts per hertz (W/Hz).^{[4]}
When a signal is defined in terms only of a voltage, for instance, there is no unique power associated with the stated amplitude. In this case "power" is simply reckoned in terms of the square of the signal, as this would always be proportional to the actual power delivered by that signal into a given impedance. So one might use units of V^{2} Hz^{−1} for the PSD and V^{2} s Hz^{−1} for the ESD (energy spectral density)^{[5]} even though no actual "power" or "energy" is specified.
Sometimes one encounters an amplitude spectral density (ASD), which is the square root of the PSD; the ASD of a voltage signal has units of V Hz^{−1/2}.^{[6]} This is useful when the shape of the spectrum is rather constant, since variations in the ASD will then be proportional to variations in the signal's voltage level itself. But it is mathematically preferred to use the PSD, since only in that case is the area under the curve meaningful in terms of actual power over all frequency or over a specified bandwidth.
In the general case, the units of PSD will be the ratio of units of variance per unit of frequency; so, for example, a series of displacement values (in meters) over time (in seconds) will have PSD in units of m^{2}/Hz. For random vibration analysis, units of g^{2} Hz^{−1} are frequently used for the PSD of acceleration. Here g denotes the gforce.^{[7]}
Mathematically, it is not necessary to assign physical dimensions to the signal or to the independent variable. In the following discussion the meaning of x(t) will remain unspecified, but the independent variable will be assumed to be that of time.
Definition
Energy spectral density
Energy spectral density describes how the energy of a signal or a time series is distributed with frequency. Here, the term energy is used in the generalized sense of signal processing;^{[8]} that is, the energy of a signal is
The energy spectral density is most suitable for transients—that is, pulselike signals—having a finite total energy. Finite or not, Parseval's theorem ^{[9]} (or Plancherel's theorem) gives us an alternate expression for the energy of the signal:
where
is the Fourier transform of the signal and is the frequency in Hz, i.e., cycles per second and is regarded as the amplitude spectral density. Often used is the angular frequency . Since the integral on the righthand side is the energy of the signal, the integrand can be interpreted as a density function describing the energy per unit frequency contained in the signal at the frequency . In light of this, the energy spectral density of a signal is defined as^{[9]}

(Eq.1) 
As a physical example of how one might measure the energy spectral density of a signal, suppose represents the potential (in volts) of an electrical pulse propagating along a transmission line of impedance , and suppose the line is terminated with a matched resistor (so that all of the pulse energy is delivered to the resistor and none is reflected back). By Ohm's law, the power delivered to the resistor at time is equal to , so the total energy is found by integrating with respect to time over the duration of the pulse. To find the value of the energy spectral density at frequency , one could insert between the transmission line and the resistor a bandpass filter which passes only a narrow range of frequencies (, say) near the frequency of interest and then measure the total energy dissipated across the resistor. The value of the energy spectral density at is then estimated to be . In this example, since the power has units of V^{2} Ω^{−1}, the energy has units of V^{2} s Ω^{−1} = J, and hence the estimate of the energy spectral density has units of J Hz^{−1}, as required. In many situations, it is common to forgo the step of dividing by so that the energy spectral density instead has units of V^{2} Hz^{−1}.
This definition generalizes in a straightforward manner to a discrete signal with an infinite number of values such as a signal sampled at discrete times :
where is the discretetime Fourier transform of and is the complex conjugate of The sampling interval is needed to keep the correct physical units and to ensure that we recover the continuous case in the limit ; however, in the mathematical sciences, the interval is often set to 1.
Power spectral density
The above definition of energy spectral density is suitable for transients (pulselike signals) whose energy is concentrated around one time window; then the Fourier transforms of the signals generally exist. For continuous signals over all time, such as stationary processes, one must rather define the power spectral density (PSD); this describes how power of a signal or time series is distributed over frequency, as in the simple example given previously. Here, power can be the actual physical power, or more often, for convenience with abstract signals, is simply identified with the squared value of the signal. For example, statisticians study the variance of a function over time (or over another independent variable), and using an analogy with electrical signals (among other physical processes), it is customary to refer to it as the power spectrum even when there is no physical power involved. If one were to create a physical voltage source which followed and applied it to the terminals of a 1 ohm resistor, then indeed the instantaneous power dissipated in that resistor would be given by watts.
The average power of a signal over all time is therefore given by the following time average, where the period is centered about some arbitrary time :
However, for the sake of dealing with the math that follows, it is more convenient to deal with time limits in the signal itself rather than time limits in the bounds of the integral. As such, we have an alternative representation of the average power, where and is unity within the arbitrary period and zero elsewhere.
Note that a stationary process, for instance, may have a finite power but an infinite energy. After all, energy is the integral of power, and the stationary signal continues over an infinite time. That is the reason that we cannot use the energy spectral density as defined above in such cases.
In analyzing the frequency content of the signal , one might like to compute the ordinary Fourier transform ; however, for many signals of interest the Fourier transform does not formally exist.^{[N 1]} Regardless, Parseval's Theorem tells us that we can rewrite the average power as follows.
Then the power spectral density is simply defined as the integrand above.^{[11]}^{[12]}

(Eq.2) 
From here, we can also view as the Fourier transform of the time convolution of and
Now, if we divide the time convolution above by the period and take the limit as , it becomes the autocorrelation function of the nonwindowed signal , which is denoted as , provided all outcomes of are equiprobable, which is typically but not generally true^{[13]}.
From here we see that in such equiprobable cases, we can also define the power spectral density as the Fourier transform of the autocorrelation function (Wiener–Khinchin theorem).

(Eq.3) 
Many authors use this equality to actually define the power spectral density.^{[14]}
The power of the signal in a given frequency band , where , can be calculated by integrating over frequency. Since , an equal amount of power can be attributed to positive and negative frequency bands, which accounts for the factor of 2 in the following form (such trivial factors dependent on conventions used):
More generally, similar techniques may be used to estimate a timevarying spectral density. In this case the time interval is finite rather than approaching infinity. This results in decreased spectral coverage and resolution since frequencies of less than are not sampled, and results at frequencies which are not an integer multiple of are not independent. Just using a single such time series, the estimated power spectrum will be very "noisy"; however this can be alleviated if it is possible to evaluate the expected value (in the above equation) using a large (or infinite) number of shortterm spectra corresponding to statistical ensembles of realizations of evaluated over the specified time window.
Just as with the energy spectral density, the definition of the power spectral density can be generalized to discrete time variables . As before, we can consider a window of with the signal sampled at discrete times for a total measurement period .
Note that a single estimate of the PSD can be obtained through a finite number of samplings. As before, the actual PSD is achieved when (and thus ) approach infinity and the expected value is formally applied. In a realworld application, one would typically average a finitemeasurement PSD over many trials to obtain a more accurate estimate of the theoretical PSD of the physical process underlying the individual measurements. This computed PSD is sometimes called a periodogram. This periodogram converges to the true PSD as the number of estimates as well as the averaging time interval approach infinity (Brown & Hwang)^{[15]}.
If two signals both possess power spectral densities, then the crossspectral density can similarly be calculated; as the PSD is related to the autocorrelation, so is the crossspectral density related to the crosscorrelation.
Properties of the power spectral density
Some properties of the PSD include:^{[16]}
 The spectrum of a real valued process (or even a complex process using the above definition) is real and an even function of frequency: .
 If the process is continuous and purely indeterministic^{[clarification needed]}, the autocovariance function can be reconstructed by using the Inverse Fourier transform
 The PSD can be used to compute the variance (average power) of a process by integrating over frequency:
 Being based on the Fourier transform, the PSD is a linear function of the autocovariance function in the sense that if is decomposed into two functions
 ,
 then
The integrated spectrum or power spectral distribution is defined as^{[dubious – discuss]}^{[17]}
Cross power spectral density
Given two signals and , each of which possess power spectral densities and , it is possible to define a cross power spectral density (CPSD) or cross spectral density (CSD). To begin, let us consider the average power of such a combined signal.
Using the same notation and methods as used for the power spectral density derivation, we exploit Parseval's theorem and obtain
where, again, the contributions of and are already understood. Note that , so the full contribution to the cross power is, generally, from twice the real part of either individual CPSD. Just as before, from here we recast these products as the Fourier transform of a time convolution, which when divided by the period and taken to the limit becomes the Fourier transform of a crosscorrelation function.^{[18]}
where is the crosscorrelation of with and is the crosscorrelation of with . In light of this, the PSD is seen to be a special case of the CSD for . For the case that and are voltage or current signals, their associated amplitude spectral densities and are strictly positive by convention. Therefore, in typical signal processing, the full CPSD is just one of the CPSDs scaled by a factor of two.
For discrete signals x_{n} and y_{n}, the relationship between the crossspectral density and the crosscovariance is
Estimation
The goal of spectral density estimation is to estimate the spectral density of a random signal from a sequence of time samples. Depending on what is known about the signal, estimation techniques can involve parametric or nonparametric approaches, and may be based on timedomain or frequencydomain analysis. For example, a common parametric technique involves fitting the observations to an autoregressive model. A common nonparametric technique is the periodogram.
The spectral density is usually estimated using Fourier transform methods (such as the Welch method), but other techniques such as the maximum entropy method can also be used.
Properties
 The spectral density of and the autocorrelation of form a Fourier transform pair (for PSD versus ESD, different definitions of autocorrelation function are used). This result is known as Wiener–Khinchin theorem.
 One of the results of Fourier analysis is Parseval's theorem which states that the area under the energy spectral density curve is equal to the area under the square of the magnitude of the signal, the total energy:
 The above theorem holds true in the discrete cases as well. A similar result holds for power: the area under the power spectral density curve is equal to the total signal power, which is , the autocorrelation function at zero lag. This is also (up to a constant which depends on the normalization factors chosen in the definitions employed) the variance of the data comprising the signal.
Related concepts
 The spectral centroid of a signal is the midpoint of its spectral density function, i.e. the frequency that divides the distribution into two equal parts.
 The spectral edge frequency of a signal is an extension of the previous concept to any proportion instead of two equal parts.
 The spectral density is a function of frequency, not a function of time. However, the spectral density of small windows of a longer signal may be calculated, and plotted versus time associated with the window. Such a graph is called a spectrogram. This is the basis of a number of spectral analysis techniques such as the shorttime Fourier transform and wavelets.
 A "spectrum" generally means the power spectral density, as discussed above, which depicts the distribution of signal content over frequency. This is not to be confused with the frequency response of a transfer function which also includes a phase (or equivalently, a real and imaginary part as a function of frequency). For transfer functions, (e.g., Bode plot, chirp) the complete frequency response may be graphed in two parts, amplitude versus frequency and phase versus frequency (or less commonly, as real and imaginary parts of the transfer function). The impulse response (in the time domain) , cannot generally be uniquely recovered from the amplitude spectral density part alone without the phase function. Although these are also Fourier transform pairs, there is no symmetry (as there is for the autocorrelation) forcing the Fourier transform to be realvalued. See spectral phase and phase noise.
Applications
Electrical engineering
The concept and use of the power spectrum of a signal is fundamental in electrical engineering, especially in electronic communication systems, including radio communications, radars, and related systems, plus passive remote sensing technology. Electronic instruments called spectrum analyzers are used to observe and measure the power spectra of signals.
The spectrum analyzer measures the magnitude of the shorttime Fourier transform (STFT) of an input signal. If the signal being analyzed can be considered a stationary process, the STFT is a good smoothed estimate of its power spectral density.
Cosmology
Primordial fluctuations, density variations in the early universe, are quantified by a power spectrum which gives the power of the variations as a function of spatial scale.
See also
 Noise spectral density
 Spectral density estimation
 Spectral efficiency
 Spectral power distribution
 Brightness temperature
 Colors of noise
 Spectral leakage
 Window function
 Bispectrum
 Whittle likelihood
Notes
 ^ Some authors (e.g. Risken^{[10]}) still use the nonnormalized Fourier transform in a formal way to formulate a definition of the power spectral density
 ,
References
 ^ P Stoica & R Moses (2005). "Spectral Analysis of Signals" (PDF).
 ^ P Stoica & R Moses (2005). "Spectral Analysis of Signals" (PDF).
 ^ P Stoica & R Moses (2005). "Spectral Analysis of Signals" (PDF).
 ^ Gérard Maral (2003). VSAT Networks. John Wiley and Sons. ISBN 9780470866849.
 ^ Michael Peter Norton & Denis G. Karczub (2003). Fundamentals of Noise and Vibration Analysis for Engineers. Cambridge University Press. ISBN 9780521499132.
 ^ Michael Cerna & Audrey F. Harvey (2000). "The Fundamentals of FFTBased Signal Analysis and Measurement" (PDF).
 ^ Alessandro Birolini (2007). Reliability Engineering. Springer. p. 83. ISBN 9783540493884.
 ^ Oppenheim; Verghese. Signals, Systems, and Inference. pp. 32–4.
 ^ ^{a} ^{b} Stein, Jonathan Y. (2000). Digital Signal Processing: A Computer Science Perspective. Wiley. p. 115.
 ^ Hannes Risken (1996). The Fokker–Planck Equation: Methods of Solution and Applications (2nd ed.). Springer. p. 30. ISBN 9783540615309.
 ^ Fred Rieke; William Bialek & David Warland (1999). Spikes: Exploring the Neural Code (Computational Neuroscience). MIT Press. ISBN 9780262681087.
 ^ Scott Millers & Donald Childers (2012). Probability and random processes. Academic Press. pp. 370–5.
 ^ The Wiener–Khinchin theorem makes sense of this formula for any widesense stationary process under weaker hypotheses: does not need to be absolutely integrable, it only needs to exist. But the integral can no longer be interpreted as usual. The formula also makes sense if interpreted as involving distributions (in the sense of Laurent Schwartz, not in the sense of a statistical Cumulative distribution function) instead of functions. If is continuous, Bochner's theorem can be used to prove that its Fourier transform exists as a positive measure, whose distribution function is F (but not necessarily as a function and not necessarily possessing a probability density).
 ^ Dennis Ward Ricker (2003). Echo Signal Processing. Springer. ISBN 9781402073953.
 ^ Robert Grover Brown & Patrick Y.C. Hwang (1997). Introduction to Random Signals and Applied Kalman Filtering. John Wiley & Sons. ISBN 9780471128397.
 ^ Storch, H. Von; F. W Zwiers (2001). Statistical analysis in climate research. Cambridge University Press. ISBN 9780521012300.
 ^ An Introduction to the Theory of Random Signals and Noise, Wilbur B. Davenport and Willian L. Root, IEEE Press, New York, 1987, ISBN 0879422351
 ^ William D Penny (2009). "Signal Processing Course, chapter 7".