Using a Nature Run as Truth Versus
a Succession of Analyses
in
Observing System Simulation Experiments.
A nature run is a long, uninterrupted forecast by a model whose statistical behavior matches that of the real atmosphere. The ideal nature run would be a coupled atmosphere-ocean-cryosphere model with a fully interactive lower boundary. Meteorological science is approaching this ideal but has not yet reached it. We still supply the lower boundary conditions (SST and ice cover) appropriate for the span of time being simulated.
The advantage of a long, free-running forecast is that the simulated atmospheric system evolves continuously in a dynamically consistent way. One can extract atmospheric states at any time. Because the real atmosphere is a chaotic system governed mainly by conditions at its lower boundary, it does not matter that the nature run diverges from the real atmosphere a few weeks after the simulation begins provided that the climatological statistics of the simulation match those of the real atmosphere. The nature run should be a separate universe, ultimately independent from but parallel to the real atmosphere.
One of the challenges of an OSSE is to demonstrate that the nature run does have the same statistical behavior as the real atmosphere in every aspect relevant to the observing system under scrutiny. For example, an OSSE for a wind-finding lidar aboard a satellite requires a nature run with a realistic cloud climatology.
A succession of analyses is a collection of snapshots of the real atmosphere. Though (in the case of 4DVAR) the analyses may each be a realizable model state, they all lie on different model trajectories. Each analysis marks a discontinuity in model trajectory. Considering a succession of analyses as truth seems to be a serious compromise in the attempt to conduct a “clean” experiment.
I favor a long, free-running forecast as the basis for defining “truth” in an OSSE.
Tom Schlatter (9/21/2006)
From OSSE chapter in "Data Assimilation: Making
sense of Observation" from Springer
Masutani, M., T. W. Schlatter, R. M. Errico, A. Stoffelen, E. Andersson, W. Lahoz, J. S.. Woollen, G. D. Emmitt,L.-P. Riishøjgaard, S. J. Lord : Observing System Simulation Experiments. Data Assimilation: Making sense of observations, to be published from Springer in 2009. The manuscript is posted at
3. The
Nature Run
The Nature Run is a long, uninterrupted forecast by a NWP
model whose statistical behaviour matches that of the
real atmosphere. The ideal Nature Run would be a coupled atmosphere-ocean-cryosphere model with a fully interactive lower boundary.
However, it is still customary to supply the lower boundary conditions (sea
surface temperature, SST, and ice cover) appropriate for the span of time being
simulated. Meteorological science is approaching this ideal, but such coupled
systems are not yet mature enough to be used for Nature Runs. Although fully
coupled systems are available, their usefulness and accuracy for OSSEs is
unknown. Preliminary tests, however, suggest that coupled systems may be good
enough for operational NWP in near future (Saha et al. 2006; Kistler
et al. 2008).
The advantage of using a long, free-running forecast to
simulate the Nature Run is that the simulated atmospheric system evolves
continuously in a dynamically consistent way. One can extract atmospheric
states at any time. Because the real atmosphere is a chaotic system governed
mainly by conditions at its lower boundary, it does not matter that the Nature
Run diverges from the real atmosphere a few weeks after the simulation begins provided that the climatological statistics of the
simulation match those of the real atmosphere. A Nature Run should be a
separate universe, ultimately independent from but parallel to the real
atmosphere.
3.1 Characteristics of the Nature Run
One of the challenges for an OSSE is to demonstrate that the
Nature Run does have the same statistical behaviour
as the real atmosphere in every aspect relevant to the observing system under
scrutiny. For example, an OSSE for a wind-finding lidar on board a satellite
requires a Nature Run with realistic cloud climatology because lidars operate at wavelengths for which thick clouds are
opaque. The cloud distribution thus determines the location and number of
observations.
The Nature Run is central to an OSSE. It defines the true atmospheric state against which forecasts
using simulated observations will be evaluated. This concept deserves more
explanation. In 1986, Andrew Lorenc suggested the
following definition of the “truth”: the projection of the true state of the
atmosphere onto the model basis. As an example, if a spectral model produces a
Nature Run, the true atmospheric state might be represented by spectral
coefficients corresponding to triangular truncation at total wave number n (Tn) on L vertical levels. Atmospheric features too small to be captured by
the model resolution are not incorporated in this truth.
The Nature Run is also the source of simulated observations. For
each observing system, existing or future, a set of realistic observing times
and locations is developed along with a list of observed parameters. An
interpolation algorithm looks at the accumulated output of the Nature Run, goes
to the proper time and location and then extracts the value of the observed
parameter. If the Nature Run does not explicitly provide an observed parameter,
the parameter is estimated from related variables that the model does provide. Because
observations extracted from the Nature Run are the same as the defined truth
(they are “perfect”), various sources of error must also be simulated and added
to form observations with realistic accuracy with respect to the Nature Run
itself.
Some OSSEs have used a
succession of atmospheric analyses as a substitute for a Nature Run (Keil 2004; Lahoz et al. 2005). A succession of analyses
is a collection of snapshots of the real atmosphere. For example, in the case
of four-dimensional variational assimilation (4D-VAR,
link to theory
chapter), although the analyses may each be a realizable model state, they all
lie on different model trajectories. The background (first guess) lies on the
same model trajectory as the previous analysis because, in 4D-VAR the analysis
is a realizable model state (it does not require separate initialization or
balancing). Once this background is adjusted by new data in 4D-VAR, the model
lies on a new trajectory, which may be close to the old one (the one that the
background was on) but is nonetheless different. Each analysis marks a
discontinuity in model trajectory, determined by the information content
extracted by a DAS from the existing global observing systems (link to Thépaut
chapter). Furthermore, residual systematic effects due to the spatially
non-uniform and often biased observations, the DAS or the model state, may
either favourably or unfavourably
affect the potential of new observing systems to improve the forecasts. Thus,
considering a succession of analyses as truth seriously compromises the attempt
to conduct a “clean” experiment.