Top: Introduction
Forward: Deformable mirrors
The image of a point source (star) in an ideal telescope without
atmosphere is shaped by the diffraction and is described by an
Airy function:
![]() |
(1) |
The first dark ring is at an angular distance of
from
the center. This is often taken as a measure of resolution in an ideal
telescope.
Image
of an astronomical object
can be considered as a multitude of points, each point spread into an
Airy function. This is written as a convolution:
![]() |
(2) |
Question: How the does diffraction-limited resolution depend on the wavelength?
Question: Compute the diffraction-limited resolution of human eye.
What happens if the telescope is not ideal? An image of a point source
would not be as good as Airy function, the resolution would be
degraded even more. But the imaging equation will still be
valid! So, the Point Spread Function (PSF)
is
all we need to know to characterize the imaging. The width of PSF is a
measure of resolution.
Note 1. Implicitly, we assumed in the above equations that
is an image of a star of unit intensity, i.e. the
integral of
over
is equal to 1. Thus, the
imaging equation preserves the total flux of an astronomical object,
only distributes it differently between the pixels.
Note 2. We assumed that PSF has the same shape over the whole field of view. This condition is called isoplanatism. It is not always true in astronomical imaging, particularly with Adaptive Optics, because PSF slowly changes in the field. In this case imaging equation can be applied to parts of the field.
The shape of PSF may be irregular; what numeric measures of resolution are used in this case?
1. FWHM = Full Width at Half Maximum of the PSF.
2. Strehl ratio
, i.e. the central intensity of
PSF compared to the central intensity of the Airy function. The higher is
Strehl ratio, the better is resolution. Diffraction-limited image is the
best, hence
always.
3. Encircled energy. By definition, the integral of PSF
is equal to 1. The PSF integral over the circle of radius
is called the encircled energy. This characteristic is
important for observations of faint objects, when one wants to
concentrate the photons as much as possible.
The example of the PSF with compensation of turbulence is given in the picture below.
Question: Supposing that PSF is two times narrower, by how much the Strehl ratio would change?
Question: What would be the Strehl ratio if one half of the
(ideal) telescope objective is given a phase delay of ?
Another way to look at the imaging equation is to use the Fourier Transforms (FT; we denote it by tilde). The convolution becomes a product, so
![]() |
(3) |
The
is called Optical transfer function
(OTF). It describes the change of the modulus and phase of the object
FT in the imaging process. The modulus of OTF is called
Modulation transfer function (MTF). For astronomical (incoherent)
imaging,
. Typically, MTF decreases with
increasing frequencies, hence the small (high-frequency) details in
the image are weakened and eventually lost.
It is known that for any optical system
for
, where
is called cutoff
frequency, and
is the maximal size of aperture. It means that the
information at the spatial frequencies above
is irrevocably
lost. We need larger telescopes to look at smaller objects!
The relation between PSF and OTF is Fourier transform, so if you know
one you know another, they are the same thing expressed
differently. From the FT properties it follows that
(PSF normalization), and that the Strehl ratio is proportional to the
integral of OTF over frequencies.
Question: What is the minimum size of a telescope needed to resolve a fence with 10 cm bar spacing from a distance of 5 km?
Atmospheric turbulence may be regarded as random phase aberrations
added to the telescope. These aberrations constantly change in time, so
does the PSF. Here we consider the average PSF, corresponding to long
exposure times. The theory leads to the expression
![]() |
(4) |
The atmospheric OTF is related to the statistics of atmospheric phase
aberrations, the so-called phase structure function
(see the next section):
![]() |
(5) |
Question: Suppose that the shape of the atmospheric PSF is Gaussian. What is the corresponding shape of the structure function?
Question: Is the shape of the atmospheric PSF sensitive to the
structure function in the regions where
and
?
The atmospherically distorted wave-front can be visualized as a smashed paper sheet. The wave arriving from a star is plane before entering the atmosphere. Then some parts of it go through the hotter-than-average air (less refractive index) and are advanced, other parts are retarded, and the plane wave-front is deformed. It is the purpose of Adaptive Optics to compensate these distortions. But first, we need to describe them in a statistical sense.
Air is slightly dispersive, but this is usually neglected, so that the
perturbations of the optical path length are considered
as achromatic. However, the phase of the optical wave
strongly depends on
the wavelength
! Speaking of perturbations, we assume that
their average values are zero,
(angular
brackets denote statistical averaging).
Although random processes like are typically described
by correlation functions or covariances, in the atmospheric science
the structure functions are preferred. Structure function is
the average difference between the two values of a random process:
![]() |
(6) |
Question: What is the relation between the structure function
and the covariance function
?
Question: How does the atmospheric
depend on the
wavelength
?
The Kolmogorov model of the turbulence distortions prescribes
the specific form of the phase structure function, namely
![]() |
(7) |
The above model, although it may seem primitive, is the basis of the whole theory of imaging through the turbulence, including Adaptive Optics. Of course, at large distances (more than few meters) and small distances (less than 1 cm) the model is not good, but this turns out to be not very important.
Question: What is the r.m.s. (root-mean-square) atmospheric phase
difference at the baseline of in radians and wavelengths?
Question: If cm at 0.5 micron wavelength, what is
at 2.2 micron wavelength?
Now we put this model into long-exposure atmospheric OTF, and get it in the form:
![]() |
(8) |
![]() |
(9) |
The Strehl number of the atmospheric PSF is exactly the same as in an
ideal telescope of diameter (this is the reason why a strange
coefficient 6.88 appears). So, for a large telescope,
,
the Strehl ratio is simply
.
Question: What is the Strehl ratio of long-exposure images at the 4 m telescope under 1 arcsecond seeing at the wavelengths of 0.5 and 2.2 microns?
The Fried radius is sometimes identified with the characteristic
scale of atmospheric perturbations. This is not quite true: we see
that Kolmogorov law does not have any characteristic scale. However,
only the perturbations of the size of the order of
are relevant
for long-exposure imaging. At smaller scales, the distortions are much
less than
, at larger scales the
becomes so large
that the atmospheric OTF is zero.
Locally, the strength of turbulent fluctuations of refractive index in
the air are described by a refractive index structure constant
which is measured in strange units, m
. The dependence
of
on altitude is called the turbulence profile. Seeing
depends on the integrated effect of all atmospheric layers:
![]() |
(10) |
.
This figure shows an example of the turbulence profile at Cerro Paranal, plotted in relative units (full line). The fraction of the turbulent energy up to a given altitude is overplotted in dashed line. Although in this instance a significant part of turbulence was concentrated in only two laters, still some 1/3 of the total energy is distributed continuously over all altitudes.
Question: From this relation, find how and seeing depend
on the zenith angle
.
Turbulence can often be modeled as fixed phase screens which are
driven by the wind in front of the telescope. Knowing the spatial
properties of the phase screens (structure function) and the wind
velocity, we know also the temporal behavior of the
perturbations. The atmospheric time constant
is defined
as
![]() |
(11) |
Question: Taking the typical value of =20 m/s, what
would be the atmospheric time constant at 0.5 and 2.2 micron
wavelengths under 1 arcsecond seeing?
The images of astronomical objects taken with exposure time
or shorter are called short-exposure images. They correspond to
fixed (frozen) atmospheric aberrations. At longer exposure times, the
aberrations are averaged, and for exposures much longer than
the long-exposure PSF is obtained.
The long-exposure atmospheric PSF is independent of the viewing
direction (isoplanatic), because the turbulence and its structure
function are statistically the same everywhere in the field. But the
instantaneous atmospheric phase aberrations do depend on the
direction: the telescope beam as projected on the atmospheric layer at
10 km shifts by 0.5 m for an angular offset of 10 arcseconds.
The standard definition of the atmospheric isoplanatic angle
is
![]() |
(12) |
Question: What is the isoplanatic angle at
wavelengths of 0.5 and 2.2 micron under 1 arcsecond seeing?
This phenomenon is very troublesome for Adaptive Optics, because it limits the distance between guide star and the scientific objects. It turns out that for most objects there is no suitable (bright and close-by) guide star, hence artificial laser guide stars are required. Alternatively, a 3-dimensional correction of turbulence ( Multi-Conjugate Adaptive Optics, MCAO) may be attempted to increase the corrected field.
In optics, the aberrations are often represented as a sum of special polynomials, called Zernike polynomials. Atmospheric random aberrations can be considered in the same way; however, the coefficients of these aberrations (defocus, astigmatism, etc.) are now random functions changing in time.
A Zernike polynomial
is defined in polar coordinates
on a circle of unit radius
. They are characterized
by a radial order
and an azimuthal order
(for a given
,
takes the values from 0 to
). Frequently, instead of two
indices
and
a continuous numeration with single index
is
used. For a given radial order
, there are a total of
Zernike polynomials.
The first Zernike modes are named as common aberrations, and have simple meaning (see the Table of the first 15 Zernike modes).
The power of Zernike modes comes from the fact that they are
orthonormal, that is the scalar product is equal to 1 if
and zero otherwise. The scalar product is defined as integral
over telescope aperture:
![]() |
(13) |
Now, any phase aberration
inside the telescope pupil
can be represented as an infinite sum of Zernike polynomials
![]() |
(14) |
![]() |
(15) |
Often a limited number of Zernike modes gives already a good enough representation of atmospheric aberrations. If these modes are corrected by Adaptive Optics, an almost diffraction-limited image quality is obtained.
Piston mode corresponds to a constant phase, which does not influence image. Usually the piston mode is ignored.
Question: A 4 m telescope, , is defocussed by 1
mm. Compute the resulting aberration
for wavelengths of 0.5 and
2.2 microns.
Question: Suppose that atmospheric aberration contains only
random tips and tilts with equal amplitudes
. Write the corresponding phase structure
function.
The orthonormality of Zernike modes gives an easy way to compute the
phase variance integrated over the pupil. For one mode, it is
. The variance for all modes is equal to the sum of squared
coefficients starting from 2 (piston excluded).
Given the Kolmogorov turbulence model, we can obtain the statistical
properties of the coefficients corresponding to the atmospheric
phase aberrations. The mathematical manipulations lead to a simple
formula:
![]() |
(16) |
i \ j | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
2 | 0.449 | 0 | 0 | 0 | 0 | 0 | 0.0142 | 0 | 0 |
3 | 0 | 0.449 | 0 | 0 | 0 | 0.0142 | 0 | 0 | 0 |
4 | 0 | 0 | 0.0232 | 0 | 0 | 0 | 0 | 0 | 0 |
5 | 0 | 0 | 0 | 0.0232 | 0 | 0 | 0 | 0 | 0 |
6 | 0 | 0 | 0 | 0 | 0.0232 | 0 | 0 | 0 | 0 |
7 | 0 | 0.0142 | 0 | 0 | 0 | 0.00619 | 0 | 0 | 0 |
8 | 0.0142 | 0 | 0 | 0 | 0 | 0 | 0.00619 | 0 | 0 |
9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00619 | 0 |
10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.00619 |
As you may see, the Noll matrix is almost diagonal (less so for higher
orders, however). Why the coefficient is missing? For
Kolmogorov turbulence it is infinite! However, the first (piston) mode
is of no importance to imaging.
Question: For 4m telescope and 1 arcsecond seeing, compute the r.m.s. amplitude of tilt in radians (for 0.5 micron wavelength). Convert it to the r.m.s. amplitude of stellar image motion. Does this amplitude depend on the wavelength?
What happens if we correct the first modes with Adaptive Optics? The
corresponding coefficients become zero, and the total phase variance
is reduced. Denoting the pupil-averaged phase variance as
, we write
![]() |
(17) |
The full, uncorrected atmospheric phase variance (all modes except
piston) corresponds to
.
In other words, in a
telescope with aperture diameter of
the atmospheric phase
variance is about 1 square radians.
If tip and tilt are
corrected, then
. It means that tip and tilt
contribute 87% of the total phase variance. Correcting the modes up
to radial order 2 leaves us with
, the radial order
3 leaves uncorrected variance of
. As you see,
further reduction of phase variance demands the correction of larger
and larger number of Zernike modes.
For a large number of corrected modes (
, as happens in real
systems), the assymptotic formula of Noll is very useful:
![]() |
(18) |
Question: Using the values of and
given
above, compute
.
How many modes do we need to correct? Opticians know that when the
residual phase is less than 1 radian, the image quality approaches the
diffraction limit. You have by now all the tools to predict the
required number of modes as a function of telescope diameter, seeing,
and wavelength! It is sufficient to write
and to work back all the formulas ( try to do it!). The
result is
![]() |
(19) |
Question: How many Zernike modes must be corrected at the 4 m telescope for imaging at 0.5 and 2.2 microns under 1 arcsecond seeing?
Is it necessary to correct the turbulence using Zernike modes? Of course not, the phase aberrations can be measured and corrected with any other set of basis functions, or without any modes at all, operating directly on the wave-fronts. It turns out that Zernike modes are the second best choice (the best set of modes is called Karunen-Loeve modes). The choice depends on the number of controlled parameters (modes) needed to achieve a given degree of correction; for Zernike modes it is less than for local wave-front control.
Summary. In this chapter, the fundamentals of imaging in ideal
and aberrated telescopes were reminded (PSF, OTF, diffraction limit,
Strehl ratio). Then the basic atmospheric parameters relevant for
Adaptive Optics (phase structure function, seeing, , time constant and isoplanatic angle)
were introduced. The de-composition of the random phase aberrations on
Zernike modes was studied. Now we are able to predict the number of
modes that need to be corrected.
TOP: Introduction
FORWARD: Deformable mirrors