PolarizationReturn to Table of Contents


Diffraction

If we look at the shadow cast by an opaque object, we would find that it is very intricate. In fact, the shadow would consist of bright and dark regions which are not expected from everyday geometrical optics. This is known as diffraction, and it was first shown in the 1600s to be a general characteristic of wave phenomenon which occurs whenever a portion of a wavefront is obstructed in some way. In particular, if a wave encounters an obstacle, then diffraction occurs when a region of the wavefront is altered in amplitude or phase.

It is important to realize that there is not physical difference between interference and diffraction. However, it is traditional to consider a phenomenon as interference when it involves the superposition of only a few waves, and as diffraction when a large number of waves are involved. Another aspect that is important to understand is the fact that every optical instrument only uses a portion of the full incident wavefront. Because of this, diffraction plays a significant role in the detailed understanding of the light train through the device. Even in all of the potential defects in the lens system were eliminated, the ultimate sharpness of the image would be limited by diffraction.

In order to begin to understand diffraction, let's return to Huygen's principle. Recall that this told us that each point on a wavefront can be viewed as a source of secondary spherical wavelets. From this, the progress of the wavefront as it moves through space can theoretically be determined. At any particular time, the shape of the wavefront is made up from the envelope of the secondary wavelets. There is a problem with this approach. In only considering the envelope of the secondary wavelets, Huygen's principle ignores most of the secondary wavelet and retains only the portion which is common to the envelope. As a result of this, Huygen's principle is unable to account for the details of the diffraction process. An example of this can be seen by comparing radio and visible light waves. Radio waves are seen to "bend" around large objects, such as buildings and telephone poles, but visible light creates a fairly distinct shadow. Huygen's principle is independent of any wavelength consideration and predicts the same wavefront configuration in both situations.

This problem was resolved when Fresnel added to Huygen's principle with the idea of interference. The resulting principle, known as the Huygens­Fresnel principle, states that every unobstructed point of a wavefront, at a given instant in time, serves as a source of spherical secondary wavelets, with the same frequency as that of the primary wave. The amplitude of the optical field at any point beyond is the superposition of all these wavelets, taking into consideration their amplitudes and relative phases. As an example of this, consider the following drawing

drawing

Define the maximum optical path length difference as formula. Assume that formula. Then when formula, we also have that formula. Since the waves were initially in phase, they must all interfere constructively, no matter where P happens to be. On the other hand, when formula, the area where formula is limited to a small region extending out directly in from of the aperture, and it is only there that all of the wavelets interfere constructively. Beyond this region, some of the wavelets can interfere destructively. This is the geometric shadow. Remember that the idealized geometric shadow corresponds to formula.

Coherent Oscillators

As a bridge between interference and diffraction, consider an arrangement made up of a linear array of N coherent point oscillators, which are all identical. Assume that the oscillators have no intrinsic phase difference. Then the rays that are emitted by the oscillators will be practically parallel. If the spatial extent of the oscillator array is small compared to the wavelength of the radiation, then the amplitudes of the separate waves arriving at some observation point P will be essentially equal,

 formula,                            (14.1)

where ri is the distance from the ith oscillator to the observation point P. The sum of the interfering spherical wavelets yields a composite electric field at P that is the real part of

 formula                         (14.2)

This can be rearranged to be

formula.

The phase difference between adjacent sources is obtained from the expression formula, where the maximum optical-path length difference is formula in a medium with an index of refraction n. But, since d is the distance between two adjacent oscillators, it can be easily seen that d sintheta = r2 - r1. Thus, the field at P becomes

 formula,           (14.3)

where formula is the distance from the center of the line of oscillators to the point P.

Once we know what the electric field is from the line of oscillators, the flux density can be determined. Using (14.3), the flux density is

 formula                                                      (14.4)

where I0 is the flux density that comes from any single source arriving at P. The principal maxima occur when formula, where m = 0, plus minus1, plus minus2, …, or using the definition formula,

 formula.                                                        (14.5)

As an example of this, consider an idealized line source of oscillators, i.e. one for which the width of the slit is much less than lambda. Each point source emits a spherical wavelet,

 formula                                                    (14.6)

where epsilon0 is the source strength. If we consider a infinitesimal segment of the array dyi, there are formula sources, where D is the entire length of the array and N is the total number of sources in the array. Letting N approach infinity, energy conservation requires that epsilon0 goes to zero in order for the total energy output to remain finite. This allows us to define the source strength per unit length of the array as

 formula.                                                       (14.7)

Using this, the electric field at some point P is found from

 formula                                               (14.8)

where r(y) is the distance from the element under consideration to P.

Fraunhofer Diffraction

Consider the case where the point of observation is very distant from the array line and R >> D. Then r(y) does not deviate very significantly from R. In this case, Eq. (14.8) becomes

 formula                                              (14.9)

If we write r as an explicit function of y, we get

 formula                                            (14.10)

where theta is measured from the x-z plane. The non-linear terms in y can be ignored when their contribution to the phase is insignificant. This is true whenever formula is negligible; a condition that is satisfied for all values of theta whenever R is large. This is known as the Fraunhofer condition, where the distance r is linear in y. In turn, this leads to the fact that the distance to the point of observation, and thus the phase, can be written as a linear function of the aperture variables.

Returning to Eq. (14.10), we see that Eq. (14.9) becomes

formula

which can be integrated to yield

 formula.                             (14.11)

In order to simplify (14.11), define

 formula                                                            (14.12)

Then

formula.

From E, the irradiance can be determined to be

 formula.                        (14.13)

Notice that when theta = 0, formula and I(theta) is a maximum. This maximum is known as the principal maximum, and the irrandiance resulting from an idealized coherent line source in the Fraunhofer approximation becomes

 formula.                                             (14.14)

Since formula, when D >> lambda, the irradiance drops extremely rapidly as theta deviates from zero. Also, when D >> lambda, the source, which is a relatively long coherent line source, can be viewed as a single point emitter radiating predominately in the forward direction. When the opposite is true, namely that lambda >> D, then beta is small and the irradiance remains essentially constant for all values of theta. This means that the line source more closely resembles a point source emitting spherical waves.

The Rectangular Aperture

Consider the following configuration
drawing

We wish to find the far-field flux density at some arbitrary point P. According to the Huygens-Fresnel principle, a differential area dS within the aperture may be envisioned as being covered with coherent secondary point sources. Since dS is much smaller in extent than lambda, all the contributions at P will remain in phase and interfere constructively. This is true regardless of theta. Let epsilonA be the source strength per unit area, and assume that it is constant over the entire aperture. Then the disturbance at P due to dS is either the real or imaginary part of

   formula.                                                 (14.15)

The distance from dS to P is

 formula.                                          (14.16)

The Fraunhofer condition allows us to replace r by R provided that the aperture is relatively small. However, we also need to have a constant phase. This causes Eq. (14.16) to become

 formula.                           (14.17)

When R is very large compared to the aperture, this simplifies to

formula

The total electrical field at P is then

 formula.                                    (14.18)

If the incident wave is originally propagating in the x direction and the aperture is oriented in the y-z plane, then the integral becomes

 formula                              (14.19)

where A = ab is the area of the aperture, formula, and formula. Thus, the flux density is

 formula                                            (14.20)

where I(0) is the irradiance at the center of the aperture.

The Circular Aperture

Fraunhofer diffraction through a circular aperture can be found in a manner similar to that used for the rectangular aperture. In this case, instead of using rectangular coordinates, the symmetry of the situation dictates the use of cylindrical coordinates. Thus Eq. (14.18) becomes

 formula,                                (14.21)

where

formula.

From symmetry, the result must not depend on capital phi, so we can set it to zero in Eq. (14.21). Consider the azimuthal integral first. The quantity

 formula                                             (14.22)

is known as a Bessel function of the first kind. Comparing it to the azimuthal integral in (14.21), we see that

 formula                                     (14.23)

Using the recurrence relationship for Bessel functions,

formula

where

formula,

this can be evaluated as

 formula                             (14.24)

where the relationship formula was used. The irradiance becomes

 formula.                                        (14.25)

At the center of the aperture, the irradiance is

 formula                                                       (14.26)

and so Eq. (14.25) becomes

 formula.                                        (14.27)

Resolution of Circular Images

The center of the aperture has a large circular maximum. This maximum is known as the Airy disk. The size of the Airy disk can be used to determine the maximum resolution of a lens system. For simplicity, consider two incoherent distant point sources of equal irradiance. The radius of the Airy disk is given by

 formula.                                                         (14.28)

If capital deltatheta is the corresponding angular measure, then, using the fact that

formula,

we find that

formula

The Airy disk for each source will be spread out over a half width capital deltatheta, centered on the geometric image point. If the angular separation of the two points is capital deltascript phi, and if formula the images will be distinct and easily resolved. As the two sources approach each other, their respective images would also approach each other, overlap, and blend into a single set of fringes. We can use Lord Rayleigh's criterion to determine when the two objects are just resolved. This criterion states that the resolution of two fringes of equal flux density requires that the principal maximum of one coincide with the first minimum of the other. Using this criterion, the two objects are just resolved when the center of one Airy disk falls on the first minimum of the Airy pattern of the other object. Thus, the angular limit of resolution is

 formula.                                               (14.29)

Fresnel Diffraction

In Fraunhofer diffraction, the diffracting system was relatively small and the point of observation was very distant. This allowed the potential problems associated with the Huygens-Fresnel principle to be completely passed over. Now we are concerned with the near-field region, which extends right up to the diffracting element itself.

We must reconsider the Huygens-Fresnel principle more closely to understand what is happening in this region. Recall that we can envision every point on the primary wavefront as a continuous emitter of spherical secondary wavelets. However, if each wavelet is radiating uniformly in all directions, then there would be a wave traveling back towards the source in addition to the normal outgoing wave. Since no such wave is found experimentally, we must somehow modify the radiation pattern of the secondary emitters. This can be done by introducing the obliquity, or inclination factor, K(theta). The obliquity is used to describe the directionality of the secondary emission. Kirchoff was the first person to analytically define the obliquity as

                                                    formula                                                                 (14.30)

where theta is the angle made with the normal, k, to the primary wavefront.

Consider a spherical wave emitted from a point S at a time t = 0. A time t' later, the wave has a radius of rho and is described by

                                                   formula.                                               (14.31)

We can divide the wavefront into a series of annular regions. The boundaries of the various regions correspond to the intersections of the wavefront with a series of spheres centered at some observation point, P, with radii given by

formula

where r0 is the minimum distance from P to the wavefront. These spheres are known as the Fresnel, or half period, zones. Since each zone is finite in extent, we can define a ring shaped differential area element dS associated with the zone. All of the point sources within dS are coherent, and we can assume that each one radiates in phase with the primary wave. Thus, in any zone each of the secondary wavelets travel a distance r to reach P at a time t, and all of the wavelets arrive there with the same phase, formula. We can assume that the source strength per unit area epsilonA of the secondary emitters on dS is proportional to the amplitude of the primary wave,

formula.

The contribution to the optical disturbance at P from the secondary sources on dS is therefore

                                               formula.                                (14.32)

The obliquity factor must vary slowly and thus can be assumed to be constant over a single Fresnel zone. Consider the following drawing

drawing

The area element dS is seen to be

formula

which, combined with the law of cosines

formula

yields

                                                    formula                                                  (14.33)

Substituting this into Eq. (14.32) and integrating yields

                                 formula                              (14.34)

If there are a total of m zones on the wavefront, then the sum of the optical disturbances from all m zones at P is

                                          formula                                           (14.35)

If m is odd, the series can be written in one of two ways. The first way is

                          formula                   (14.36)

while the second is

                       formula           (14.37)

This means that either formula or formula. Using Eqs. (14.36) and (14.37), these conditions become

                                                        formula                                                    (14.38)

and

                                                        formula                                                    (14.39)

from which is can be concluded that

                                                        formula                                                   (14.40)

If m is even, similar arguments leads to a result of

                                                       formula                                                    (14.41)

Fresnel showed that the last contributing zone satisfied

formula

so that (14.40) and (14.41) both reduces to

                                                               formula                                                       (14.42)

Thus, we see that the optical disturbance generated by the entire unobstructed wavefront is approximately equal to half the contribution from the first zone. 


PolarizationReturn to Table of Contents

Last updated: July 24, 1997

Comments to: D-Suson@tamuk.edu