| E-M Home Page/Notes/ | First Created 11/12/04 |
The Dirac delta function |
First, what is the Dirac delta function?
Recall that in elementary E&M, some problems were about point charges, and some problems were about charge distributions. Those distributions could be spread out over lines, surfaces, or volumes, depending on the facts. In first-year problems the distributions were almost always given as uniform, but the discussion would indicate that the density could vary from place to place, and that the same physics concepts would apply but the math would be more challenging.
Of course the idea of a smooth distribution is clearly an approximation, given that we know about atomic structure and the graininess of matter. But given quantum mechanical uncertainties, it might be less of an approximation that it seems. In any case, we know that around the house the graininess of atomic structure will almost never be relevant, whereas trying to deal with electrical matters in terms of lots of point charges is going to be unrealistically complicated; dealing with smooth, or "averaged", distributions will clearly be much simpler. Further, for most purposes any observation will determine only an average field, and we show in this course that the average of the exact field is quite accurately given by using only an averaged source.
For these various reasons, then, we generally write the physics relations in terms of the charge densities.
This then raises the question, given that the equations use only charge
densities, how do we use them to deal with a true point charge? - with an
object containing a nonzero specific amount of charge in a volume which is
experimentally indistinguishable from zero?
The answer is, we use the Dirac delta function. Specifically, the number
density (number per volume) for a point particle at the origin is a
physical model for the Dirac delta function
This delta function is a "three-dimensional" delta function. Easier to
indicate graphically is a "one-dimensional" Dirac delta function, usually
written as
The question arises, however, how drastically infinite is this function? To
show how this can be relevant, consider comparing densities for a point
charge of 1 unit and for a point charge of 2 units. Both are zero almost
everywhere, and 2*0 is 0, which is sensible. But both have infinity as
the only nonzero value of charge density, but isn't infinity the same as
infinity? We need a way of saying that one delta-function density is twice
the other. (And, in fact, infinity = 2*infinity is an accepted truth in
most branches of math that accept infinity as a value. But 3*infinity and
infinity*infinity are also infinity; how do I tell whether my infinity is
twice yours, or infinity times, or only 1/infinity times, yours?)
In the case of the delta function, we answer questions like these by
postulating a result for the integral of the function. Specifically,
Note that since the value of the delta function is zero almost everywhere,
we don't have to integrate over all space to get this definite answer. In
fact, the integral for any region that excludes getting argument zero, will
give zero; the integral for any region that includes the point where the
argument is zero, will give 1. (However, if the zero-argument point is on
the edge of the integration region, the result is ambiguous. This is
associated with the fact that the Dirac delta is not a function in the
usual technical sense, since infinity is not a number in the usual sense.
The technical term is that it is a "distribution"; distributions act like
functions in most ways, but not in all ways. It is also true that if
integrating a delta function arises in a physics problem and the
zero-argument point is on the region boundary, then the physical problem
probably was not well-posed, and the question should be restated in some
way. Often that will mean with the region slightly shifted, or the 'point'
contribution spread out, or both. Occasionally just changing the
mathematical description of the region can make the delta function's
treatment clear.)
However, this also means that the delta function by itself can't represent
a point charge's charge density function; the integral over all space of
a charge density is the total charge, not a number. So we multiply the
delta function by a factor of the charge value; now the integral over all
space gives the charge value times 1, or the charge, as it should.
All of this discussion is based on a discussion of delta(r) or
delta(r - a), or delta(x) or delta(x-a). A delta function
with some other form of argument still satisfies the description that it is
zero unless the argument is zero (whether the zero vector or the number
zero), and that it is infinite when the argument
is zero. But how strong the infinity is, which usually relates
to what value arises from an integral with this delta function in it, has
to be addressed by using rules for changing variables in integrals.
To start off, consider
Now consider
If we consider our final results as if we had obtained them from basic-form
delta functions, we find
A special case is delta(m x) = 1/|m| delta(x) (if the delta is
one-dimensional), and in terms of this result the general case can
be obtained by another argument:
Some WARNINGS:
i. Frequently delta is written where a purist would put delta3.
You can always be sure if you check out the argument; a vector argument
requires that delta3 is meant. Note that this does not apply
if merely there is a vector in the argument;
delta(a.x - 5) is a one-dimensional delta, the
argument is a scalar even though it is in terms of vectors.
ii. If the formula for delta( f(x) ) gives you a problem such as division
by zero, or if something using a delta function gives you delta(0), then
you need to go back to basic relationships, and possibly to the
idealizations that led to using the delta function. A physically
well-posed problem should generally not have such difficulties, but
taking that well-posed problem and using insufficiently-justified
idealizations, or occasionally ill-advised variable choices, can
bring in extraneous problems.
delta(r),
sometimes less ambiguously written as
delta3(r).
For a point particle elsewhere than the origin, say at a, we write
delta(r - a).
In either case, we postulate a "function" which has a value of zero
everywhere else, but a value of infinity at some particular point.
delta(x) or delta(x-a).
I believe the one-dimensional version is actually the function
Dirac introduced, since he introduced it in discussing the theory
of quantum mechanics. I am confident he was discussing quantum
mechanics in one dimension, since the expressions being discussed would
be more elaborate without being more informative if he used
three-dimensional illustrations.
The three-dimensional delta function can be constructed from the
one-dimensional function by writing
delta(r) = delta(x) delta(y) delta(z),
or in general
delta(V) = delta(Vx) delta(Vy)
delta(Vz).
Conversely, the one-dimensional function is a slice through the
three-dimensional function. (Note that a "graph" of the three-dimensional
function needs four-dimensional "graph paper": three for x,y,z and
one for the function value, call that axis f; the "slice" that gives the
one-dimensional function is a slice in the plane defined by the f axis
and one of the others, say the x-f plane as a slice in x,y,z,f space.)
integral,all space, delta(r) = 1,
or in one dimension
integral,-infinity to infinity, delta(x) = 1.
This is consistent with the model
I introduced; the integral over all space of a number density is the number
of particles, which in the case of a single point particle is the number 1.
integral,-3 to 4, of f(x) delta(x - 2) dx.
Unless x=2, it doesn't matter what f(x) gives, since whatever-it-is
will multiply zero. Hence we get the same result if we replace x with 2
in f(x). This gives
integral,-3 to 4, of f(2) delta(x - 2) dx.
(Notice that we replace x with 2 in what multiplies the delta; we
do not replace x in the delta function itself. That would change
the integrand to something*delta(0)=something*infinity=infinity at every
point, whereas the original integrand was something*delta(x-1) giving
something*zero=0 almost everywhere.) Now f(2) is a constant; it can
factor out of the integral, giving
f(2) integral,-3 to 4, of delta(x - 2) dx.
Now the integral gives 1, and the answer is f(2).
This generalizes to the rule
integral,a to b, f(x) delta(x - c) dx = f(c) if
a < c < b; = 0 if c < a < b or a < b < c.
If a > b we use the rule integral,a to b = - integral,b to a.
(If c = a or c = b, we go back and look at whether we are asking a realistic,
well-posed problem.)
integral,-5 to 5, cos x delta(x2 - 9) dx.
Here we will have two contributions, since x2 - 9 is zero for
each of two values of x, and both are in the integration region.
Introduce
I1 = integral,-5 to 0, cos x
delta(x2 - 9) dx
and
I2 = integral,0 to 5, cos x
delta(x2 - 9) dx.
Then as a first step,
I1 = cos(-3) integral,-5 to 0,
delta(x2 - 9) dx.
Next we can change variables, u = x2 - 9, giving
du = 2 x dx and
I1 = cos(-3) integral,16 to -9,
delta(u) du/(-2 sqrt(u + 9)).
Notice that, besides having delta(u), both dx and the
integration limits have changed. (The square root that replaces
x must be the negative square root, because for the original integral
x was negative or zero.) Notice that the order of the limits on u is
backward, too; we started a sum on x at -5 and progressed to 0; the
corresponding terms for u start at +16 and progress to -9.
This change in order of limits introduces a sign change, so we get
I1 = cos(-3) integral,-9 to 16,
delta(u) du/(2 sqrt(u + 9))
= cos(-3)/(2 sqrt(9)) = cos(3)/6.
An additional sign check is provided by noting that
integral,-5 to 0, delta(x2 - 9) dx
is the integral, in the normal sense, of a nonnegative function; it should
come out nonnegative, and we obtained +1/6 for it.
Similar manipulations apply to I2; this time there is no minus on
the square root but also there is no reversal of integration order (check
it out). The result is that I2 also equals cos(3)/6 and
the original integral equals cos(3)/3.
I1 = cos(-3)/(2*3)
= integral,-5 to 0, cos x
delta(x - (-3))/|2(-3)| dx
and
I2 = integral,0 to 5, cos x
delta(x - (3))/|2(3)| dx.
Generalizing leads to the relationship
delta( f(x) ) = sum,a, delta(x - a)/|f '(a)|,
where a runs through all the zeros of f and the denominator is the magnitude
of the value at a of the derivative of f.
First, because we get a contribution wherever delta(0) arises, the
result for delta( f(x) ) will consist of a sum over zeros of f.
Second, near where any of these contributions arise, we can use
approximations based on x being close to a.
(If it isn't close, the contribution to the integral will be zero anyway.)
Specifically, we can use a Taylor series for f(x). That gives
delta( f(x) ) = delta( [f(a)
+ f '(a)(x-a) + 1/2 f "(a) (x-a)2 + ...] ).
But by assumption a is a zero so f(a) = 0, and (x-a)2 will
be negligible. Hence, near a,
delta( f(x) ) = delta ( [f '(a) (x-a)] ), which
by the special case result
= delta(x-a)/|f '(a)|.