Affine space: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>Paul Wormer
imported>Paul Wormer
m (fixed link)
 
(23 intermediate revisions by 4 users not shown)
Line 1: Line 1:
{{subpages}}
{{subpages}}
The 2- and 3-dimensional point spaces studied in [[Euclidean geometry|elementary Euclidean geometry]] are examples of '''affine spaces''', ''A''<sub>2</sub>   and ''A''<sub>3</sub>, respectively.
The 2-dimensional plane, well-known from [[Euclidean geometry|elementary Euclidean geometry]], is an example of an '''affine space'''. Remember that in elementary geometry none of the points in the plane is special&mdash;there is no origin. A '''real''' ''n''-'''dimensional affine space''' is distinguished from the [[vector space]] <font style = "vertical-align: 15%"><math>\mathbb{R}^n</math></font> by  having no special point, no fixed origin.<ref>V. I . Arnold, ''Mathematical Methods of Classical Mechanics'',  translated from the Russian by K. Vogtmann and A. Weinstein.  Springer, New York (1978).</ref>


Assume that any two points ''P'' and ''Q'' in a space ''A'' can be connected by a line segment in ''A''; this possibility is Axiom 1 of [[Euclid's Elements|Euclidean geometry]].   If we order ''P'' and ''Q'' (we say that ''P'' comes before ''Q''), then the line segment obtains a direction (points from ''P'' to ''Q'') and has become a [[vector]], the ''difference vector'' <font style = "vertical-align: 20%"><math>\overrightarrow{PQ}</math></font>.  When all difference vectors can be  mapped onto vectors of the same ''n''-dimensional [[vector space]] ''V''<sub>''n''</sub>,  we may call the point space ''A''  an '''affine space''' of dimension ''n'', written ''A''<sub>''n''</sub>.
From elementary geometry  we know that any two points in a plane (a collection of infinitely many points) can be connected by a line segment. If the points  ''P'' and ''Q'' in a plane are [[ordered pair|ordered]] with ''P'' before ''Q'', the line segment connecting the two becomes an ''arrow'' pointing from ''P'' to ''Q''.  This arrow can be mapped onto a [[vector]], the ''difference vector'',  denoted by <font style = "vertical-align: 20%"><math>\overrightarrow{PQ}</math></font>.<ref> Sometimes it is stated: "the arrow is a vector", but in the present context it is necessary to carefully  distinguish  arrows from vectors.</ref> If all arrows in a plane can be  mapped onto vectors of a ''2''-dimensional [[vector space]]  ''V''<sub>''2''</sub>, called the ''difference space'',  the plane is  an <b>affine space of dimension ''2''</b>, denoted by  ''A''<sub>2</sub>.
Arrows that are  mapped onto the same vector in the difference space are said to be ''parallel'', they differ from each other by [[translation]].
 
In elementary [[analytic geometry]], the map of arrows onto  vectors is almost always defined by the choice of an origin ''O'', which is a point somewhere in the plane.  Clearly, an arbitrary point ''P'' is the head of an arrow with tail in the origin and corresponding with the unique difference vector <font style = "vertical-align: 20%"><math>\overrightarrow{OP}</math></font>. All arrows with tail in ''O'' are mapped one-to-one onto a 2-dimensional difference space ''V''<sub>''2''</sub>, with the vector addition in ''V''<sub>''2''</sub>  in one-to-one correspondence with the [[vector (mathematics)|parallelogram rule]] for the addition of arrows in the plane.
 
Usually one equips the difference space  with an inner product, turning it into an [[inner product space]].  Its elements  have  well-defined length, namely, the  square root of the inner product of the vector with itself. The distance between  any two points ''P'' and ''Q'' may now be defined as the length of  <font style = "vertical-align: 20%"><math>\overrightarrow{PQ}</math></font> in ''V''<sub>''2''</sub>.  A two-dimensional affine space, with this distance defined between the points, is the ''Euclidean plane'' known from high-school geometry.
 
Upon formalizing and generalizing the definition of an affine space,  we replace the dimension 2 by an arbitrary finite dimension ''n'' and replace arrows  by ordered pairs of points ("head" and "tail") in a given point space ''A''. Briefly, ''A'' is an affine space of dimension ''n'' if there exists a map of the [[Cartesian product]], ''A'' &times; ''A'' onto a vector space of dimension ''n''. This map must satisfy certain axioms that are treated in the next section. If the dimension needs to be exhibited, we may write ''A''<sub>''n''</sub> for the affine space of dimension ''n''.


Usually one takes as a difference space an [[inner product space]]. Its elements have a well-defined length, namely, the  square root of the inner product of the vector with itself. The distance  between points ''P'' and ''Q'' in an affine space is then defined as the length of the image of  <font style = "vertical-align: 20%"><math>\overrightarrow{PQ}</math></font> in ''V''<sub>''n''</sub>.  Difference vectors that are  mapped onto the same element of ''V''<sub>''n''</sub> are said to be ''parallel'', they differ from each other by [[translation]].
==Formal definition==
==Formal definition==
We will restrict the definition to vector spaces over the [[Field theory (mathematics)|field]] <font style = "vertical-align: 10%"><math>\mathbb{R}</math></font> of real numbers.  
We will restrict the definition to [[vector space]]s over the [[Field theory (mathematics)|field]] <font style = "vertical-align: 10%"><math>\mathbb{R}</math></font> of real numbers.  
 
Let ''V'' be an ''n''-dimensional vector space and ''A'' a set of elements that we will call points. Assume that a relation between points and vectors is defined in the following way:
Let ''V'' be an ''n''-dimensional vector space and ''A'' a set of elements that we will call points. Assume that a relation between points and vectors is defined in the following way:
# To every ordered pair ''P'', ''Q'' of ''A'' there is assigned a vector of ''V'', called the difference vector, denoted by <font style = "vertical-align: 20%"><math>\overrightarrow{PQ} \in V</math></font>.
# To every ordered pair ''P'', ''Q'' of ''A'' there is assigned a vector of ''V'', called the difference vector, denoted by <font style = "vertical-align: 20%"><math>\overrightarrow{PQ} \in V</math></font>.
# To every point ''P'' of ''A'' and every vector <font style = "vertical-align: 15%"><math>\vec{v}</math></font> of ''V'' there exists exactly one point ''Q'' in ''A'', such that <font style = "vertical-align: 15%"><math>\overrightarrow{PQ} = \vec{v}</math></font>
# To every point ''P'' of ''A'' and every vector <font style = "vertical-align: 15%"><math>\vec{v}</math></font> of ''V'' there exists exactly one point ''Q'' in ''A'', such that <font style = "vertical-align: 15%"><math>\overrightarrow{PQ} = \vec{v}</math></font>.
# If ''P'', ''Q'', and ''R'' are three arbitrary points in ''A'', then
# If ''P'', ''Q'', and ''R'' are three arbitrary points in ''A'', then
:::<math>
:::<math>
\overrightarrow{PQ} + \overrightarrow{QR}  = \overrightarrow{PR}.
\overrightarrow{PQ} + \overrightarrow{QR}  = \overrightarrow{PR}.
</math>  
</math>  
If these three requirements are fulfilled, the set ''A'' is called an ''n''-dimensional affine space with the difference space ''V''.
If these three postulates hold, the set ''A'' is <i>an ''n''-dimensional affine space with difference space ''V''</i>.
 
Two immediate and important consequences are:
:<math>
\begin{align}
\overrightarrow{PP} + \overrightarrow{PR}  &= \overrightarrow{PR} &\Longrightarrow\quad
\overrightarrow{PP} = \vec{0}.\qquad \\
\overrightarrow{PQ} + \overrightarrow{QP}  &= \vec{0}\qquad &\Longrightarrow\quad
\overrightarrow{PQ} = -\overrightarrow{QP}. \\
\end{align}
</math>
 


Immediate consequences:
'''Lemma 1:'''
:<math>
:<math>
\overrightarrow{PP} + \overrightarrow{PR}  = \overrightarrow{PR} \quad\Longrightarrow\quad
\overrightarrow{PQ} = \vec{0} \quad \Longleftrightarrow\quad P = Q
\overrightarrow{PP} = \vec{0}.
</math>.
</math>  
'''Proof:''' &nbsp;  If the points coincide, <math>\,P = Q</math>,  we just saw that the difference vector is the zero vector. Conversely, assume that &nbsp; <math>\,P \ne Q</math> &nbsp; and &nbsp;  <font style = "vertical-align: 10%"><math>\overrightarrow{PQ} = \vec{0}</math></font>. Then for an arbitrary point <font style = "vertical-align: 8%"><math>\,R</math></font>,
:<math>
:<math>
\overrightarrow{PQ} + \overrightarrow{QP} = \vec{0} \quad\Longrightarrow\quad
\vec{0} = \overrightarrow{PQ} = -\overrightarrow{RP} + \overrightarrow{RQ} \quad\Longrightarrow\quad
\overrightarrow{PQ} = -\overrightarrow{QP}.
\overrightarrow{RP} = \overrightarrow{RQ},
</math>
</math>
which implies that the same vector in ''V'' connects <font style = "vertical-align: 10%"><math>\,R </math></font> in ''A'' with two different points, which by postulate 2 is forbidden.
==Parallelogram law==
==Parallelogram law==
[[Image:Affine points.png|right|thumb|200px|Parallelogram law]]
{{Image|Affine points.png|right|200px|The parallelogram law in the 2-dimensional Euclidean plane.}}
Consider four points in ''A'': &nbsp; ''P''<sub>1</sub>, ''P''<sub>2</sub>, ''Q''<sub>1</sub>, and ''Q''<sub>2</sub>.  Assume that the following difference vectors are equal, i.e., the corresponding line-segments in ''A'' are parallel  and of equal length:
Consider four points in ''A'': &nbsp; ''P''<sub>1</sub>, ''P''<sub>2</sub>, ''Q''<sub>1</sub>, and ''Q''<sub>2</sub>.  Assume that the following difference vectors are equal,  
:<math>
:<math>
\overrightarrow{P_1Q_1} = \overrightarrow{P_2Q_2}  
\overrightarrow{P_1Q_1} = \overrightarrow{P_2Q_2}  
</math>
</math>
then the four points form a parallelogram, that is,
then we may exchange ''Q''<sub>1</sub> and ''P''<sub>2</sub>,
:<math>
:<math>
\overrightarrow{P_1P_2} = \overrightarrow{Q_1Q_2},
\overrightarrow{P_1P_2} = \overrightarrow{Q_1Q_2}.
</math>
</math>
see the figure.
See the figure for a concrete example in which the four points form a parallelogram.


'''Proof''' Subtract the following equations:
'''Proof:''' &nbsp; Subtract the following equations:
:<math>
:<math>
\begin{align}
\begin{align}
Line 49: Line 69:
+ \overrightarrow{P_1Q_1} = \vec{0}.
+ \overrightarrow{P_1Q_1} = \vec{0}.
</math>
</math>
==Position vector==
Choose a fixed point ''O'' in the affine space ''A'', an ''origin''.  Every point ''P'' is uniquely determined by the vector <font style = "vertical-align: 20%"><math> \overrightarrow{OP} \in V</math></font>. Indeed, suppose  that there is another point ''Q'' such that  <font style = "vertical-align: 15%"><math>\overrightarrow{OP}  = \overrightarrow{OQ} </math></font>,  it then follows from lemma 1 that ''P'' = ''Q''.
The vector <font style = "vertical-align: 20%"><math>\vec{v}\equiv\overrightarrow{OP}</math></font> is the <i>position vector of ''P'' with respect to ''O''</i>. After choosing ''O'' every point ''P'' in ''A'' is uniquely identified by its corresponding position vector <font style = "vertical-align: 15%"><math>\vec{v} \in V</math></font>.
Choice of another origin ''O'' '''&prime;''' gives a translation of <font style = "vertical-align: 12%"><math>\vec{v}</math></font> by <font style = "vertical-align: 25%"><math>-\vec{t} \equiv -\overrightarrow{OO\;'}</math></font>, for
:<math>
\vec{v} \equiv \overrightarrow{OP} = \overrightarrow{OO\;'}+ \overrightarrow{O'P} = \vec{t} + \vec{v}\;'
\quad\Longrightarrow\quad \vec{v}\;' = \vec{v} - \vec{t}
</math>
with <font style = "vertical-align: 15%"><math> \vec{v}\;' </math></font> the position vector of ''P'' with respect to ''O'' '''&prime;'''.
==Affine coordinate systems==
An ''affine coordinate system''
:<math>
\mathbb{B} \equiv (O; \,\vec{e}_1, \vec{e}_2,\ldots, \vec{e}_n)
</math>
consists of an origin ''O'' in ''A'' and a [[basis]] &nbsp;  <font style = "vertical-align: 5%"><math>\vec{e}_i\,\, (i=1,\ldots,n)</math></font>  &nbsp; of the difference space ''V''. Then every point ''P'' in ''A'' determines a system of ''n'' real numbers ''x''<sub>''i''</sub> (''i'' = 1, ..., ''n'') by
:<math>
\overrightarrow{OP} = \sum_{i=1}^n \, x_i\, \vec{e}_i \in V.
</math>
The numbers ''x''<sub>''i''</sub> (''i'' = 1, ..., ''n'') are the ''affine coordinates'' of ''P'' with respect to the given coordinate system. Note that ''O'' has the coordinates ''x''<sub>''i''</sub> = 0.
Consider now two different affine coordinate systems,
:<math>
\mathbb{B} \equiv (O; \,\vec{e}_1, \vec{e}_2,\ldots, \vec{e}_n)\quad \hbox{and}\quad\mathbb{B}' \equiv (O'; \,\vec{f}_1, \vec{f}_2,\ldots, \vec{f}_n).
</math>
Write
:<math>
\vec{f}_i = \sum_{j=1}^n \vec{e}_j \, A_{ji} \quad\hbox{and}\quad \overrightarrow{OO'} = \sum_{i=1}^n t_i\, \vec{e}_i.
</math>
The matrix (''A''<sub>''i j''</sub>)  transforms the one basis of ''V'' into the other, hence it is a square regular (invertible) matrix.  The real numbers ''t''<sub>''i''</sub> are the affine coordinates of ''O''<b>&prime;</b> relative to  <font style = "vertical-align: 10%"><math>\mathbb{B}</math></font>.
Express a fixed point ''P'' with respect to <font style = "vertical-align: 12%"><math>\mathbb{B}</math></font> and <font style = "vertical-align: 19%"><math>\mathbb{B}'</math></font>, 
:<math>
\overrightarrow{OP} = \sum_{i=1}^n \, x_i\, \vec{e}_i\quad\hbox{and}\quad
\overrightarrow{O'P} = \sum_{i=1}^n \, y_i\, \vec{f}_i
</math>
Insert into the second equation
:<math>
\overrightarrow{O'P} = \overrightarrow{OP} -\overrightarrow{OO'}
</math>
and express <font style = "vertical-align: 12%"><math>\vec{f}_i</math></font> in <font style = "vertical-align: 7%"><math>\vec{e}_j</math></font> , then
:<math>
\sum_{i=1}^n y_i\, \vec{f}_i = \sum_{j=1}^n (x_j - t_j) \vec{e}_j= \sum_{j=1}^n \vec{e}_j  \sum_{j=1}^n  \, A_{ji}\, y_i,
</math>
so that the transformation from the one affine coordinate system to the other is,
:<math>
(x_j-t_j) = \sum_{i=1}^n A_{ji}\, y_i  \quad \Longrightarrow\quad \mathbf{x} = \mathbf{t} + \mathbf{A} \mathbf{y},
</math>
where we introduced bold lowercase letters for real column-vectors (stacks of ''n'' real numbers) and
the boldface capital indicates an ''n'' &times; ''n'' matrix.
Inversion of the non-singular (regular) matrix (''A''<sub>''i j''</sub>) gives the inverse transformation,
:<math>
y_j = \sum_{i=1}^n (A^{-1})_{ji}\, (x_i-t_i)
\quad \Longrightarrow\quad \mathbf{y} = \mathbf{A}^{-1} \,(\mathbf{x} - \mathbf{t}).
</math>
==Affine maps==
Let ''P'' &rarr; ''P''&prime; be a mapping of the affine space ''A'' into itself; if the map satisfies the following  two conditions, it is an ''affine map''.
(i) Conserve "parallelism". (In the Euclidean plane this condition implies that  sets of mutually parallel arrows are mapped onto sets of mutually parallel arrows. Note, however, that in general the mapped arrows are not parallel to the original arrows):
:<math>
\overrightarrow{P_1 Q_1} = \overrightarrow{P_2 Q_2}\quad \Longrightarrow\quad
\overrightarrow{P'_1 Q'_1} = \overrightarrow{P'_2 Q'_2}
</math>
(ii) The map is linear in the difference space. That is, the map &phi;: ''V'' &rarr; ''V'' defined by
:<math>
\varphi(\overrightarrow{P_1 Q_1}) = \overrightarrow{P'_1 Q'_1} \qquad\qquad\qquad\qquad(1)
</math>
is linear.<ref>Recall that &phi; is linear when
:<math>
\varphi(a\vec{v} +b \vec{w}) = a \varphi(\vec{v}) + b \varphi(\vec{w}), \qquad \vec{v}, \vec{w} \in V, \quad a,b\in \mathbb{R}.
</math>
</ref>
A ''translation'' is an affine map with &phi; the identity operation,
:<math>
\varphi(\overrightarrow{P_1 Q_1})\equiv \overrightarrow{P'_1 Q'_1} = \overrightarrow{P_1 Q_1}.
</math>
Here the origin and the image are parallel.
{{Image|Affine map.png|right|350px|Fig. 1. Affine map. Here &phi; is linear, but not necessarily length preserving.}}
Given two points ''O'' and ''O''&prime; and a linear map &phi;:  ''V'' &rarr; ''V''. There exists exactly one affine map that sends ''O'' into ''O''&prime;  and induces &phi; on ''V''. This is the map
:<math>
\overrightarrow{OP\,'}  = \overrightarrow{OO\,'} + \varphi(\overrightarrow{OP}) ,
</math>
because (see Fig. 1),
:<math>
\begin{align}
\varphi(\overrightarrow{OP}) &= \overrightarrow{OP\,''}\\
\overrightarrow{P''P\,'} &= \overrightarrow{OO\,'}\\
\overrightarrow{OP\,'} &= \overrightarrow{P''P\,'} + \overrightarrow{OP\,''} =
\overrightarrow{OO\,'} + \varphi(\overrightarrow{OP}) .
\end{align}
</math>
Note: since <font style = "vertical-align: 22%"><math>\overrightarrow{OP\,''} = \overrightarrow{O'P\,'}</math></font> by the parallelogram law (see above), the map &phi; may also be defined as
:<math>
\varphi(\overrightarrow{OP}) = \overrightarrow{O'P\,'},
</math>
as was done in Eq. (1).
Choose a basis
:<math>
(O; \,\vec{e}_1, \vec{e}_2,\ldots, \vec{e}_n)
</math>
then
:<math>
\overrightarrow{OP\,'} = \sum_{i=1}^n p'_i \vec{e}_i, \qquad
\overrightarrow{OO\,'} = \sum_{i=1}^n t_i \vec{e}_i, \qquad
\varphi(\overrightarrow{OP}) = \sum_{i=1}^n \vec{e}_i\sum_{j=1}^n F_{ij} p_j,
</math>
from which follows the matrix-vector expression for an affine map,
:<math>
p'_i  = t_i  +\sum_{j=1}^n F_{ij} p_j, \quad i=1,\ldots,n  \quad\Longleftrightarrow\quad
\mathbf{p}' = \mathbf{t} + \mathbf{F} \mathbf{p}
</math>
Often<ref>The computer language [[PostScript]] calls the augmented matrix (for ''n'' = 2) the "current transformation matrix" (CTM).</ref> one writes the last expression for an affine map with the aid of a square (''n''+1) &times; (''n''+1) matrix that contains '''F''' on the diagonal and that is augmented with the translation vector '''t''' and the number 1,
:<math>
\begin{pmatrix}
\mathbf{p}' \\
1
\end{pmatrix}
=
\begin{pmatrix}
\mathbf{F} & \mathbf{t} \\
\mathbf{0} &  1 \\
\end{pmatrix}
\begin{pmatrix}
\mathbf{p} \\
1
\end{pmatrix} .
</math>
==Euclidean space==
Let ''A'' be an affine  space with difference space ''V'' on which a positive-definite inner product  is defined. Then ''A'' is called a ''Euclidean space''. The distance between two point ''P'' and ''Q''
is defined by the length ,
:<math>
\rho(P,Q) = |\overrightarrow{PQ}| \equiv \left(\overrightarrow{PQ},\; \overrightarrow{PQ}\right)^{1/2}
</math>
where the expression between round brackets indicates the inner product of the vector with itself.
It follows from the properties of the real inner product that the distance has the usual properties,
# &rho;(''P'',''Q'') &ge; 0 and &rho;(''P'',''Q'') = 0 if and only if ''P'' = ''Q''
# &rho;(''P'',''Q'') = &rho;(''Q'',''P'')
# &rho;(''P'',''Q'') &le; &rho;(''P'',''R'') + &rho;(''R'',''Q'')
A ''[[rigid motion]]'' of a Euclidean space is an affine map which preserves distances. The linear map &phi; on ''V'' is then a [[rotation]]. Conversely, given a rotation &phi; and two points ''P'' and ''P''&prime; then there exists exactly one rigid motion which sends ''P'' into ''P''&prime; and induces &phi; on ''V''.
==Notes and references==
<references />
*A. Lichnerowicz, ''Elements of Tensor Calculus'', Translated from the French by J. W. Leech and D. J. Newman, Methuen (London) 1962.
*W. H. Greub, ''Linear Algebra'', 2nd edition, Springer (Berlin) 1963.

Latest revision as of 09:09, 4 April 2010

This article is developing and not approved.
Main Article
Discussion
Related Articles  [?]
Bibliography  [?]
External Links  [?]
Citable Version  [?]
 
This editable Main Article is under development and subject to a disclaimer.

The 2-dimensional plane, well-known from elementary Euclidean geometry, is an example of an affine space. Remember that in elementary geometry none of the points in the plane is special—there is no origin. A real n-dimensional affine space is distinguished from the vector space by having no special point, no fixed origin.[1]

From elementary geometry we know that any two points in a plane (a collection of infinitely many points) can be connected by a line segment. If the points P and Q in a plane are ordered with P before Q, the line segment connecting the two becomes an arrow pointing from P to Q. This arrow can be mapped onto a vector, the difference vector, denoted by .[2] If all arrows in a plane can be mapped onto vectors of a 2-dimensional vector space V2, called the difference space, the plane is an affine space of dimension 2, denoted by A2. Arrows that are mapped onto the same vector in the difference space are said to be parallel, they differ from each other by translation.

In elementary analytic geometry, the map of arrows onto vectors is almost always defined by the choice of an origin O, which is a point somewhere in the plane. Clearly, an arbitrary point P is the head of an arrow with tail in the origin and corresponding with the unique difference vector . All arrows with tail in O are mapped one-to-one onto a 2-dimensional difference space V2, with the vector addition in V2 in one-to-one correspondence with the parallelogram rule for the addition of arrows in the plane.

Usually one equips the difference space with an inner product, turning it into an inner product space. Its elements have well-defined length, namely, the square root of the inner product of the vector with itself. The distance between any two points P and Q may now be defined as the length of in V2. A two-dimensional affine space, with this distance defined between the points, is the Euclidean plane known from high-school geometry.

Upon formalizing and generalizing the definition of an affine space, we replace the dimension 2 by an arbitrary finite dimension n and replace arrows by ordered pairs of points ("head" and "tail") in a given point space A. Briefly, A is an affine space of dimension n if there exists a map of the Cartesian product, A × A onto a vector space of dimension n. This map must satisfy certain axioms that are treated in the next section. If the dimension needs to be exhibited, we may write An for the affine space of dimension n.

Formal definition

We will restrict the definition to vector spaces over the field of real numbers.

Let V be an n-dimensional vector space and A a set of elements that we will call points. Assume that a relation between points and vectors is defined in the following way:

  1. To every ordered pair P, Q of A there is assigned a vector of V, called the difference vector, denoted by .
  2. To every point P of A and every vector of V there exists exactly one point Q in A, such that .
  3. If P, Q, and R are three arbitrary points in A, then

If these three postulates hold, the set A is an n-dimensional affine space with difference space V.

Two immediate and important consequences are:


Lemma 1:

.

Proof:   If the points coincide, , we just saw that the difference vector is the zero vector. Conversely, assume that     and   . Then for an arbitrary point ,

which implies that the same vector in V connects in A with two different points, which by postulate 2 is forbidden.

Parallelogram law

CC Image
The parallelogram law in the 2-dimensional Euclidean plane.

Consider four points in A:   P1, P2, Q1, and Q2. Assume that the following difference vectors are equal,

then we may exchange Q1 and P2,

See the figure for a concrete example in which the four points form a parallelogram.

Proof:   Subtract the following equations:

This gives

Position vector

Choose a fixed point O in the affine space A, an origin. Every point P is uniquely determined by the vector . Indeed, suppose that there is another point Q such that , it then follows from lemma 1 that P = Q.

The vector is the position vector of P with respect to O. After choosing O every point P in A is uniquely identified by its corresponding position vector .

Choice of another origin O gives a translation of by , for

with the position vector of P with respect to O .

Affine coordinate systems

An affine coordinate system

consists of an origin O in A and a basis     of the difference space V. Then every point P in A determines a system of n real numbers xi (i = 1, ..., n) by

The numbers xi (i = 1, ..., n) are the affine coordinates of P with respect to the given coordinate system. Note that O has the coordinates xi = 0.

Consider now two different affine coordinate systems,

Write

The matrix (Ai j) transforms the one basis of V into the other, hence it is a square regular (invertible) matrix. The real numbers ti are the affine coordinates of O relative to .

Express a fixed point P with respect to and ,

Insert into the second equation

and express in , then

so that the transformation from the one affine coordinate system to the other is,

where we introduced bold lowercase letters for real column-vectors (stacks of n real numbers) and the boldface capital indicates an n × n matrix. Inversion of the non-singular (regular) matrix (Ai j) gives the inverse transformation,

Affine maps

Let PP′ be a mapping of the affine space A into itself; if the map satisfies the following two conditions, it is an affine map.

(i) Conserve "parallelism". (In the Euclidean plane this condition implies that sets of mutually parallel arrows are mapped onto sets of mutually parallel arrows. Note, however, that in general the mapped arrows are not parallel to the original arrows):

(ii) The map is linear in the difference space. That is, the map φ: VV defined by

is linear.[3] A translation is an affine map with φ the identity operation,

Here the origin and the image are parallel.


CC Image
Fig. 1. Affine map. Here φ is linear, but not necessarily length preserving.

Given two points O and O′ and a linear map φ: VV. There exists exactly one affine map that sends O into O′ and induces φ on V. This is the map

because (see Fig. 1),

Note: since by the parallelogram law (see above), the map φ may also be defined as

as was done in Eq. (1).

Choose a basis

then

from which follows the matrix-vector expression for an affine map,

Often[4] one writes the last expression for an affine map with the aid of a square (n+1) × (n+1) matrix that contains F on the diagonal and that is augmented with the translation vector t and the number 1,

Euclidean space

Let A be an affine space with difference space V on which a positive-definite inner product is defined. Then A is called a Euclidean space. The distance between two point P and Q is defined by the length ,

where the expression between round brackets indicates the inner product of the vector with itself. It follows from the properties of the real inner product that the distance has the usual properties,

  1. ρ(P,Q) ≥ 0 and ρ(P,Q) = 0 if and only if P = Q
  2. ρ(P,Q) = ρ(Q,P)
  3. ρ(P,Q) ≤ ρ(P,R) + ρ(R,Q)

A rigid motion of a Euclidean space is an affine map which preserves distances. The linear map φ on V is then a rotation. Conversely, given a rotation φ and two points P and P′ then there exists exactly one rigid motion which sends P into P′ and induces φ on V.

Notes and references

  1. V. I . Arnold, Mathematical Methods of Classical Mechanics, translated from the Russian by K. Vogtmann and A. Weinstein. Springer, New York (1978).
  2. Sometimes it is stated: "the arrow is a vector", but in the present context it is necessary to carefully distinguish arrows from vectors.
  3. Recall that φ is linear when
  4. The computer language PostScript calls the augmented matrix (for n = 2) the "current transformation matrix" (CTM).
  • A. Lichnerowicz, Elements of Tensor Calculus, Translated from the French by J. W. Leech and D. J. Newman, Methuen (London) 1962.
  • W. H. Greub, Linear Algebra, 2nd edition, Springer (Berlin) 1963.