Rotation matrix: Difference between revisions

Revision as of 06:51, 14 May 2009

In mathematics and physics a rotation matrix is synonymous with a 3×3 orthogonal matrix, which is a matrix R satisfying

\mathbf {R} ^{\mathrm {T} }=\mathbf {R} ^{-1},

where T stands for the transposed matrix and R⁻¹ is the inverse of R.

Connection of orthogonal matrix to rotation

In general a motion of a rigid body (which is equivalent to an angle and distance preserving transformation of affine space) can be described as a translation of the body followed by a rotation. By a translation all points of the rigid body are displaced, while under a rotation at least one point stays in place. Let the the fixed point be O. By Euler's theorem follows that then not only the point is fixed but also an axis—the rotation axis— through the fixed point. Write ${\hat {n}}$ for the unit vector along the rotation axis and φ for the angle over which the body is rotated, then the rotation is written as ${\mathcal {R}}(\varphi ,{\hat {n}}).$

Erect three Cartesian coordinate axes with the origin in the fixed point O and take unit vectors ${\hat {e}}_{x},\;{\hat {e}}_{y},\;{\hat {e}}_{z}$ along the axes, then the rotation matrix $\mathbf {R} (\varphi ,{\hat {n}})$ is defined by its elements $R_{ji}(\varphi ,{\hat {n}})$ :

{\mathcal {R}}(\varphi ,{\hat {n}})({\hat {e}}_{i})=\sum _{j=x,y,x}{\hat {e}}_{j}R_{ji}(\varphi ,{\hat {n}})\quad {\hbox{for}}\quad i=x,y,z.

In a more condensed notation this equation can be written as

{\mathcal {R}}(\varphi ,{\hat {n}})\left({\hat {e}}_{x},\;{\hat {e}}_{y},\;{\hat {e}}_{z}\right)=\left({\hat {e}}_{x},\;{\hat {e}}_{y},\;{\hat {e}}_{z}\right)\;\mathbf {R} (\varphi ,{\hat {n}}).

Given a basis of a linear space, the association between a linear map and its matrix is one-to-one.

Since a rotation leaves angles and distances invariant, for any pair of vectors ${\vec {a}}$ and ${\vec {b}}$ in $\mathbb {R} ^{3}$ the inner product is invariant,

\left({\mathcal {R}}({\vec {a}}),\;{\mathcal {R}}({\vec {b}})\right)=\left({\vec {a}},\;{\vec {b}}\right).

A linear map with this property is called orthogonal. It is easily shown that a similar vector-matrix relation holds. First we define

{\vec {a}}=\left({\hat {e}}_{x},\;{\hat {e}}_{y},\;{\hat {e}}_{z}\right){\begin{pmatrix}a_{x}\\a_{y}\\a_{z}\end{pmatrix}}\equiv \left({\hat {e}}_{x},\;{\hat {e}}_{y},\;{\hat {e}}_{z}\right)\mathbf {a} \quad {\hbox{and}}\quad {\vec {b}}=\left({\hat {e}}_{x},\;{\hat {e}}_{y},\;{\hat {e}}_{z}\right){\begin{pmatrix}b_{x}\\b_{y}\\b_{z}\end{pmatrix}}\equiv \left({\hat {e}}_{x},\;{\hat {e}}_{y},\;{\hat {e}}_{z}\right)\mathbf {b}

and observe that the inner product becomes by virtue of the orthonormality of the basis vectors

\left({\vec {a}},\;{\vec {b}}\right)=\mathbf {a} ^{\mathrm {T} }\mathbf {b} \equiv \left(a_{x},\;a_{y},\;a_{z}\right){\begin{pmatrix}b_{x}\\b_{y}\\b_{z}\end{pmatrix}}\equiv a_{x}b_{x}+a_{y}b_{y}+a_{z}b_{z}.

The invariance of the inner product under ${\mathcal {R}}$ leads to

{\big (}\mathbf {R} \mathbf {a} {\big )}^{\mathrm {T} }\;\mathbf {R} \mathbf {b} =\mathbf {a} ^{\mathrm {T} }\mathbf {R} ^{\mathrm {T} }\;\mathbf {R} \mathbf {b}

since this holds for any pair a and b it follows that a rotation matrix satisfies

\mathbf {R} ^{\mathrm {T} }\mathbf {R} =\mathbf {E}

where E is the 3×3 identity matrix. For finite-dimensional matrices one shows easily

\mathbf {R} ^{\mathrm {T} }\mathbf {R} =\mathbf {E} \quad \Longleftrightarrow \quad \mathbf {R} \mathbf {R} ^{\mathrm {T} }=\mathbf {E} .

A matrix with this property is called orthogonal. So, a rotation gives rise to a unique orthogonal matrix.

Conversely, consider a point P in the body and let the vector ${\overrightarrow {OP}}$ connect O with P. Express this vector with respect to a Cartesian frame in O, giving the column vector p (three stacked real numbers). Multiply p by the orthogonal matrix R, then Rp represents the rotated point P′ (the vector ${\overrightarrow {OP'}}$ is expressed with respect to the same Cartesian frame). If we map all points P of the body by the same matrix R in this manner, we have rotated the body. Thus, an orthogonal matrix leads to a unique rotation.

Properties of orthogonal matrix

Writing out matrix products it follows that both the rows and the columns of the matrix are orthonormal (normalized and orthogonal). Indeed,

{\begin{aligned}\mathbf {R} ^{\mathrm {T} }\mathbf {R} &=\mathbf {E} \quad \Longrightarrow \quad \sum _{k=1}^{3}R_{ki}\,R_{kj}=\delta _{ij}\quad {\hbox{(columns)}}\\\mathbf {R} \mathbf {R} ^{\mathrm {T} }&=\mathbf {E} \quad \Longrightarrow \quad \sum _{k=1}^{3}R_{ik}\,R_{jk}=\delta _{ij}\quad {\hbox{(rows)}}\\\end{aligned}}

where δ_ij is the Kronecker delta.

Orthogonal matrices come in two flavors: proper (det = 1) and improper (det = −1) rotations. Indeed, invoking some properties of determinants, one can prove

1=\det(\mathbf {E} )=\det(\mathbf {R} ^{\mathrm {T} }\mathbf {R} )=\det(\mathbf {R} ^{\mathrm {T} })\det(\mathbf {R} )=\det(\mathbf {R} )^{2}\quad \Longrightarrow \quad \det(\mathbf {R} )=\pm 1.

Compact notation

A compact way of presenting the same results is the following. Designate the columns of R by r₁, r₂, r₃, i.e.,

\mathbf {R} =\left(\mathbf {r} _{1},\,\mathbf {r} _{2},\,\mathbf {r} _{3}\right)

.

The matrix R is orthogonal if

\mathbf {r} _{i}\cdot \mathbf {r} _{j}=\delta _{ij},\quad i,j=1,2,3.

The matrix R is a proper rotation matrix, if it is orthogonal and if r₁, r₂, r₃ form a right-handed set, i.e.,

\mathbf {r} _{i}\times \mathbf {r} _{j}=\sum _{k=1}^{3}\,\varepsilon _{ijk}\mathbf {r} _{k}.

Here the symbol × indicates a cross product and $\varepsilon _{ijk}$ is the antisymmetric Levi-Civita symbol,

{\begin{aligned}\varepsilon _{123}=&\;\varepsilon _{312}=\varepsilon _{231}=1\\\varepsilon _{213}=&\;\varepsilon _{321}=\varepsilon _{132}=-1\end{aligned}}

and $\varepsilon _{ijk}=0$ if two or more indices are equal.

The matrix R is an improper rotation matrix if its column vectors form a left-handed set, i.e.,

\mathbf {r} _{i}\times \mathbf {r} _{j}=-\sum _{k=1}^{3}\,\varepsilon _{ijk}\mathbf {r} _{k}\;.

The last two equations can be condensed into one equation

\mathbf {r} _{i}\times \mathbf {r} _{j}=\det(\mathbf {R} )\sum _{k=1}^{3}\;\varepsilon _{ijk}\mathbf {r} _{k}

by virtue of the the fact that the determinant of a proper rotation matrix is 1 and of an improper rotation −1. This was proved above, an alternative proof is the following: The determinant of a 3×3 matrix with column vectors a, b, and c can be written as scalar triple product

\det \left(\mathbf {a} ,\,\mathbf {b} ,\,\mathbf {c} \right)=\mathbf {a} \cdot (\mathbf {b} \times \mathbf {c} )

.

It was just shown that for a proper rotation the columns of R are orthonormal and satisfy,

\mathbf {r} _{1}\cdot (\mathbf {r} _{2}\times \mathbf {r} _{3})=\mathbf {r} _{1}\cdot \left(\sum _{k=1}^{3}\,\varepsilon _{23k}\,\mathbf {r} _{k}\right)=\varepsilon _{231}=1.

Likewise the determinant is −1 for an improper rotation.

Explicit expression

Let ${\overrightarrow {OP}}\equiv {\vec {r}}$ be a vector pointing from the fixed point O of a rotating rigid body to an arbitrary point P of the body. A rotation of this arbitrary vector around the unit vector ${\hat {n}}$ over an angle φ can be written as

{\mathcal {R}}(\varphi ,{\hat {n}})({\vec {r}}\,)={\vec {r}}\,'=\left[{\vec {r}}-({\hat {n}}\cdot {\vec {r}}\,)\;{\hat {n}}\right]\cos \varphi +({\hat {n}}\times {\vec {r}}\,)\sin \varphi .

where • indicates an inner product and the symbol × a cross product.

Rotation of vector

\scriptstyle {\vec {r}}

around axis

\scriptstyle {\hat {n}}

over an angle φ. The red vectors are in the plane of drawing spanned by

\scriptstyle {\vec {r}}

and

\scriptstyle {\hat {n}}

. The blue vectors are rotated, the green cross product points away from the reader and is perpendicular to the plane of drawing.

@@ Line 1: / Line 1: @@
-A '''rotation''' of a 3-dimensional rigid body is a motion of the body that leaves one point, ''O'', fixed. By [[Euler's theorem (rotation)|Euler's theorem]] follows that then not only the point is fixed but also an axis&mdash;the ''rotation axis''&mdash; through the fixed point. Write <math>\hat{n}</math> for the unit vector along the rotation axis and &phi; for the angle over which the body is rotated, then the rotation is written as <math> \mathcal{R}(\varphi, \hat{n}). </math>
+In [[mathematics]] and [[physics]] a '''rotation matrix''' is synonymous with a 3&times;3 [[orthogonal matrix]], which is a matrix   '''R''' satisfying
+:<math>
+\mathbf{R}^\mathrm{T} = \mathbf{R}^{-1},
+</math>
+where T stands for the [[transposed matrix]] and '''R'''<sup>&minus;1</sup> is the [[inverse matrix| inverse]] of '''R'''.
+==Connection of orthogonal matrix to rotation==
+In general a motion  of a rigid body (which is equivalent to an angle and distance preserving transformation of [[affine space]]) can be described as a translation of the body followed by a rotation. By a translation ''all'' points of the rigid body are displaced, while under a rotation at least one point stays in place. Let the the fixed point be ''O''.  By [[Euler's theorem (rotation)|Euler's theorem]] follows that then not only the point is fixed but also an axis&mdash;the ''rotation axis''&mdash; through the fixed point. Write <math>\hat{n}</math> for the unit vector along the rotation axis and &phi; for the angle over which the body is rotated, then the rotation is written as <math> \mathcal{R}(\varphi, \hat{n}). </math>
+Erect  three [[Cartesian coordinates|Cartesian coordinate]] axes with the origin in the fixed point ''O'' and take unit vectors <math>\hat{e}_x,\;\hat{e}_y,\;\hat{e}_z</math> along the axes, then the ''rotation matrix'' <math>\mathbf{R}(\varphi, \hat{n})</math> is defined by its elements
+<math>R_{ji}(\varphi, \hat{n})</math> :
-Erect  three [[Cartesian coordinates|Cartesian coordinate]] axes with the origin in the fixed point ''O'' and take unit vectors <math>\hat{e}_x,\;\hat{e}_y,\;\hat{e}_z</math> along the axes, then the '''rotation matrix''' <math>\mathbf{R}(\varphi, \hat{n})</math> is defined by its elements
-<math>R_{ji}(\varphi, \hat{n})</math>:
 :<math>
 \mathcal{R}(\varphi, \hat{n})(\hat{e}_i) = \sum_{j=x,y,x} \hat{e}_j R_{ji}(\varphi, \hat{n})
 \quad\hbox{for}\quad i=x,y,z.
 </math>
-In a more condensed notation this equation is written as
+In a more condensed notation this equation can be written as
 :<math>
 \mathcal{R}(\varphi, \hat{n})\left(\hat{e}_x,\;\hat{e}_y,\;\hat{e}_z\right) =
@@ Line 13: / Line 21: @@
 </math>
 Given a basis of a linear space, the association between a linear map and its matrix is one-to-one.
-==Properties of matrix==
-Since rotation conserves the shape of a rigid body, it leaves angles and distances invariant. In other words, for any pair of vectors
+Since a  rotation leaves angles and distances invariant, for any pair of vectors
 <math>\vec{a}</math> and <math>\vec{b}</math> in <math>\mathbb{R}^3</math> the [[inner product]] is invariant,
 :<math>
 \left(\mathcal{R}(\vec{a}),\;\mathcal{R}(\vec{b}) \right) = \left(\vec{a},\;\vec{b}\right).
 </math>
-A linear map with this property is called ''orthogonal''.  It is easily shown that a similar  vector/matrix relation holds. First we define
+A linear map with this property is called ''orthogonal''.  It is easily shown that a similar  vector-matrix relation holds. First we define
 :<math>
 \vec{a} =\left(\hat{e}_x,\;\hat{e}_y,\;\hat{e}_z\right)\begin{pmatrix}a_x\\a_y\\a_z\end{pmatrix}
@@ Line 47: / Line 55: @@
 \mathbf{R}^\mathrm{T} \mathbf{R}  = \mathbf{E} \quad \Longleftrightarrow\quad\mathbf{R}\mathbf{R}^\mathrm{T}   = \mathbf{E}.
 </math>
-A matrix with this property is also called ''orthogonal''. Writing out the two matrix products it follows that both the rows and the columns of the matrix are orthonormal (normalized and orthogonal). Indeed,
+A matrix with this property is called ''orthogonal''. So, a rotation gives rise to a unique orthogonal matrix.
+Conversely, consider a point ''P'' in the body and let the vector <font style="vertical-align: text-bottom"> <math>\overrightarrow{OP}</math></font> connect ''O''  with ''P''.  Express this vector with respect to a Cartesian frame in ''O'', giving the column vector '''p''' (three stacked real numbers). Multiply '''p''' by  the orthogonal matrix '''R''', then '''R'''<b>p</b> represents the rotated point ''P''&prime;  (the vector <font style="vertical-align: text-bottom"> <math>\overrightarrow{OP'}</math></font> is expressed with respect to the same Cartesian frame).  If we map all points ''P'' of the body by the same matrix  '''R''' in this manner, we have rotated the body. Thus, an orthogonal matrix leads to a unique rotation.
+==Properties of orthogonal matrix==
+Writing out  matrix products it follows that both the rows and the columns of the matrix are orthonormal (normalized and orthogonal). Indeed,
 :<math>
 \begin{align}
@@ Line 58: / Line 70: @@
 where &delta;<sub>ij</sub> is the [[Kronecker delta]].
-Orthogonal matrices come in two flavors: proper (det = 1) and improper (det = &minus;1) rotations.
+Orthogonal matrices come in two flavors: ''proper'' (det = 1) and ''improper'' (det = &minus;1) rotations. Indeed, invoking some properties of determinants, one can prove
-Invoking some properties of determinants, one can prove
 :<math>
 =\det(\mathbf{E})=\det(\mathbf{R}^\mathrm{T}\mathbf{R}) = \det(\mathbf{R}^\mathrm{T})\det(\mathbf{R})

Rotation matrix: Difference between revisions

Revision as of 06:51, 14 May 2009

Contents

Connection of orthogonal matrix to rotation

Properties of orthogonal matrix

Compact notation

Explicit expression

Navigation menu

Rotation matrix: Difference between revisions

Revision as of 06:51, 14 May 2009

Connection of orthogonal matrix to rotation

Properties of orthogonal matrix

Compact notation

Explicit expression

Navigation menu

Search