Molecular orbital theory

In chemistry, molecular orbital theory is the theory that deals with the definition and computation of molecular orbitals (MOs). The branch of chemistry that studies MO theory is called quantum chemistry.

Molecular orbitals are wave functions describing the quantum mechanical "motion"^[1] of one electron in the Coulomb (also known as electrostatic) field of all the nuclei of a molecule. In MO theory the electrostatic field due to the positive nuclei is screened (i.e., weakened) by an average electrostatic field due to the negative electrons.

The absolute square |φ|² (a one-electron density) of an MO φ is usually delocalized, that is, spread out over the whole molecule, hence the adjective "molecular" in the name. This is in contrast to an atomic orbital (AO), which gives rise to a one-electron density localized in the vicinity of a single atom.

The purpose of molecular orbital theory is to obtain approximate solutions of the time-independent Schrödinger equations of molecules. The Schrödinger equation contains a Hamilton operator (quantum mechanical energy operator) from which in general several terms of lesser importance are omitted, see molecular Hamiltonian for the details. The molecular orbitals resulting from the solutions of the Schrödinger equation form the key to all kinds of molecular properties of chemical interest.

In the great majority of MO theories an MO is expanded in a basis χ_i of AOs, centered on the different nuclei of the molecule. Let there be N_nuc nuclei in the molecule, let A run over the nuclei and let there be n_A AOs on the A-th nucleus, then the MO φ of electron 1 has the following LCAO (linear combination of atomic orbitals) form,

\phi (\mathbf {r} _{1})=\sum _{A=1}^{N_{\mathrm {nuc} }}\sum _{i=1}^{n_{A}}c_{Ai}\chi _{i}(\mathbf {r} _{A1}),\qquad c_{Ai}\in \mathbb {C} ,\qquad \qquad \qquad \qquad (1)

here $\scriptstyle \mathbf {r} _{A1}$ is the coordinate vector of electron 1 with respect to a Cartesian coordinate system with nucleus A as origin. Note in this context that nuclei are seen as point charges that are fixed in space.

Molecular orbital theory deals with the choice of the AOs χ_i and the derivation and solution of the equations for the computation of the expansion coefficients c_Ai. In MO theory the AOs are explicitly known functions (usually algebraic—as opposed to numerical—functions, see this article), and once the expansion coefficients have been determined, the molecular orbitals are known unambiguously and can be used to compute observable molecular properties.

Types of MO theory

A crucial part of MO theory is concerned with the evaluation of molecular integrals. These are 3- and 6-fold integrals over all space that contain as integrands products of AOs, centered on different nuclei, and operators arising from the molecular Hamiltonian. Before 1970, when computers were still in their infancy, the computation of these integrals formed a major hurdle. This is why approximations had to be introduced. Many of these approximations are based on the fact that certain groups of integrals can be seen to represent some empirical (experimentally observable) quantity, usually ionization potentials, electron affinities, or state energy differences. Replacement of groups of integrals by their empirical counterpart (rather than calculation) leads to methods known as semi-empirical MO methods.

When computers and quantum chemical software developed from the 1970s onward, the computation of all necessary molecular integrals became possible. Methods in which all integrals are computed are known as ab initio MO methods. The Latin phrase ab initio stands for from the beginning and implies that no empirical data enter the computation.^[2]

An important error in many MO calculations is the neglect of the electronic correlation. By the averaging inherent to Hartree-Fock MO theory, the correlation between the electronic motions is lost. If electron 1 is at point P, the chance that electron 2 will be near P is smaller than that it will be far away from P due to electrostatic repulsion beween the electrons (which falls off with the inverse interelectronic distance). Neither ab initio nor semi-empirical MO theory account for correlation. However, there is a third variant of MO theory, density functional theory (DFT) that—at least in principle—accounts of the electronic correlation. This method requires a choice of AOs, as do the other MO methods, but also knowledge of the density functional. Since the exact density functional is not known, many different approximations have been proposed, and in that sense DFT is reminiscent of semi-empirical theory with its freedom of choice in empirical parameters.

Hartree-Fock MO-LCAO theory

History

In the late 1920s and early 1930s the Briton Hartree and the Russian physicist Fock developed independently an effective one-electron method for the solution of many-electron problems. The method, now known as the Hartree-Fock method (HF), is an iterative method. During the the iteration the electron-electron interaction is averaged. Usually (but not always) this is a convergent process. The averaged, converged, electrostatic field due to the electrons is said to be self-consistent. In quantum chemistry the terms Hartree-Fock (HF) and self-consistent-field (SCF) are practically synonyms. Especially Hartree, (often in collaboration with his retired, non-physicist father), performed many calculations on atoms. Because of their spherical symmetry atoms are relatively simple: their radial and angular coordinates decouple. The HF equations used then were in operator form—not in matrix form.

Simultaneously, LCAO-MO theory was developed by Erich Hückel, Friedrich Hund, Robert S. Mulliken, John Lennard-Jones, and others. Because of computational difficulties this theory was qualitative or at most semi-quantative, meaning that so many approximations were introduced that the resulting equations were amenable to hand calculations. Symmetry played an important role. By symmetry arguments one can predict when MO coefficients will vanish, or whether some coefficients will be numerically equal.

These two threads were joined in 1951 by the Dutch-American physicist Clemens C.J. Roothaan^[3], who wrote the HF equations in a basis of atomic orbitals, thus obtaining matrix equations. It is no coincidence that this happened when electronic computers were on the horizon. It is also noteworthy that Roothaan was at the time working in the University of Chicago Laboratory directed by Robert Mulliken, the great advocate of MO theory.

Equations

Before sketching their derivation, we present and discuss the Hartree-Fock equations in the form given by Roothaan. Solution of these equations yield the expansion coefficients of Eq. (1). Since in general more than one MO is obtained we enter an extra label to this equation,

\phi _{j}(\mathbf {r} _{1})=\sum _{A=1}^{N_{\mathrm {nuc} }}\sum _{i=1}^{n_{A}}\chi _{Ai}(\mathbf {r} _{1})\,c_{Ai,j},\qquad j=1,\ldots ,N_{\mathrm {el} }/2,

where we assume that each orbital is doubly occupied (once with spin α, once with β) so that the number of occupied MOs is half the number N_el of electrons. The coefficients with fixed j form a column vector in which the rows are labeled by (Ai). Let (Ai) run from 1 to n, that is, denote the total number of AOs by n. We also made a small change in notation by appending the atom label (A) to χ, instead of to the electron coordinate. Both notations can be found in the literature.

The Roothaan equations have the form of a matrix eigenvalue equation, which is not surprising, as the corresponding HF operator equation is also an eigenvalue equation. The matrix to be diagonalized (known as the Fock matrix F ) depends on its eigenvectors. Here enters the same self-consistency procedure as in the original HF problem. First one must guess a set of eigenvectors before the Fock matrix can be constructed. Once the matrix has been constructed, with the guessed eigenvectors as input, it can be diagonalized, i.e., its eigenvectors and eigenvalues can be determined. The computed eigenvectors enter a new iteration cycle in which the Fock matrix is constructed and diagonalized, yielding eigenvectors which (hopefully) are closer to the final result. This procedure is continued until the eigenvectors that enter the Fock matrix are essentially the same as the eigenvectors that emerge from the diagonalization of the Fock matrix. The process is then self-consistent.

It must be remarked that the atomic orbitals, localized on different atoms, are non-orthogonal (but linearly independent), while the MOs are constrained to be orthogonal. The non-orthogonality of the basis turns the operator eigenvalue equation into a generalized matrix eigenvalue equation. That is, the matrix equation obtains the form

\mathbf {F} \mathbf {C} =\mathbf {S} \mathbf {C} {\boldsymbol {\mathrm {E} }}

The Fock matrix F is n × n, the eigenvector matrix C contains N_el/2 columns of length n, the overlap matrix S is n × n and the matrix E is diagonal with the N_el/2 orbital energies on the diagonal. If the AOs were orthonormal, S would be the identity matrix. S contains overlap integrals, which are of the form

S_{Ai,Bk}=\langle \chi _{Ai}|\chi _{Bk}\rangle .

Since $\scriptstyle \chi _{Ai}$ is expressed with respect to a system of axes on atom A, and $\scriptstyle \chi _{Bk}$ is centered on atom B, this bra-ket is a two-center integral, an integral over $\scriptstyle \mathbb {R} ^{3}$ in which the integrand is expressed with respect to two non-coinciding Cartesian systems of axes.

As is known from linear algebra the matrix S is non-singular (invertible) if and only if the n atomic orbitals are linearly independent. The matrix F is Hermitian (usually it is real, in which case the matrix is called symmetric). It is also known from linear algebra that the generalized eigenvalue problem with symmetric F and non-singular S can be solved, i.e., that the columns of C and the corresponding eigenvalues (diagonal elements of E) can be determined. We reiterate: since the AOs χ_Ai are known, after C has been computed the LCAO molecular orbitals φ_j have been determined.

Derivation

With regard to the derivation of the Roothaan equations the following. The derivation applies the variational principle which is based on the fact that the expectation value of the molecular Hamiltonian has a lower bound. An expectation value contains by definition the same wave function in bra and ket. For this function we take a "trial" function containing free parameters that can be varied to minimize the expectation value, and by the boundedness of the Hamiltonian the minimum exists. It is an (unproven) assumption of the variational method that the trial function that minimizes the expectation value gives the best approximation of the exact wave function.

Roothaan took, following V. Fock, a trial function in the form of a single Slater determinant. Basically this is an antisymmetrized N_el-electron product of spin MOs. In the Slater determinant each spatial MO φ appears twice, once multiplied by the spin function α and once by β. The MOs are expanded as above, and the expansion coefficients serve as variation parameters. When the same Slater determinant is inserted into bra and ket, the expectation value becomes a highly non-linear function of variation parameters. (In fact a Slater determinant is an N_el!-order multinomial in the coefficients and the expectation value is the square of this order). In order to proceed it is necessary to simplify this very complicated expression. At this point one introduces the constraint that the MOs are orthonormal,

\langle \phi _{j}|\phi _{k}\rangle =\delta _{jk}.

This constraint, for which there is no physical justification, simplifies the derivation considerably. In the 1930s, rules were derived by Slater and Condon & Shortley that allowed the simplification of the expectation value (the so-called Slater-Condon rules). These rules give the reduction of the expectation value to a form that is quartic (fourth power) in the coefficients.

The trick then applied by Roothaan to arrive finally at the matrix eigenvalue equation, was the introduction of the charge- and bond-order matrix (sometimes referred to as density matrix),

\mathbf {P} \equiv \mathbf {C} \,\mathbf {C} ^{T},

which is of dimension n × n. Taking this matrix as given and ignoring the fact that it contains the expansion coefficients, the expectation value becomes quadratic in the coefficients, just as in the linear variation method (also known as Rayleigh-Ritz method). Minimization then yields a matrix eigenvalue equation in the usual manner, with however, a Fock matrix that depends on C through P. In the first iteration P is computed from a guessed matrix C, in following iterations the matrix C is used from the previous iterative step.

(To be continued)

Note

↑ The quotes are here to remind us that the word motion relates to a stationary, time-independent wave function. In classical mechanics the word motion relates to a time-dependent trajectory. Since quantum mechanics accounts for the (non-zero) kinetic energy of particles the word motion is applicable, but with some care.
↑ This name is somewhat pretentious, since the nuclear geometry, (that is, the positions of the nuclei constituting the molecule in space), is often taken from experiment. Further, the choice of AO basis is an art in which experimental data form an important guideline.
↑ C. C.J. Roothaan, New Developments in Molecular Orbital Theory, Review Modern Physics, vol. 23, p. 69 (1951)

[1] The quotes are here to remind us that the word motion relates to a stationary, time-independent wave function. In classical mechanics the word motion relates to a time-dependent trajectory. Since quantum mechanics accounts for the (non-zero) kinetic energy of particles the word motion is applicable, but with some care.

[2] This name is somewhat pretentious, since the nuclear geometry, (that is, the positions of the nuclei constituting the molecule in space), is often taken from experiment. Further, the choice of AO basis is an art in which experimental data form an important guideline.

[3] C. C.J. Roothaan, New Developments in Molecular Orbital Theory, Review Modern Physics, vol. 23, p. 69 (1951)

[1]

[2]

[3]

Molecular orbital theory

Contents

Types of MO theory