Abstract vector spaces

The vectors in Rⁿ compose a vector space, because they can be added to each other and multiplied by scalars according to certain rules. (In fact, Rⁿ is a real vector space, because the scalars are real numbers. Later we shall treat complex vector spaces.)

In general, a vector space is any structure that behaves like Rⁿ.

Formal definition

A scalar is a real number. A (real) vector space consists of five things:

A set of objects, called vectors;
An operation assigning, to each ordered pair (x, y) of vectors, a vector x + y, called their sum;
For each scalar r, an operation assigning to each vector x a vector rx, called the scalar product of x by r ;
An operation assigning to each vector x a vector -x, called its negative;
A distinguished vector 0, called the zero-vector.

Moreover, the assignments of sums and products must satisfy the following rules (where bold-face letters are vectors, and italic letters are scalars, that is, real numbers):

Addition rules

x + y = y + x ;
(x + y) + z = x + (y + z) ;
x + 0 = x ;
x + -x = 0.

Scalar-multiplication rules

(r + s)x = rx + sx ;
r(x + y) = rx + ry ;
r(sx) = (rs)x ;

`Rule of unity'

1x = x .

This completes the definition of a real vector space. I have distinguished the `rule of unity' (my term) to emphasize that it does not follow from the other rules.

Consequences of the definition

You can prove the following from the vector-space rules:

0x = 0 .
r0 = 0 .
-1x = -x .
If rx = 0 and r is not zero, then x = 0.

Examples

Every vector space has at least one vector, namely the zero-vector. Possibly this is the only vector in the space, in which case the space is trivial.

The set R of real numbers, with its operations of addition and multiplication, is a vector space.

With the usual operations of addition and scalar multiplication, the m×n matrices compose a vector space, which we could call R^m×n.

If n is a nonnegative integer, then a polynomial of degree n is a sum

a₀ + a₁x + a₂x² ... + a_nxⁿ ,

where x is a variable, the a_i are scalars, and a_n is not zero. In particular, a scalar a is a polynomial of degree 0, if a is not zero; if a = 0, then a is the zero-polynomial, whose degree is less than zero. We can observe the following:

The polynomials of degree n do not compose a vector space;
the polynomials of degree less than n do compose a vector space (which is the trivial space if n = 0);
there is a vector space comprising all polynomials.

The set of polynomials of degree at most n can be denoted P_n.

The set of positive real numbers is a vector space, provided that addition of vectors is understood to be multiplication, and scalar-multiplication by a real number r is understood to be exponentiation by r.

Suppose A is an n×n matrix such that A² = A. We can define the product of a vector x in Rⁿ by a scalar r to be rAx. With this new product, and the usual addition, Rⁿ satisfies all rules for vector spaces, except the `rule of unity' (unless A is the identity).

Subspaces

Suppose V is a vector space, and W is a nonempty subset of V such that:

x + y is in W whenever x and y are in W;
rx is in W whenever r is a scalar and x is in W.

Then W is by definition a subspace of V. In particular, W is a vector space. (Note that 0 is in W.)

Any vector space has the following two subspaces: itself, and the trivial subspace {0}. (Possibly these are identical.)

If A is an m×n matrix, then the set of solutions x to the homogeneous linear system

Ax = 0

is a subspace of Rⁿ, called the nullspace of A.

Linear combinations and spanning sets

Suppose v₁, v₂,..., v_n are vectors in a vector space V. A linear combination of these vectors is a vector of the form

x₁v₁ + x₂v₂ + ... + x_nv_n ,

where the coefficients x_i are scalars. Note that any such vector is in V. If all of the coefficients x_i are zero, then the linear combination can be called trivial. The set of all linear combinations of the vectors v_i is a subspace W of V called the span of the v_i and denoted

span{v₁, v₂,..., v_n} .

We also say that W is spanned by the vectors v_i, and that the set {v₁, v₂,..., v_n} is a spanning set for W.

Spanning sets, in general, are not unique, and may be redundant. For example, Rⁿ is spanned by the set

{e₁, e₂, ..., e_n},

where e_i has 1 in row i and 0 everywhere else. But if v is any other vector in Rⁿ, then Rⁿ is also spanned by the set {e₁, e₂, ..., e_n, v}.

To check whether a vector b in R^m is a linear combination of vectors a₁, a₂,..., a_n, write the vectors a_i as the columns of an m×n matrix A, and set up the equation

Ax = b .

The product Ax is a linear combination of the columns of A, so the equation is consistent if and only if b is a linear combination of the columns of A.

Linearly independent sets

A set {v₁, v₂,..., v_r} of vectors from a vector space is called linearly independent if one of the following equivalent conditions holds:

None of the vectors v_i is a linear combination of the others;
The zero vector is not a nontrivial linear combination of the vectors v_i; in other words, if
x₁v₁ + x₂v₂ + ... + x_rv_r = 0 ,
then each of the coefficients x_i is zero.

A linearly independent set has no redundancy, in this sense: if a linearly independent set spans a certain vector space, and if you take a vector away from the set, then the remaining vectors no longer span the space. In particular, a linearly independent set cannot contain the zero-vector.

To check for linear independence in a set of n vectors in R^m, write the vectors as the columns of an m×n matrix. The nullspace of the matrix is the trivial vector space if and only if the columns are linearly independent.

Note a consequence of the last fact: No set of n vectors in R^m is linearly independent, if m<n. Such sets may or may not be independent, if m is not less than n.

Suppose A is an m×n matrix, and B is an n×r matrix. Then the columns of the product AB are linear combinations of the columns of A. Knowing whether the columns of one of these are independent tells you nothing about whether the columns of the other are independent.

Bases

[The word bases is the plural of basis, which is originally Greek.] Linear independence is an intrinsic property of a subset of a vector space. If a linearly independent set also spans the vector space it is in, then it is called a basis of that space.

Any linearly independent set is the basis of something, namely its span.

The standard basis of Rⁿ is the set {e₁, e₂, ..., e_n} (where e_i is as defined above; it is also column i of the n×n identity matrix). An arbitrary subset of Rⁿ is a basis (of Rⁿ) if and only if it has n elements, which are the columns of an invertible matrix.

Suppose B is a finite basis {v₁, v₂, ..., v_n} of a vector space V. Then for any vector v in V, there is a unique vector x in Rⁿ such that

v = x₁v₁ + x₂v₂ + ... x_nv_n ;

the vector x is called the B-coordinate vector of v, and denoted by

(v)_B .

The function from V to Rⁿ that takes a vector to its B-coordinate vector is a linear transformation. One can use it to show that every basis of V has size n; this justifies calling n the dimension of V.

next section