Linear combination

In mathematics, a linear combination or superposition is an expression constructed from a set of terms by multiplying each term by a constant and adding the results (e.g. a linear combination of x and y would be any expression of the form ax + by, where a and b are constants).^[1]^[2]^[3]^[4] The concept of linear combinations is central to linear algebra and related fields of mathematics. Most of this article deals with linear combinations in the context of a vector space over a field, with some generalizations given at the end of the article.

Definition

Let V be a vector space over the field K. As usual, we call elements of V vectors and call elements of K scalars. If v₁,...,v_n are vectors and a₁,...,a_n are scalars, then the linear combination of those vectors with those scalars as coefficients is

a_{1}\mathbf {v} _{1}+a_{2}\mathbf {v} _{2}+a_{3}\mathbf {v} _{3}+\cdots +a_{n}\mathbf {v} _{n}.

There is some ambiguity in the use of the term "linear combination" as to whether it refers to the expression or to its value. In most cases the value is emphasized, as in the assertion "the set of all linear combinations of v₁,...,v_n always forms a subspace". However, one could also say "two different linear combinations can have the same value" in which case the reference is to the expression. The subtle difference between these uses is the essence of the notion of linear dependence: a family F of vectors is linearly independent precisely if any linear combination of the vectors in F (as value) is uniquely so (as expression). In any case, even when viewed as expressions, all that matters about a linear combination is the coefficient of each v_i; trivial modifications such as permuting the terms or adding terms with zero coefficient do not produce distinct linear combinations.

In a given situation, K and V may be specified explicitly, or they may be obvious from context. In that case, we often speak of a linear combination of the vectors v₁,...,v_n, with the coefficients unspecified (except that they must belong to K). Or, if S is a subset of V, we may speak of a linear combination of vectors in S, where both the coefficients and the vectors are unspecified, except that the vectors must belong to the set S (and the coefficients must belong to K). Finally, we may speak simply of a linear combination, where nothing is specified (except that the vectors must belong to V and the coefficients must belong to K); in this case one is probably referring to the expression, since every vector in V is certainly the value of some linear combination.

Note that by definition, a linear combination involves only finitely many vectors (except as described in the § Generalizations section). However, the set S that the vectors are taken from (if one is mentioned) can still be infinite; each individual linear combination will only involve finitely many vectors. Also, there is no reason that n cannot be zero; in that case, we declare by convention that the result of the linear combination is the zero vector in V.
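As a minimal sketch of the definition, the following Python function evaluates a linear combination of tuples with rational coefficients. The function name `linear_combination` and the explicit `dim` parameter (needed to produce the zero vector when n = 0) are illustrative assumptions, not part of any standard library.

```python
from fractions import Fraction

def linear_combination(coeffs, vectors, dim):
    """Return a_1*v_1 + ... + a_n*v_n for vectors given as tuples of length dim."""
    if len(coeffs) != len(vectors):
        raise ValueError("need exactly one coefficient per vector")
    result = [Fraction(0)] * dim  # the empty combination is the zero vector
    for a, v in zip(coeffs, vectors):
        if len(v) != dim:
            raise ValueError("vector has the wrong dimension")
        for k in range(dim):
            result[k] += Fraction(a) * v[k]
    return tuple(result)

# 2*(1, 0) + 3*(1, 1)
example = linear_combination([2, 3], [(1, 0), (1, 1)], dim=2)
# n = 0: no vectors at all, so the result is the zero vector
empty = linear_combination([], [], dim=3)
```

Here `example` is the vector (5, 3), and `empty` is the zero vector of R³, matching the convention for n = 0.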

Examples and counterexamples

Euclidean vectors

Let the field K be the set R of real numbers, and let the vector space V be the Euclidean space R³. Consider the vectors e₁ = (1,0,0), e₂ = (0,1,0) and e₃ = (0,0,1). Then any vector in R³ is a linear combination of e₁, e₂, and e₃.

To see that this is so, take an arbitrary vector (a₁,a₂,a₃) in R³, and write:

{\begin{aligned}(a_{1},a_{2},a_{3})&=(a_{1},0,0)+(0,a_{2},0)+(0,0,a_{3})\\[6pt]&=a_{1}(1,0,0)+a_{2}(0,1,0)+a_{3}(0,0,1)\\[6pt]&=a_{1}\mathbf {e} _{1}+a_{2}\mathbf {e} _{2}+a_{3}\mathbf {e} _{3}.\end{aligned}}
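The same decomposition can be checked mechanically. The sketch below (the helper name `combine` is an assumption made for illustration) uses the components of a vector as coefficients of e₁, e₂, and e₃ and rebuilds the vector.

```python
e1, e2, e3 = (1, 0, 0), (0, 1, 0), (0, 0, 1)

def combine(a1, a2, a3):
    """Return a1*e1 + a2*e2 + a3*e3, computed componentwise."""
    return tuple(a1 * x + a2 * y + a3 * z for x, y, z in zip(e1, e2, e3))

v = (4, -2, 7)
rebuilt = combine(*v)  # the components of v serve as the coefficients
```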

Functions

Let K be the set C of all complex numbers, and let V be the set C_C(R) of all continuous functions from the real line R to the complex plane C. Consider the vectors (functions) f and g defined by f(t) := e^{it} and g(t) := e^{−it}. (Here, e is the base of the natural logarithm, about 2.71828..., and i is the imaginary unit, a square root of −1.) Some linear combinations of f and g are:

$\cos t={\tfrac {1}{2}}\,e^{it}+{\tfrac {1}{2}}\,e^{-it}$
$2\sin t=(-i)e^{it}+(i)e^{-it}.$

On the other hand, the constant function 3 is not a linear combination of f and g. To see this, suppose that 3 could be written as a linear combination of e^{it} and e^{−it}. This means that there would exist complex scalars a and b such that ae^{it} + be^{−it} = 3 for all real numbers t. Setting t = 0 gives a + b = 3. Setting t = π, where e^{iπ} = e^{−iπ} = −1 (see Euler's identity), gives −a − b = 3, that is, a + b = −3. These two equations contradict each other, so no such a and b exist.
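These identities can be spot-checked numerically. The following is only a sketch using Python's standard `cmath` module: it samples a few values of t rather than proving anything, and the helper name `combo` and the sample points are arbitrary choices.

```python
import cmath
import math

def f(t):
    return cmath.exp(1j * t)

def g(t):
    return cmath.exp(-1j * t)

def combo(a, b, t):
    """Value at t of the linear combination a*f + b*g."""
    return a * f(t) + b * g(t)

samples = [0.0, 0.5, 1.0, 2.0, math.pi]

# cos t = (1/2) f(t) + (1/2) g(t)
cos_ok = all(
    cmath.isclose(combo(0.5, 0.5, t), math.cos(t), abs_tol=1e-12) for t in samples
)

# 2 sin t = (-i) f(t) + (i) g(t)
sin_ok = all(
    cmath.isclose(combo(-1j, 1j, t), 2 * math.sin(t), abs_tol=1e-12) for t in samples
)

# Coefficients chosen so that a + b = 3: the combination equals 3 at t = 0,
# but it is then forced to be close to -3 at t = pi.
a, b = 2 + 1j, 1 - 1j
at_zero = combo(a, b, 0.0)
at_pi = combo(a, b, math.pi)
```

The last two values illustrate the argument above: matching the constant 3 at t = 0 forces the value −3 (up to rounding) at t = π.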

Polynomials

Let K be R, C, or any field, and let V be the set P of all polynomials with coefficients taken from the field K. Consider the vectors (polynomials) p₁ := 1, p₂ := x + 1, and p₃ := x² + x + 1.

Is the polynomial x² − 1 a linear combination of p₁, p₂, and p₃? To find out, consider an arbitrary linear combination of these vectors and try to see when it equals the desired vector x² − 1. Picking arbitrary coefficients a₁, a₂, and a₃, we want

a_{1}(1)+a_{2}(x+1)+a_{3}(x^{2}+x+1)=x^{2}-1.

Multiplying the polynomials out, this means

(a_{1})+(a_{2}x+a_{2})+(a_{3}x^{2}+a_{3}x+a_{3})=x^{2}-1

and collecting like powers of x, we get

a_{3}x^{2}+(a_{2}+a_{3})x+(a_{1}+a_{2}+a_{3})=1x^{2}+0x+(-1).

Two polynomials are equal if and only if their corresponding coefficients are equal, so we can conclude

a_{3}=1,\quad a_{2}+a_{3}=0,\quad a_{1}+a_{2}+a_{3}=-1.

This system of linear equations can easily be solved. First, the first equation simply says that a₃ is 1. Knowing that, we can solve the second equation for a₂, which comes out to −1. Finally, the last equation tells us that a₁ is also −1. Therefore, the only possible way to get a linear combination is with these coefficients. Indeed,

x^{2}-1=-1-(x+1)+(x^{2}+x+1)=-p_{1}-p_{2}+p_{3}

so x² − 1 is a linear combination of p₁, p₂, and p₃.

On the other hand, what about the polynomial x³ − 1? If we try to make this vector a linear combination of p₁, p₂, and p₃, then following the same process as before, we get the equation

{\begin{aligned}&0x^{3}+a_{3}x^{2}+(a_{2}+a_{3})x+(a_{1}+a_{2}+a_{3})\\[5pt]={}&1x^{3}+0x^{2}+0x+(-1).\end{aligned}}

However, when we set corresponding coefficients equal in this case, the equation for x³ is

0=1

which is always false. Therefore, there is no way for this to work, and x³ − 1 is not a linear combination of p₁, p₂, and p₃.
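The coefficient comparison used in both cases can be sketched in Python. Polynomials are represented here as lists of coefficients from the constant term upward; this representation and the function name `coefficients_for` are assumptions made for the example, and the back substitution is hard-coded for the specific p₁, p₂, and p₃ above.

```python
from fractions import Fraction

def coefficients_for(target):
    """Try to write target as a1*p1 + a2*p2 + a3*p3.

    target lists coefficients from the constant term upward,
    e.g. x^2 - 1 is [-1, 0, 1]. Returns (a1, a2, a3) or None.
    """
    # Pad with zeros so that indices 0..2 always exist.
    c = [Fraction(x) for x in target] + [Fraction(0)] * 3
    # p1, p2, p3 have degree at most 2, so higher coefficients must vanish.
    if any(x != 0 for x in c[3:]):
        return None
    a3 = c[2]            # x^2 coefficient: a3
    a2 = c[1] - a3       # x coefficient:   a2 + a3
    a1 = c[0] - a2 - a3  # constant term:   a1 + a2 + a3
    return (a1, a2, a3)

quadratic = coefficients_for([-1, 0, 1])   # x^2 - 1
cubic = coefficients_for([-1, 0, 0, 1])    # x^3 - 1
```

For x² − 1 this returns the coefficients (−1, −1, 1) found above, and for x³ − 1 it returns `None`, reflecting the impossible equation 0 = 1.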

The linear span

Take an arbitrary field K, an arbitrary vector space V, and let v₁,...,v_n be vectors (in V). It is interesting to consider the set of all linear combinations of these vectors. This set is called the linear span (or just span) of the vectors, say S = {v₁, ..., v_n}. We write the span of S as span(S)^[5]^[6] or sp(S):

\operatorname {span} (\mathbf {v} _{1},\ldots ,\mathbf {v} _{n}):=\{a_{1}\mathbf {v} _{1}+\cdots +a_{n}\mathbf {v} _{n}:a_{1},\ldots ,a_{n}\in K\}.

Linear independence

Suppose that, for some set of vectors v₁,...,v_n, a single vector can be written in two different ways as a linear combination of them:

\mathbf {v} =\sum _{i}a_{i}\mathbf {v} _{i}=\sum _{i}b_{i}\mathbf {v} _{i}{\text{ where }}a_{i}\neq b_{i}{\text{ for at least one }}i.

This is equivalent, by subtracting these ( $c_{i}:=a_{i}-b_{i}$ ), to saying a non-trivial combination is zero:^[7]^[8]

\mathbf {0} =\sum _{i}c_{i}\mathbf {v} _{i}.

If that is possible, then v₁,...,v_n are called linearly dependent; otherwise, they are linearly independent. Similarly, we can speak of linear dependence or independence of an arbitrary set S of vectors.

If S is linearly independent and the span of S equals V, then S is a basis for V.
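For finitely many vectors in Qⁿ, both span membership and linear independence can be decided by computing a rank. The sketch below uses Gaussian elimination over exact fractions; the function names `rank`, `in_span`, and `is_linearly_independent` are illustrative, and the code is one possible approach rather than a reference implementation.

```python
from fractions import Fraction

def rank(vectors):
    """Rank of a list of equal-length vectors, by Gaussian elimination over Q."""
    rows = [[Fraction(x) for x in v] for v in vectors]
    ncols = len(rows[0]) if rows else 0
    r = 0  # number of pivots found so far
    for col in range(ncols):
        # Find a row at or below position r with a non-zero entry in this column.
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
    return r

def in_span(v, vectors):
    """True when v is a linear combination of the given vectors."""
    return rank(list(vectors) + [v]) == rank(vectors)

def is_linearly_independent(vectors):
    """True when only the trivial combination of the vectors is zero."""
    return rank(vectors) == len(vectors)

plane = [(1, 0, 0), (0, 1, 0)]
inside = in_span((2, 3, 0), plane)
outside = in_span((0, 0, 1), plane)
independent = is_linearly_independent([(1, 0, 0), (0, 1, 0), (0, 0, 1)])
dependent = is_linearly_independent([(1, 2, 3), (2, 4, 6), (0, 1, 0)])
```

In the last example the second vector is twice the first, so the three vectors are linearly dependent and the check returns `False`.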

Affine, conical, and convex combinations

By restricting the coefficients used in linear combinations, one can define the related concepts of affine combination, conical combination, and convex combination, and the associated notions of sets closed under these operations.

Type of combination	Restrictions on coefficients	Name of set	Model space
Linear combination	no restrictions	Vector subspace	$\mathbf {R} ^{n}$
Affine combination	${\textstyle \sum a_{i}=1}$	Affine subspace	Affine hyperplane
Conical combination	$a_{i}\geq 0$	Convex cone	Quadrant, octant, or orthant
Convex combination	$a_{i}\geq 0$ and ${\textstyle \sum a_{i}=1}$	Convex set	Simplex
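The restrictions in the table can be expressed as simple checks on the coefficients. The following sketch (the function name `combination_types` and the string labels are assumptions made for illustration) reports which kinds of combination a given list of rational coefficients defines.

```python
from fractions import Fraction

def combination_types(coeffs):
    """Return the set of combination types that the coefficients satisfy."""
    coeffs = [Fraction(a) for a in coeffs]
    sums_to_one = sum(coeffs) == 1
    non_negative = all(a >= 0 for a in coeffs)
    types = {"linear"}  # no restriction on the coefficients
    if sums_to_one:
        types.add("affine")
    if non_negative:
        types.add("conical")
    if sums_to_one and non_negative:
        types.add("convex")
    return types

midpoint = combination_types([Fraction(1, 2), Fraction(1, 2)])  # all four types
extrapolation = combination_types([2, -1])                      # linear and affine
scaled = combination_types([3, 4])                              # linear and conical
```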

Because these are more restricted operations, more subsets will be closed under them, so affine subsets, convex cones, and convex sets are generalizations of vector subspaces: a vector subspace is also an affine subspace, a convex cone, and a convex set, but a convex set need not be a vector subspace, an affine subspace, or a convex cone.

These concepts often arise when one can take certain linear combinations of objects, but not arbitrary ones: for example, probability distributions are closed under convex combination (they form a convex set), but not under conical, affine, or general linear combinations, and positive measures are closed under conical combination but not under affine or linear combinations – hence one defines signed measures as the linear closure.

Linear and affine combinations can be defined over any field (or ring), but conical and convex combination require a notion of "positive", and hence can only be defined over an ordered field (or ordered ring), generally the real numbers.

If one allows only scalar multiplication, not addition, one obtains a (not necessarily convex) cone; one often restricts the definition to only allowing multiplication by positive scalars.

All of these concepts are usually defined as subsets of an ambient vector space (except for affine spaces, which are also considered as "vector spaces forgetting the origin"), rather than being axiomatized independently.

Operad theory

More abstractly, in the language of operad theory, one can consider vector spaces to be algebras over the operad $\mathbf {R} ^{\infty }$ (the infinite direct sum, so only finitely many terms are non-zero; this corresponds to only taking finite sums), which parametrizes linear combinations: the vector $(2,3,-5,0,\dots )$ for instance corresponds to the linear combination $2\mathbf {v} _{1}+3\mathbf {v} _{2}-5\mathbf {v} _{3}+0\mathbf {v} _{4}+\cdots$ . Similarly, one can consider affine combinations, conical combinations, and convex combinations to correspond to the sub-operads where the terms sum to 1, the terms are all non-negative, or both, respectively. Graphically, these are the infinite affine hyperplane, the infinite hyper-octant, and the infinite simplex. This formalizes what is meant by $\mathbf {R} ^{n}$ or the standard simplex being model spaces, and such observations as that every bounded convex polytope is the image of a simplex. Here sub-operads correspond to more restricted operations and thus more general theories.

From this point of view, we can think of linear combinations as the most general sort of operation on a vector space – saying that a vector space is an algebra over the operad of linear combinations is precisely the statement that all possible algebraic operations in a vector space are linear combinations.

The basic operations of addition and scalar multiplication, together with the existence of an additive identity and additive inverses, cannot be combined in any more complicated way than the generic linear combination: the basic operations are a generating set for the operad of all linear combinations.

Ultimately, this fact lies at the heart of the usefulness of linear combinations in the study of vector spaces.

Generalizations

If V is a topological vector space, then there may be a way to make sense of certain infinite linear combinations, using the topology of V. For example, we might be able to speak of a₁v₁ + a₂v₂ + a₃v₃ + ⋯, going on forever. Such infinite linear combinations do not always make sense; we call them convergent when they do. Allowing more linear combinations in this case can also lead to a different concept of span, linear independence, and basis. The articles on the various flavors of topological vector spaces go into more detail about these.

If K is a commutative ring instead of a field, then everything that has been said above about linear combinations generalizes to this case without change. The only difference is that we call spaces like this V modules instead of vector spaces. If K is a noncommutative ring, then the concept still generalizes, with one caveat: since modules over noncommutative rings come in left and right versions, our linear combinations may also come in either of these versions, whatever is appropriate for the given module. This is simply a matter of doing scalar multiplication on the correct side.

A more complicated twist comes when V is a bimodule over two rings, K_L and K_R. In that case, the most general linear combination looks like

a_{1}\mathbf {v} _{1}b_{1}+\cdots +a_{n}\mathbf {v} _{n}b_{n}

where a₁,...,a_n belong to K_L, b₁,...,b_n belong to K_R, and v₁,…,v_n belong to V.
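As a small illustration, the ring of 2×2 integer matrices acts on itself by left and right multiplication, which makes it a bimodule over itself. The sketch below is built on that assumed example (the helper names `matmul`, `matadd`, and `two_sided_combination` are hypothetical); it evaluates a two-sided combination and shows that the side on which a scalar acts matters.

```python
def matmul(x, y):
    """Product of two 2x2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def matadd(x, y):
    """Sum of two 2x2 matrices given as nested tuples."""
    return tuple(tuple(x[i][j] + y[i][j] for j in range(2)) for i in range(2))

def two_sided_combination(left, vectors, right):
    """Return a_1 v_1 b_1 + ... + a_n v_n b_n."""
    total = ((0, 0), (0, 0))
    for a, v, b in zip(left, vectors, right):
        total = matadd(total, matmul(matmul(a, v), b))
    return total

identity = ((1, 0), (0, 1))
a = ((0, 1), (0, 0))
v = ((1, 0), (0, 2))

left_action = two_sided_combination([a], [v], [identity])   # a v
right_action = two_sided_combination([identity], [v], [a])  # v a
```

Here `left_action` is ((0, 2), (0, 0)) while `right_action` is ((0, 1), (0, 0)), so the same scalar gives different results depending on the side it multiplies from.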

Citations

[1] Strang (2016) p. 3, § 1.1
[2] Lay, Lay & McDonald (2016) p. 28, ch. 1
[3] Axler (2015) p. 28, § 2.3
[4] nLab (2015) Linear combinations.
[5] Axler (2015) pp. 29-30, §§ 2.5, 2.8
[6] Katznelson & Katznelson (2008) p. 9, § 1.2.3
[7] Axler (2015) pp. 32-33, §§ 2.17, 2.19
[8] Katznelson & Katznelson (2008) p. 14, § 1.3.2

References

Textbook

Axler, Sheldon Jay (2015). Linear Algebra Done Right. Undergraduate Texts in Mathematics (3rd ed.). Springer. doi:10.1007/978-3-319-11080-6. ISBN 978-3-319-11079-0.
Katznelson, Yitzhak; Katznelson, Yonatan R. (2008). A (Terse) Introduction to Linear Algebra. American Mathematical Society. ISBN 978-0-8218-4419-9.
Lay, David C.; Lay, Steven R.; McDonald, Judi J. (2016). Linear Algebra and its Applications (5th ed.). Pearson. ISBN 978-0-321-98238-4.
Strang, Gilbert (2016). Introduction to Linear Algebra (5th ed.). Wellesley Cambridge Press. ISBN 978-0-9802327-7-6.

Web

"Linear Combinations". nLab. 27 October 2015. Retrieved 16 Feb 2021.

External links

Linear Combinations and Span: Understanding linear combinations and spans of vectors, khanacademy.org.

