The term is not very clear, as it could either mean:
- a real number function whose graph is a line, i.e.: hyperplane:
- a linear map. Note that the above linear functions are not linear maps unless (known as the homogeneous case), because e.g.:
A linear map is a function where and are two vector spaces over underlying fields such that:
A common case is , and .
One thing that makes such functions particularly simple is that they can be fully specified by specifyin how they act on all possible combinations of input basis vectors: they are therefore specified by only a finite number of elements of .
Every linear map in finite dimension can be represented by a matrix, the points of the domain being represented as vectors.
As such, when we say "linear map", we can think of a generalization of matrix multiplication that makes sense in infinite dimensional spaces like Hilbert spaces, since calling such infinite dimensional maps "matrices" is stretching it a bit, since we would need to specify infinitely many rows and columns.
The prototypical building block of infinite dimensional linear map is the derivative. In that case, the vectors being operated upon are functions, which cannot therefore be specified by a finite number of parameters, e.g.
For example, the left side of the time-independent Schrödinger equation is a linear map. And the time-independent Schrödinger equation can be seen as a eigenvalue problem.
A form is a function from a vector space to elements of the underlying field of the vector space.
A Linear map where the image is the underlying field of the vector space, e.g. .
The set of all linear forms over a vector space forms another vector space called the dual space.
For the typical case of a linear form over , the form can be seen just as a row vector with n elements, the full form being specified by the value of each of the basis vectors.
The dual space of a vector space , sometimes denoted , is the vector space of all linear forms over with the obvious addition and scalar multiplication operations defined.
Since a linear form is completely determined by how it acts on a bases, and since for each basis element it is specified by a scalar, at least in finite dimension, the dimension of the dual space is the same as the , and so they are isomorphic because all vector spaces of the same dimension on a given field are isomorphic, and so the dual is quite a boring concept in the context of finite dimension.
Infinite dimension seems more interesting however, see: en.wikipedia.org/w/index.php?title=Dual_space&oldid=1046421278#Infinite-dimensional_case
One place where duals are different from the non-duals however is when dealing with tensors, because they transform differently than vectors from the base space .
Dual vectors are the members of a dual space.
In the context of tensors , we use raised indices to refer to members of the dual basis vs the underlying basis: are defined to "pick the corresponding coordinate" out of elements of V. E.g.: Kronecker delta as:
Note that in Einstein notation, the components of a dual vector have lower indices. This works well with the upper case indices of the dual vectors, allowing us to write a dual vector as:
In the context of quantum mechanics, the bra notation is also used for dual vectors.
We define it as a linear map where the domain is the same as the image, i.e. an endofunction.
- a 2x2 matrix can represent a linear map from to , so which is a linear operator
- the derivative is a linear map from to , so which is also a linear operator
Given a linear operator over a space that has a inner product defined, we define the adjoint operator (the symbol is called "dagger") as the unique operator that satisfies:
Linear map of two variables.
More formally, given 3 vector spaces X, Y, Z over a single field, a bilinear map is a function from:
The most important example by far is the dot product from , which is more specifically also a symmetric bilinear form.
Analogous to a linear form, a bilinear form is a Bilinear map where the image is the underlying field of the vector space, e.g. .
Some definitions require both of the input spaces to be the same, e.g. , but it doesn't make much different in general.
The most important example of a bilinear form is the dot product. It is only defined if both the input spaces are the same.
As usual, it is useful to think about how a bilinear form looks like in terms of vectors and matrices.
Unlike a linear form, which was a vector, because it has two inputs, the bilinear form is represented by a matrix which encodes the value for each possible pair of basis vectors.
If is the change of basis matrix, then the matrix representation of a bilinear form that looked like: Sylvester's law of inertia then tells us that the number of positive, negative and 0 eigenvalues of both of those matrices is the same.
Proof: the value of a given bilinear form cannot change due to a change of bases, since the bilinear form is just a function, and does not depend on the choice of basis. The only thing that change is the matrix representation of the form. Therefore, we must have:
Analogous to a linear form, a multilinear form is a Multilinear map where the image is the underlying field of the vector space, e.g. .
Subcase of symmetric multilinear map:
Requires the two inputs and to be in the same vector space of course.
The most important example is the dot product, which is also a positive definite symmetric bilinear form.
symmetric bilinear maps that is also a bilinear form.
Like the matrix representation of a bilinear form, it is a matrix, but now the matrix has to be a symmetric matrix.
The complex number analogue of a symmetric bilinear form.
The prototypical example of it is the complex dot product.
Note that this form is neither strictly symmetric, it satisfies: complex conjugate, nor is it linear for complex scalar multiplication on the second argument.
Multivariate polynomial where each term has degree 2, e.g.:
but e.g.: has degree 3.
More generally for any number of variables it can be written as:
There is a 1-to-1 relationship between quadratic forms and symmetric bilinear forms. In matrix representation, this can be written as: contains each of the variabes of the form, e.g. for 2 variables:
Strictly speaking, the associated bilinear form would not need to be a symmetric bilinear form, at least for the real numbers or complex numbers which are commutative. E.g.:
Symmetric bilinear form that is also positive definite, i.e.:
A positive definite matrix that is also a symmetric matrix.
Subcase of antisymmetric multilinear map:
Skew-symmetric bilinear map that is also a bilinear form.
Same value if you swap any input arguments.
Change sign if you swap two input values.
Implies antisymmetric multilinear map.
The definition of the "dot product" of a general space varies quite a lot with different contexts.
Most definitions tend to be bilinear forms.
We use the unqualified generally refers to the dot product of Real coordinate spaces, which is a positive definite symmetric bilinear form. Other important examples include:
The rest of this section is about the case.
- the complex dot product, which is not strictly symmetric nor linear, but it is positive definite
- Minkowski inner product, sometimes called" "Minkowski dot product is not positive definite
The positive definite part of the definition likely comes in because we are so familiar with metric spaces, which requires a positive norm in the norm induced by an inner product.
The default Euclidean space definition, we use the matrix representation of a symmetric bilinear form as the identity matrix, e.g. in :
Denoted by the Greek letter epsilon with
\varepsilonencoding in LaTeX.
- odd permutation: -1
- even permutation: 1
- not a permutation: 0. This happens iff two more more indices are repeated
An Introduction to Tensors and Group Theory for Physicists by Nadir Jeevanjee (2011) shows that this is a tensor that represents the volume of a parallelepiped.
It takes as input three vectors, and outputs one real number, the volume. And it is linear on each vector. This perfectly satisfied the definition of a tensor of order (3,0).
Given a basis and a function that return the volume of a parallelepiped given by three vectors , .
Name origin: likely because it "determines" if a matrix is invertible or not, as a matrix is invertible iff determinant is not zero.
When it exists, which is not for all matrices, only invertible matrix, the inverse is denoted:
The set of all invertible matrices forms a group: the general linear group with matrix multiplication. Non-invertible matrices don't form a group due to the lack of inverse.
When it distributes it inverts the order of the matrix multiplication:
The transpose and matrix inverse commute:
Since a matrix can be seen as a linear map , the product of two matrices can be seen as the composition of two linear maps:
Set of eigenvalues of a linear operator.
Unlike the simple case of a matrix, in infinite dimensional vector spaces, the spectrum may be continuous.
The quintessential example of that is the spectrum of the position operator in quantum mechanics, in which any real number is a possible eigenvalue, since the particle may be found in any position. The associated eigenvectors are the corresponding Dirac delta functions.
Every invertible matrix can be written as:
Note therefore that this decomposition is unique up to swapping the order of eigenvectors. We could fix a canonical form by sorting eigenvectors from smallest to largest in the case of a real number.
- is a diagonal matrix containing the eigenvalues of
- columns of are eigenvectors of
Intuitively, Note that this is just the change of bases formula, and so:
- changes basis to align to the eigenvectors
- multiplies eigenvectors simply by eigenvalues
- changes back to the original basis
The general result from eigendecomposition of a matrix: is an orthogonal matrix, and therefore has .
The main interest of this theorem is in classifying the indefinite orthogonal groups, which in turn is fundamental because the Lorentz group is an indefinite orthogonal groups, see: all indefinite orthogonal groups of matrices of equal metric signature are isomorphic.
It also tells us that a change of bases does not the alter the metric signature of a bilinear form, see matrix congruence can be seen as the change of basis of a bilinear form.
The theorem states that the number of 0, 1 and -1 in the metric signature is the same for two symmetric matrices that are congruent matrices.
For example, consider:
The eigenvalues of are and , and the associated eigenvectors are: symPy code:and from the eigendecomposition of a real symmetric matrix we know that:
A = Matrix([[2, sqrt(2)], [sqrt(2), 3]]) A.eigenvects()
Now, instead of , we could use , where is an arbitrary diagonal matrix of type: : by any positive number and . Since we are multiplying by two arbitrary positive numbers, we cannot change the signs of the original eigenvalues, and so the metric signature is maintained, but respecting that any value can be reached.
Note that the matrix congruence relation looks a bit like the eigendecomposition of a matrix: does not have to contain eigenvalues, unlike the eigendecomposition of a matrix. This is because here is not fixed to having eigenvectors in its columns.
But because the matrix is symmetric however, we could always choose to actually diagonalize as mentioned at eigendecomposition of a real symmetric matrix. Therefore, the metric signature can be seen directly from eigenvalues.
Also, because is a diagonal matrix, and thus symmetric, it must be that:
What this does represent, is a general change of bases that maintains the matrix a symmetric matrix.
Two symmetric matrices and are defined to be congruent if there exists an in such that:
From effect of a change of basis on the matrix of a bilinear form, remember that a change of basis modifies the matrix representation of a bilinear form as:
So, by taking , we understand that two matrices being congruent means that they can both correspond to the same bilinear form in different bases.
This is the possibly infinite dimensional version of a Hermitian matrix, since linear operators are the possibly infinite dimensional version of matrices.
There's a catch though: now we don't have explicit matrix indices here however in general, the generalized definition is shown at: en.wikipedia.org/w/index.php?title=Hermitian_adjoint&oldid=1032475701#Definition_for_bounded_operators_between_Hilbert_spaces
The matrix ring of degree n is the set of all n-by-n square matrices together with the usual vector space and matrix multiplication operations.
This set forms a ring.
Members of the orthogonal group.
Complex analogue of orthogonal matrix.
- in quantum computers programming basically comes down to creating one big unitary matrix as explained at: quantum computing is just matrix multiplication
Can represent a symmetric bilinear form as shown at matrix representation of a symmetric bilinear form, or a quadratic form.
The definition implies that this is also a symmetric matrix.
The dot product is a positive definite matrix, and so we see that those will have an important link to familiar geometry.
WTF is a skew? "Antisymmetric" is just such a better name! And it also appears in other definitions such as antisymmetric multilinear map.
- : matrix in the old basis
- : matrix in the new basis
- : change of basis matrix
The change of basis matrix is the matrix that allows us to express the new basis in an old basis:
The usual question then is: given a vector in the new basis, how do we represent it in the old basis?
The answer is that we simply have to calculate the matrix inverse of :
That is the matrix inverse.
When we have a symmetric matrix, a change of bases keeps symmetry iff it is done by an orthogonal matrix, in which case:
Every vector space is defined over a field.
E.g. in , the underlying field is , the real numbers. And in the underlying field is , the complex numbers.
Any field can be used, including finite field. But the underlying thing has to be a field, because the definitions of a vector need all field properties to hold to make sense.
Elements of the underlying field of a vector space are known as scalar.
A member of the underlying field of a vector space. E.g. in , the underlying field is , and a scalar is a member of , i.e. a real number.
A multilinear form with a domain that looks like: is the dual space.
Because a tensor is a multilinear form, it can be fully specified by how it act on all combinations of basis sets, which can be done in terms of components. We refer to each component as: dual vector.
A linear map can be seen as a (1,1) tensor because: . is a dual vector, and W is a vector. Furthermoe, is linear in both and . All of this makes fullfill the definition of a (1,1) tensor.
The Wikipedia page of this article is basically a masterclass why Wikipedia is useless for learning technical subjects. They are not even able to teach such a simple subject properly there!
- www.maths.cam.ac.uk/postgrad/part-iii/files/misc/index-notation.pdf gives a definition that does not consider upper and lower indexes, it only counts how many times the indices appearTheir definition of the Laplacian is a bit wrong as only one appears in it, they likely meant to have written instead of , related:
TODO what is the point of them? Why not just sum over every index that appears twice, regardless of where it is, as mentioned at: www.maths.cam.ac.uk/postgrad/part-iii/files/misc/index-notation.pdf.
Vectors with the index on top such as are the "regular vectors", they are called covariant vectors.
Those in indices on bottom are called contravariant vectors.
It is possible to change between them by Raising and lowering indices.
The values are different only when the metric signature matrix is different from the identity matrix.
Then a specific metric is involved, sometimes we want to automatically add it to products.
E.g., in a context considering the common Minkowski inner product matrix where the 4x4 matrix and is a vector in
The Einstein summation convention works will with partial derivative, and this case is widely used in particle physics.
Partial index partial derivative notation is the partial derivative notation commonly used in this context, as we want to do operations by index rather than by labels such as , , .
This notation also allows us to have raised and lowered indices on the partial derivative symbol TODO how are they different?
Given a vector function of three variables: as with a raised index.
Then, the divergence can be written in Einstein notation as:
Given the function : Planck units as: Laplacian in Einstein notation, since the D'alembert operator is just a generalization of the laplace operator to Minkowski space.
The Klein-Gordon equation can be written in terms of the D'alembert operator as: D'alembert operator in Einstein notation to:
Written in MathBook XML.