Ciro Santilli
The term "linear function" is not very clear, as it could mean either:
  • a real number function whose graph is a line, i.e.:
    $$f(x) = ax + b$$
    or for higher dimensions, a hyperplane:
    $$f(x_1, x_2, \ldots, x_n) = c_1 x_1 + c_2 x_2 + \cdots + c_n x_n + b$$
  • a linear map. Note that the above linear functions are not linear maps unless $b = 0$ (known as the homogeneous case), because e.g.:
    $$f(x + y) = ax + ay + b$$
    but
    $$f(x) + f(y) = ax + b + ay + b$$
    For this reason, it is better never to refer to linear maps as linear functions.
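The difference between the two meanings can be checked numerically. The following is a minimal sketch, assuming the sample coefficients `a = 2`, `b = 1` and the helper names `affine`, `homogeneous` and `is_additive`, none of which come from the text above:

```python
def affine(x, a=2.0, b=1.0):
    """A "linear function" in the graph-is-a-line sense: f(x) = a*x + b."""
    return a * x + b


def homogeneous(x, a=2.0):
    """The b = 0 case, which is a genuine linear map."""
    return a * x


def is_additive(f, x, y):
    """Check f(x + y) == f(x) + f(y) for one pair of sample inputs."""
    return f(x + y) == f(x) + f(y)


x, y = 3.0, 5.0
print(is_additive(affine, x, y))       # False: the constant b is counted twice
print(is_additive(homogeneous, x, y))  # True
```

A single failing pair is enough to show that the affine function is not a linear map, while a passing pair is of course only a sanity check, not a proof.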
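A linear map is a function $f : V_1(F) \to V_2(F)$ where $V_1(F)$ and $V_2(F)$ are two vector spaces over the same underlying field $F$ such that:
$$\forall v_1, v_2 \in V_1, \ c_1, c_2 \in F: \quad f(c_1 v_1 + c_2 v_2) = c_1 f(v_1) + c_2 f(v_2)$$
A common case is $F = \mathbb{R}$, $V_1 = \mathbb{R}^m$ and $V_2 = \mathbb{R}^n$.
One thing that makes such functions particularly simple is that they can be fully specified by specifying how they act on each input basis vector: in finite dimension, they are therefore specified by only a finite number of elements of $F$.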
Every linear map in finite dimension can be represented by a matrix, the points of the domain being represented as vectors.
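As a small illustration of how the images of the basis vectors determine the whole map, here is a sketch that builds the matrix column by column. The map from $\mathbb{R}^2$ to $\mathbb{R}^3$ and the helper names are hypothetical, chosen only for the example:

```python
def matrix_from_basis_images(images):
    """Build the matrix whose columns are the images of the standard basis vectors.

    images[j] is f(e_j), given as a list of m numbers.
    """
    m = len(images[0])
    return [[image[i] for image in images] for i in range(m)]


def apply_matrix(matrix, v):
    """Matrix-vector product."""
    return [sum(a * x for a, x in zip(row, v)) for row in matrix]


# Hypothetical map f: R^2 -> R^3, specified only on e_1 and e_2.
f_e1 = [1, 0, 2]
f_e2 = [0, 3, -1]
M = matrix_from_basis_images([f_e1, f_e2])

# By linearity, f(4*e_1 + 5*e_2) must equal 4*f(e_1) + 5*f(e_2).
v = [4, 5]
by_linearity = [4 * a + 5 * b for a, b in zip(f_e1, f_e2)]
print(apply_matrix(M, v) == by_linearity)  # True
```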
As such, when we say "linear map", we can think of a generalization of matrix multiplication that still makes sense in infinite dimensional spaces like Hilbert spaces. Calling such infinite dimensional maps "matrices" is stretching it a bit, since we would need to specify infinitely many rows and columns.
The prototypical building block of infinite dimensional linear maps is the derivative. In that case, the vectors being operated upon are functions, which therefore cannot be specified by a finite number of parameters.
For example, the left side of the time-independent Schrödinger equation is a linear map. And the time-independent Schrödinger equation can be seen as an eigenvalue problem.
A form is a function from a vector space to elements of the underlying field of the vector space.
A linear form is a linear map where the image is the underlying field of the vector space, e.g. $\mathbb{R}^n \to \mathbb{R}$.
The set of all linear forms over a vector space forms another vector space called the dual space.
For the typical case of a linear form over $\mathbb{R}^n$, the form can be seen just as a row vector with n elements, the full form being specified by its value on each of the basis vectors.
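The row vector point of view can be sketched as follows. The row `[2, -1, 4]` and the helper name `linear_form` are assumptions made for the example:

```python
def linear_form(row):
    """Return the linear form v -> row . v determined by its values on the basis."""
    def form(v):
        return sum(r * x for r, x in zip(row, v))
    return form


phi = linear_form([2, -1, 4])
basis = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]

# Evaluating the form on each basis vector recovers the row vector itself.
values_on_basis = [phi(e) for e in basis]
print(values_on_basis)   # [2, -1, 4]
print(phi([1, 2, 3]))    # 12
```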
The dual space of a vector space $V$, sometimes denoted $V^*$, is the vector space of all linear forms over $V$ with the obvious addition and scalar multiplication operations defined.
Since a linear form is completely determined by how it acts on a basis, and since for each basis element it is specified by a scalar, at least in finite dimension, the dimension of the dual space is the same as that of $V$. The two are therefore isomorphic, because all vector spaces of the same dimension over a given field are isomorphic, and so the dual is quite a boring concept in the context of finite dimension.
One place where duals are different from the non-duals however is when dealing with tensors, because they transform differently than vectors from the base space $V$.
Dual vectors are the members of a dual space.
In the context of tensors, we use raised indices to refer to members of the dual basis vs the underlying basis:
$$e_1, e_2, e_3 \in V \qquad e^1, e^2, e^3 \in V^*$$
The dual basis vectors $e^i$ are defined to "pick the corresponding coordinate" out of elements of V. E.g.:
$$e^1(v^1 e_1 + v^2 e_2 + v^3 e_3) = v^1$$
By expanding into the basis, we can put this more succinctly with the Kronecker delta as:
$$e^i(e_j) = \delta^i_j$$
Note that in Einstein notation, the components of a dual vector have lower indices. This works well with the upper indices of the dual basis vectors, allowing us to write a dual vector $f$ as:
$$f = f_i e^i$$
In the context of quantum mechanics, the bra notation is also used for dual vectors.
We define a linear operator as a linear map whose domain is the same as its codomain, i.e. an endofunction.
Examples:
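Given a linear operator $A$ over a space $S$ that has an inner product $\langle \cdot, \cdot \rangle$ defined, we define the adjoint operator $A^\dagger$ (the $\dagger$ symbol is called "dagger") as the unique operator that satisfies:
$$\forall v, w \in S: \quad \langle A v, w \rangle = \langle v, A^\dagger w \rangle$$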
Linear map of two variables.
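More formally, given 3 vector spaces X, Y, Z over a single field, a bilinear map is a function from:
$$f : X \times Y \to Z$$
that is linear in each of its two arguments from X and Y, i.e.:
$$f(a_1 x_1 + a_2 x_2, y) = a_1 f(x_1, y) + a_2 f(x_2, y)$$
$$f(x, b_1 y_1 + b_2 y_2) = b_1 f(x, y_1) + b_2 f(x, y_2)$$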
Note that the definition only makes sense if all three vector spaces are over the same field, because linearity can mix up each of them.
The most important example by far is the dot product from $\mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}$, which is more specifically also a symmetric bilinear form.
Analogous to a linear form, a bilinear form is a bilinear map where the image is the underlying field of the vector space, e.g. $\mathbb{R}^n \times \mathbb{R}^m \to \mathbb{R}$.
Some definitions require both of the input spaces to be the same, e.g. $\mathbb{R}^n \times \mathbb{R}^n \to \mathbb{R}$, but it doesn't make much difference in general.
The most important example of a bilinear form is the dot product. It is only defined if both the input spaces are the same.
As usual, it is useful to think about what a bilinear form looks like in terms of vectors and matrices.
Unlike a linear form, which was a vector, because it has two inputs, the bilinear form is represented by a matrix $M$ which encodes the value for each possible pair of basis vectors.
In terms of that matrix, the form $B(x, y)$ is then given by:
$$B(x, y) = x^T M y$$
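A minimal sketch of this matrix representation, assuming the usual formula $B(x, y) = x^T M y$ and a hypothetical 2x2 matrix:

```python
def bilinear_form(M, x, y):
    """Evaluate B(x, y) = x^T M y for a matrix M given as a list of rows."""
    return sum(
        x[i] * M[i][j] * y[j]
        for i in range(len(x))
        for j in range(len(y))
    )


M = [[1, 2], [3, 4]]
e = [[1, 0], [0, 1]]

# Entry M[i][j] is exactly the value of the form on the pair (e_i, e_j).
table = [[bilinear_form(M, e[i], e[j]) for j in range(2)] for i in range(2)]
print(table == M)  # True

# Linearity in the first argument: B(e_1 + e_2, y) = B(e_1, y) + B(e_2, y).
y = [2, 5]
print(bilinear_form(M, [1, 1], y))  # 38
```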
If $C$ is the change of basis matrix, and the matrix representation of a bilinear form $B$ in the old basis is $M$:
$$B(x, y) = x^T M y$$
then the matrix in the new basis is:
$$C^T M C$$
Sylvester's law of inertia then tells us that the number of positive, negative and 0 eigenvalues of both of those matrices is the same.
Proof: the value of a given bilinear form cannot change due to a change of bases, since the bilinear form is just a function, and does not depend on the choice of basis. The only thing that changes is the matrix representation of the form. Therefore, we must have:
$$B(x, y) = x^T M y$$
and in the new basis:
$$B(x, y) = x_{new}^T M_{new} \, y_{new}$$
and so since:
$$x = C x_{new} \qquad y = C y_{new}$$
we get:
$$x^T M y = (C x_{new})^T M (C y_{new}) = x_{new}^T (C^T M C) \, y_{new} \implies M_{new} = C^T M C$$
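The invariance argument can be checked on a small example. This is only a sketch, assuming the convention $x_{old} = C x_{new}$, so that the new matrix is $C^T M C$. The matrices `M` and `C` and the helper names are hypothetical:

```python
def transpose(A):
    return [list(col) for col in zip(*A)]


def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def matvec(A, v):
    return [sum(a * x for a, x in zip(row, v)) for row in A]


def bilinear_form(M, x, y):
    """B(x, y) = x^T M y."""
    return sum(xi * myi for xi, myi in zip(x, matvec(M, y)))


M = [[2, 1], [1, 3]]   # matrix of the form in the old basis
C = [[1, 2], [0, 1]]   # hypothetical change of basis, x_old = C x_new
M_new = matmul(matmul(transpose(C), M), C)

x_new, y_new = [1, -1], [2, 5]
x_old, y_old = matvec(C, x_new), matvec(C, y_new)

# Same function value, computed with two different matrix representations.
print(bilinear_form(M, x_old, y_old))      # -56
print(bilinear_form(M_new, x_new, y_new))  # -56
```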
See form.
Analogous to a linear form, a multilinear form is a multilinear map where the image is the underlying field of the vector space, e.g. $\mathbb{R}^{n_1} \times \mathbb{R}^{n_2} \times \mathbb{R}^{n_3} \to \mathbb{R}$.
Subcase of symmetric multilinear map:
$$f(x, y) = f(y, x)$$
Requires the two inputs $x$ and $y$ to be in the same vector space of course.
The most important example is the dot product, which is also a positive definite symmetric bilinear form.
Like the matrix representation of a bilinear form, it is a matrix, but now the matrix has to be a symmetric matrix.
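We can then immediately see that if the matrix is symmetric, then so is the form. We have:
$$f(x, y) = x^T M y$$
But because $f(x, y)$ is a scalar, we have:
$$f(x, y) = (x^T M y)^T = y^T M^T x$$
and:
$$f(y, x) = y^T M x = y^T M^T x = f(x, y)$$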
The prototypical example of it is the complex dot product.
Note that this form is neither strictly symmetric, since it instead satisfies:
$$\langle x, y \rangle = \overline{\langle y, x \rangle}$$
where the over bar indicates the complex conjugate, nor is it linear for complex scalar multiplication on the second argument.
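Both properties can be seen on a small example with Python's built-in complex numbers. This is a sketch that assumes the convention in which the second argument is conjugated. The vectors and the scalar `c` are arbitrary sample values:

```python
def complex_dot(x, y):
    """Complex dot product, conjugating the second argument (one common convention)."""
    return sum(a * b.conjugate() for a, b in zip(x, y))


x = [1 + 2j, 3j]
y = [2 - 1j, 1 + 1j]

# Conjugate symmetry instead of plain symmetry.
print(complex_dot(x, y))  # (3+8j)
print(complex_dot(y, x))  # (3-8j)

# A scalar on the second argument comes out conjugated, so it is not linear there.
c = 2j
scaled_y = [c * b for b in y]
print(complex_dot(x, scaled_y) == c.conjugate() * complex_dot(x, y))  # True
```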
Multivariate polynomial where each term has degree 2, e.g.:
$$f(x, y) = 2y^2 + 10yx + x^2$$
is a quadratic form because each term has degree 2:
$$y^2, \quad yx, \quad x^2$$
but e.g.:
$$f(x, y) = 2y^2 + 10yx + x^3$$
is not because the term $x^3$ has degree 3.
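There is a 1-to-1 relationship between quadratic forms and symmetric bilinear forms. In matrix representation, this can be written as:
$$\vec{x}^T B \vec{x}$$
where $\vec{x}$ contains each of the variables of the form, e.g. for 2 variables:
$$\vec{x} = \begin{bmatrix} x \\ y \end{bmatrix}$$
Strictly speaking, the associated bilinear form would not need to be a symmetric bilinear form, at least for the real numbers or complex numbers which are commutative. E.g.:
$$\begin{bmatrix} x & y \end{bmatrix} \begin{bmatrix} 1 & 10 \\ 0 & 2 \end{bmatrix} \begin{bmatrix} x \\ y \end{bmatrix} = x^2 + 10xy + 2y^2$$
But that same form could also be written with a symmetric matrix as:
$$\begin{bmatrix} x & y \end{bmatrix} \begin{bmatrix} 1 & 5 \\ 5 & 2 \end{bmatrix} \begin{bmatrix} x \\ y \end{bmatrix} = x^2 + 10xy + 2y^2$$
so why not I guess, it's simpler/more restricted.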
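To make the symmetric vs non-symmetric point concrete, here is a sketch for the sample form $x^2 + 10xy + 2y^2$. The matrices and helper names are assumptions for the example: averaging a matrix with its transpose gives a symmetric matrix that defines the same quadratic form.

```python
from fractions import Fraction


def quadratic_form(M, v):
    """Evaluate v^T M v."""
    n = len(v)
    return sum(v[i] * M[i][j] * v[j] for i in range(n) for j in range(n))


def symmetrize(M):
    """Return (M + M^T) / 2 using exact rational arithmetic."""
    n = len(M)
    return [[Fraction(M[i][j] + M[j][i], 2) for j in range(n)] for i in range(n)]


N = [[1, 10], [0, 2]]   # non-symmetric matrix for x^2 + 10xy + 2y^2
S = symmetrize(N)       # symmetric matrix for the same form

samples = [(3, -2), (1, 1), (0, 4)]
print(all(quadratic_form(N, v) == quadratic_form(S, v) for v in samples))  # True
```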
Symmetric bilinear form that is also positive definite, i.e.:
$$\forall x \neq 0: \quad B(x, x) > 0$$
Subcase of antisymmetric multilinear map:
$$f(x, y) = -f(y, x)$$
Same value if you swap any input arguments.
Change sign if you swap two input values.
The definition of the "dot product" of a general space varies quite a lot with different contexts.
Most definitions tend to be bilinear forms.
The unqualified term generally refers to the dot product of real coordinate spaces, which is a positive definite symmetric bilinear form. Other important examples include:
The rest of this section is about the $\mathbb{R}^n$ case.
The positive definite part of the definition likely comes in because we are so familiar with metric spaces, which require a positive norm, namely the norm induced by an inner product.
For the default Euclidean space definition, we use the identity matrix as the matrix representation of a symmetric bilinear form, e.g. in $\mathbb{R}^3$:
$$M = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}$$
so that:
$$x \cdot y = x^T M y = x_1 y_1 + x_2 y_2 + x_3 y_3$$
Denoted by the Greek letter epsilon with \varepsilon encoding in LaTeX.
Definition:
It takes as input three vectors, and outputs one real number, the volume. And it is linear on each vector. This perfectly satisfies the definition of a tensor of order (3,0).
Given a basis $(e_1, e_2, e_3)$ and a function $f(v_1, v_2, v_3)$ that returns the volume of the parallelepiped given by three vectors, $\varepsilon_{ijk} = f(e_i, e_j, e_k)$.
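As a sketch of the volume interpretation, assuming the symbol in question is the three dimensional Levi-Civita symbol $\varepsilon_{ijk}$ with indices running over 0, 1, 2 (the closed-form expression and the helper names below are choices made for this example), contracting it with three vectors gives the signed volume of the parallelepiped they span:

```python
def levi_civita(i, j, k):
    """Levi-Civita symbol for indices taken from {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2


def signed_volume(u, v, w):
    """Sum of epsilon_ijk * u_i * v_j * w_k over all index triples."""
    return sum(
        levi_civita(i, j, k) * u[i] * v[j] * w[k]
        for i in range(3)
        for j in range(3)
        for k in range(3)
    )


e = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
unit_cube = signed_volume(e[0], e[1], e[2])
swapped = signed_volume(e[1], e[0], e[2])           # swapping two inputs flips the sign
box = signed_volume([1, 0, 0], [0, 2, 0], [0, 0, 3])
print(unit_cube, swapped, box)  # 1 -1 6
```

The sign flip under a swap is why the result is a signed volume: the usual non-negative volume is its absolute value.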
Name origin: likely because it "determines" if a matrix is invertible or not, as a matrix is invertible iff determinant is not zero.
When it exists, which is only the case for invertible matrices and not for all matrices, the inverse of a matrix $M$ is denoted:
$$M^{-1}$$
The set of all invertible matrices forms a group: the general linear group with matrix multiplication. Non-invertible matrices don't form a group due to the lack of inverse.
When it distributes over a product, it inverts the order of the matrix multiplication:
$$(MN)^{-1} = N^{-1} M^{-1}$$
The transpose and matrix inverse commute:
$$(M^T)^{-1} = (M^{-1})^T$$
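Both identities, the order reversal $(MN)^{-1} = N^{-1} M^{-1}$ and the commutation of transpose and inverse, can be verified exactly on 2x2 examples. This is a sketch using rational arithmetic. The matrices and the helper `inverse_2x2` are hypothetical:

```python
from fractions import Fraction


def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def transpose(A):
    return [list(col) for col in zip(*A)]


def inverse_2x2(A):
    """Exact inverse of a 2x2 matrix, which exists iff the determinant is not zero."""
    (a, b), (c, d) = A
    det = a * d - b * c
    if det == 0:
        raise ValueError("matrix is not invertible")
    return [[Fraction(d, det), Fraction(-b, det)],
            [Fraction(-c, det), Fraction(a, det)]]


M = [[1, 2], [3, 4]]
N = [[0, 1], [1, 1]]

# The inverse of a product reverses the order of the factors.
print(inverse_2x2(matmul(M, N)) == matmul(inverse_2x2(N), inverse_2x2(M)))  # True
# Transpose and inverse commute.
print(inverse_2x2(transpose(M)) == transpose(inverse_2x2(M)))               # True
```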
Since a matrix $M$ can be seen as a linear map $f_M(x)$, the product of two matrices $MN$ can be seen as the composition of two linear maps:
$$f_M(f_N(x))$$
One cool thing about linear maps is that we can easily pre-calculate this product only once to obtain a new matrix, and so we don't have to do both multiplications separately each time.
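The pre-calculation idea can be sketched as follows, with hypothetical sample matrices: the product is computed once and then reused for every input vector.

```python
def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def matvec(A, v):
    return [sum(a * x for a, x in zip(row, v)) for row in A]


M = [[1, 2], [3, 4]]
N = [[0, 1], [1, 1]]
MN = matmul(M, N)  # computed only once

for v in ([5, -2], [1, 0], [7, 7]):
    # One multiplication by MN gives the same result as applying N, then M.
    assert matvec(MN, v) == matvec(M, matvec(N, v))

print(matvec(MN, [5, -2]))  # [4, 6]
```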
Unlike the simple case of a matrix, in infinite dimensional vector spaces, the spectrum may be continuous.
The quintessential example of that is the spectrum of the position operator in quantum mechanics, in which any real number is a possible eigenvalue, since the particle may be found in any position. The associated eigenvectors are the corresponding Dirac delta functions.
Every diagonalizable matrix $M$ can be written as:
$$M = Q D Q^{-1}$$
where:
  • $D$ is a diagonal matrix containing the eigenvalues of $M$
  • the columns of $Q$ are the corresponding eigenvectors of $M$
Note therefore that this decomposition is unique up to swapping the order of eigenvectors (and rescaling them). We could fix a canonical form by sorting eigenvalues from smallest to largest in the case of real numbers.
Intuitively, note that this is just the change of bases formula, and so:
  • $Q^{-1}$ changes basis to align to the eigenvectors
  • $D$ multiplies eigenvectors simply by eigenvalues
  • $Q$ changes back to the original basis
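The three steps above can be followed on a concrete vector. This is a sketch that assumes the decomposition has the form $M = Q D Q^{-1}$ and uses a hypothetical 2x2 matrix whose eigenvalues 1 and 3 and eigenvectors $(1, -1)$ and $(1, 1)$ were worked out by hand:

```python
from fractions import Fraction


def matvec(A, v):
    return [sum(a * x for a, x in zip(row, v)) for row in A]


A = [[2, 1], [1, 2]]
Q = [[1, 1], [-1, 1]]          # columns are the eigenvectors (1, -1) and (1, 1)
D = [[1, 0], [0, 3]]           # matching eigenvalues on the diagonal
half = Fraction(1, 2)
Q_inv = [[half, -half], [half, half]]

v = [3, 1]
in_eigenbasis = matvec(Q_inv, v)   # step 1: coordinates along the eigenvectors
scaled = matvec(D, in_eigenbasis)  # step 2: multiply each coordinate by its eigenvalue
back = matvec(Q, scaled)           # step 3: return to the original basis

print(back == matvec(A, v))  # True
```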
The general result from eigendecomposition of a matrix:
$$M = Q D Q^{-1}$$
becomes:
$$M = O D O^T$$
where $O$ is an orthogonal matrix, and therefore has $O^{-1} = O^T$.
The theorem states that the number of 0, 1 and -1 in the metric signature is the same for two symmetric matrices that are congruent matrices.
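For example, consider:
$$A = \begin{bmatrix} 2 & \sqrt{2} \\ \sqrt{2} & 3 \end{bmatrix}$$
The eigenvalues of $A$ are $1$ and $4$, and the associated eigenvectors are:
$$v_1 = \begin{bmatrix} -\sqrt{2} \\ 1 \end{bmatrix} \qquad v_4 = \begin{bmatrix} \sqrt{2}/2 \\ 1 \end{bmatrix}$$
SymPy code:
from sympy import Matrix, sqrt
A = Matrix([[2, sqrt(2)], [sqrt(2), 3]])
A.eigenvects()
and from the eigendecomposition of a real symmetric matrix we know that:
$$A = P D P^T \qquad D = \begin{bmatrix} 1 & 0 \\ 0 & 4 \end{bmatrix}$$
where the columns of $P$ are the normalized eigenvectors.
Now, instead of $P$, we could use $PE$, where $E$ is an arbitrary invertible diagonal matrix of type:
$$E = \begin{bmatrix} e_1 & 0 \\ 0 & e_2 \end{bmatrix}$$
With this, $P D P^T$ would reach a new matrix $B$:
$$B = (PE) D (PE)^T = P (E D E) P^T$$
Therefore, with this congruence, we are able to multiply the eigenvalues of $A$ by any positive numbers $e_1^2$ and $e_2^2$. Since we are multiplying by two arbitrary positive numbers, we cannot change the signs of the original eigenvalues, and so the metric signature is maintained, while any magnitude can be reached.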
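The signature preservation can also be checked numerically. The sketch below assumes a congruence of the form $B = S A S^T$ with an arbitrary invertible `S` picked for the example, and uses the closed-form eigenvalues of a symmetric 2x2 matrix rather than a linear algebra library:

```python
import math


def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


def transpose(A):
    return [list(col) for col in zip(*A)]


def eigenvalues_sym_2x2(A):
    """Eigenvalues of a real symmetric 2x2 matrix, from its characteristic polynomial."""
    a, b, d = A[0][0], A[0][1], A[1][1]
    mean = (a + d) / 2
    radius = math.hypot((a - d) / 2, b)
    return mean - radius, mean + radius


def signature(A, tol=1e-9):
    """Return the number of (positive, negative, zero) eigenvalues."""
    eigs = eigenvalues_sym_2x2(A)
    positive = sum(1 for x in eigs if x > tol)
    negative = sum(1 for x in eigs if x < -tol)
    return positive, negative, len(eigs) - positive - negative


A = [[2, math.sqrt(2)], [math.sqrt(2), 3]]
S = [[1, 2], [0, 3]]                       # hypothetical invertible matrix
B = matmul(matmul(S, A), transpose(S))     # congruent to A

print(signature(A), signature(B))  # (2, 0, 0) (2, 0, 0)

# An indefinite example: one positive and one negative eigenvalue before and after.
A2 = [[1, 0], [0, -1]]
B2 = matmul(matmul(S, A2), transpose(S))
print(signature(A2), signature(B2))  # (1, 1, 0) (1, 1, 0)
```

The eigenvalues of `B` themselves differ from those of `A`. Only the counts of each sign are preserved.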
Note that the matrix congruence relation looks a bit like the eigendecomposition of a matrix:
$$D = S M S^T$$
but note that $D$ does not have to contain eigenvalues, unlike the eigendecomposition of a matrix. This is because here $S$ is not fixed to having eigenvectors in its columns.
Because the matrix is symmetric however, we could always choose $S$ to actually diagonalize it as mentioned at eigendecomposition of a real symmetric matrix. Therefore, the metric signature can be seen directly from eigenvalues.
Also, because $D$ is a diagonal matrix, and thus symmetric, it must be that:
$$S M S^T = D = D^T = (S M S^T)^T = S M^T S^T$$
What this does represent, is a general change of bases that keeps the matrix symmetric.
Two symmetric matrices $A$ and $B$ are defined to be congruent if there exists an $S$ in $GL(n)$ such that:
$$A = S B S^T$$
So, by taking $S = C^T$ with $C$ the change of basis matrix, we understand that two matrices being congruent means that they can both correspond to the same bilinear form in different bases.
This is the possibly infinite dimensional version of a Hermitian matrix, since linear operators are the possibly infinite dimensional version of matrices.
There's a catch though: in general we no longer have explicit matrix indices here. The generalized definition is shown at: https://en.wikipedia.org/w/index.php?title=Hermitian_adjoint&oldid=1032475701#Definition_for_bounded_operators_between_Hilbert_spaces
The matrix ring of degree n is the set of all n-by-n square matrices together with the usual vector space and matrix multiplication operations.
This set forms a ring.
Members of the orthogonal group.
Applications:
A matrix that equals its transpose:
$$M = M^T$$
The definition implies that this is also a symmetric matrix.
The matrix of the dot product (the identity matrix) is positive definite, and so we see that those will have an important link to familiar geometry.
WTF is a skew? "Antisymmetric" is just such a better name! And it also appears in other definitions such as antisymmetric multilinear map.
$$N = C^{-1} M C$$
where:
  • $M$: matrix in the old basis
  • $N$: matrix in the new basis
  • $C$: change of basis matrix
The change of basis matrix $C$ is the matrix that allows us to express the new basis in an old basis:
$$x_{old} = C x_{new}$$
Mnemonic is as follows: consider we have an initial basis . Now, we define the new basis in terms of the old basis, e.g.:
which can be written in matrix form as:
and so if we set:
we have:
The usual question then is: given a vector in the new basis, how do we represent it in the old basis?
The answer is that we simply have to calculate the matrix inverse of :
That is the matrix inverse.
When we have a symmetric matrix, a change of bases keeps symmetry iff it is done by an orthogonal matrix, in which case the change of basis matrix $C$ satisfies:
$$C^{-1} = C^T$$
Every vector space is defined over a field.
E.g. in $\mathbb{R}^n$, the underlying field is $\mathbb{R}$, the real numbers. And in $\mathbb{C}^n$ the underlying field is $\mathbb{C}$, the complex numbers.
Any field can be used, including finite fields. But the underlying thing has to be a field, because the definitions of a vector space need all field properties to hold to make sense.
Elements of the underlying field of a vector space are known as scalars.
A member of the underlying field of a vector space. E.g. in $\mathbb{R}^n$, the underlying field is $\mathbb{R}$, and a scalar is a member of $\mathbb{R}$, i.e. a real number.
A multilinear form with a domain that looks like:
$$V^m \times {V^*}^n \to \mathbb{R}$$
where $V^*$ is the dual space.
Because a tensor is a multilinear form, it can be fully specified by how it acts on all combinations of basis vectors, which can be done in terms of components. We refer to each component as:
$$T_{i_1 \ldots i_m}^{j_1 \ldots j_n} = T(e_{i_1}, \ldots, e_{i_m}, e^{j_1}, \ldots, e^{j_n})$$
where we remember that the raised indices refer to dual vectors.
A linear map $A$ can be seen as a (1,1) tensor because:
$$T(w, v^*) = v^* A w$$
is a number, where $v^*$ is a dual vector and $w$ is a vector. Furthermore, $T$ is linear in both $w$ and $v^*$. All of this makes $T$ fulfill the definition of a (1,1) tensor.
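This correspondence can be sketched in code, representing the dual vector as a row of components and assuming the tensor is defined as $T(w, v^*) = v^* A w$. The matrix `A` and the helper name are hypothetical:

```python
def tensor_from_linear_map(A):
    """Return T(w, dual) = dual . (A w), a function of one vector and one dual vector."""
    def T(w, dual):
        Aw = [sum(a * x for a, x in zip(row, w)) for row in A]
        return sum(d * y for d, y in zip(dual, Aw))
    return T


A = [[1, 2], [3, 4]]
T = tensor_from_linear_map(A)
basis = [[1, 0], [0, 1]]        # components of e_1, e_2
dual_basis = [[1, 0], [0, 1]]   # components of e^1, e^2

# Evaluating T on all pairs (e_j, e^i) recovers the matrix entries A[i][j].
components = [[T(basis[j], dual_basis[i]) for j in range(2)] for i in range(2)]
print(components == A)  # True

# Linearity in the vector argument, checked on one sample.
print(T([2, 3], [1, -1]) == 2 * T(basis[0], [1, -1]) + 3 * T(basis[1], [1, -1]))  # True
```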
A tensor with domain $V^m \times {V^*}^n$ has order $(m, n)$.
The Wikipedia page of this article is basically a masterclass why Wikipedia is useless for learning technical subjects. They are not even able to teach such a simple subject properly there!
Bibliography:
TODO what is the point of them? Why not just sum over every index that appears twice, regardless of where it is, as mentioned at: https://www.maths.cam.ac.uk/postgrad/part-iii/files/misc/index-notation.pdf.
Vectors with the index on top such as $x^i$ are the "regular vectors", they are called contravariant vectors.
Those with indices on the bottom such as $x_i$ are called covariant vectors.
It is possible to change between them by Raising and lowering indices.
The values are different only when the metric signature matrix is different from the identity matrix.
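When a specific metric is involved, sometimes we want to automatically add it to products.
E.g., in a context considering the common Minkowski inner product matrix where $\eta$ is the 4x4 matrix and $x$ is a vector in $\mathbb{R}^4$, taking the $(+,-,-,-)$ signature convention:
$$x^\mu x_\mu = x^\mu \eta_{\mu\nu} x^\nu = x_0^2 - x_1^2 - x_2^2 - x_3^2$$
which leads to the change of sign of some terms.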
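The implicit metric can be made explicit in a short sketch. The $(+,-,-,-)$ signature used here is an assumption, since other sources use the opposite sign convention, and the helper names are hypothetical:

```python
# Minkowski metric, assuming the (+, -, -, -) signature convention.
ETA = [
    [1, 0, 0, 0],
    [0, -1, 0, 0],
    [0, 0, -1, 0],
    [0, 0, 0, -1],
]


def lower_index(x_upper, metric=ETA):
    """x_mu = eta_{mu nu} x^nu, with the sum over nu written out explicitly."""
    return [sum(metric[mu][nu] * x_upper[nu] for nu in range(4)) for mu in range(4)]


def contract(a_upper, b_lower):
    """a^mu b_mu: sum over the repeated index."""
    return sum(a * b for a, b in zip(a_upper, b_lower))


x = [5, 1, 2, 3]
x_lower = lower_index(x)
print(x_lower)               # [5, -1, -2, -3]
print(contract(x, x_lower))  # 11, i.e. 25 - 1 - 4 - 9
```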
The Einstein summation convention works well with partial derivatives, and this case is widely used in particle physics.
Partial index partial derivative notation is the partial derivative notation commonly used in this context, as we want to do operations by index rather than by labels such as $x$, $y$, $z$.
This notation also allows us to have raised and lowered indices on the partial derivative symbol TODO how are they different?
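Given a vector function of three variables:
$$F(x_0, x_1, x_2) = (F^0(x_0, x_1, x_2), F^1(x_0, x_1, x_2), F^2(x_0, x_1, x_2)) : \mathbb{R}^3 \to \mathbb{R}^3$$
so note that we are denoting each component of $F$ as $F^i$ with a raised index.
Then, the divergence can be written in Einstein notation as:
$$\nabla \cdot F = \partial_i F^i(x_0, x_1, x_2) = \partial_0 F^0(x_0, x_1, x_2) + \partial_1 F^1(x_0, x_1, x_2) + \partial_2 F^2(x_0, x_1, x_2)$$
It is common to just omit the variables of the function, so we tend to just say:
$$\nabla \cdot F = \partial_i F^i$$
or equivalently when referring just to the operation:
$$\nabla \cdot = \partial_i$$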
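Given a real function of three variables:
$$F(x_0, x_1, x_2) : \mathbb{R}^3 \to \mathbb{R}$$
its Laplacian can be written as:
$$\nabla^2 F(x_0, x_1, x_2) = \partial_i \partial^i F(x_0, x_1, x_2) = \partial_0^2 F(x_0, x_1, x_2) + \partial_1^2 F(x_0, x_1, x_2) + \partial_2^2 F(x_0, x_1, x_2)$$
It is common to just omit the variables of the function, so we tend to just say:
$$\nabla^2 F = \partial_i \partial^i F$$
or equivalently when referring just to the operation:
$$\nabla^2 = \partial_i \partial^i$$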
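Given the function $\psi$:
$$\psi : \mathbb{R}^4 \to \mathbb{C}$$
the operator can be written in Planck units, taking the $(+,-,-,-)$ signature convention, as:
$$\Box \psi(x_0, x_1, x_2, x_3) = \partial_i \partial^i \psi(x_0, x_1, x_2, x_3) = \partial_0^2 \psi - \partial_1^2 \psi - \partial_2^2 \psi - \partial_3^2 \psi$$
often written without function arguments as:
$$\partial_i \partial^i \psi$$
Note how this looks just like the Laplacian in Einstein notation, since the d'Alembert operator is just a generalization of the Laplace operator to Minkowski space.
The Klein-Gordon equation can be written in terms of the d'Alembert operator as:
$$\Box \psi + m^2 \psi = 0$$
so we can expand the d'Alembert operator in Einstein notation to:
$$\partial_i \partial^i \psi + m^2 \psi = 0$$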
