The term is not very clear, as it could either mean:
- a real number function whose graph is a line, i.e.:or for higher dimensions, a hyperplane:
- a linear map. Note that the above linear functions are not linear maps unless (known as the homogeneous case), because e.g.:butFor this reason, it is better never to refer to linear maps as linear functions.
A common case is , and .
One thing that makes such functions particularly simple is that they can be fully specified by specifyin how they act on all possible combinations of input basis vectors: they are therefore specified by only a finite number of elements of .
Every linear map in finite dimension can be represented by a matrix, the points of the domain being represented as vectors.
As such, when we say "linear map", we can think of a generalization of matrix multiplication that makes sense in infinite dimensional spaces like Hilbert spaces, since calling such infinite dimensional maps "matrices" is stretching it a bit, since we would need to specify infinitely many rows and columns.
The prototypical building block of infinite dimensional linear map is the derivative. In that case, the vectors being operated upon are functions, which cannot therefore be specified by a finite number of parameters, e.g.
For example, the left side of the time-independent Schrödinger equation is a linear map. And the time-independent Schrödinger equation can be seen as a eigenvalue problem.
Tagged
For the typical case of a linear form over , the form can be seen just as a row vector with n elements, the full form being specified by the value of each of the basis vectors.
The dual space of a vector space , sometimes denoted , is the vector space of all linear forms over with the obvious addition and scalar multiplication operations defined.
Since a linear form is completely determined by how it acts on a basis, and since for each basis element it is specified by a scalar, at least in finite dimension, the dimension of the dual space is the same as the , and so they are isomorphic because all vector spaces of the same dimension on a given field are isomorphic, and so the dual is quite a boring concept in the context of finite dimension.
Infinite dimension seems more interesting however, see: en.wikipedia.org/w/index.php?title=Dual_space&oldid=1046421278#Infinite-dimensional_case
One place where duals are different from the non-duals however is when dealing with tensors, because they transform differently than vectors from the base space .
Dual vectors are the members of a dual space.
In the context of tensors , we use raised indices to refer to members of the dual basis vs the underlying basis:The dual basis vectors are defined to "pick the corresponding coordinate" out of elements of V. E.g.:By expanding into the basis, we can put this more succinctly with the Kronecker delta as:
Note that in Einstein notation, the components of a dual vector have lower indices. This works well with the upper case indices of the dual vectors, allowing us to write a dual vector as:
In the context of quantum mechanics, the bra notation is also used for dual vectors.
Examples:
- a 2x2 matrix can represent a linear map from to , so which is a linear operator
- the derivative is a linear map from to , so which is also a linear operator
Given a linear operator over a space that has a inner product defined, we define the adjoint operator (the symbol is called "dagger") as the unique operator that satisfies:
Linear map of two variables.
More formally, given 3 vector spaces X, Y, Z over a single field, a bilinear map is a function from:that is linear on the first two arguments from X and Y, i.e.:Note that the definition only makes sense if all three vector spaces are over the same field, because linearity can mix up each of them.
The most important example by far is the dot product from , which is more specifically also a symmetric bilinear form.
Analogous to a linear form, a bilinear form is a Bilinear map where the image is the underlying field of the vector space, e.g. .
Some definitions require both of the input spaces to be the same, e.g. , but it doesn't make much different in general.
The most important example of a bilinear form is the dot product. It is only defined if both the input spaces are the same.
As usual, it is useful to think about how a bilinear form looks like in terms of vectors and matrices.
Unlike a linear form, which was a vector, because it has two inputs, the bilinear form is represented by a matrix which encodes the value for each possible pair of basis vectors.
If is the change of basis matrix, then the matrix representation of a bilinear form that looked like:then the matrix in the new basis is:Sylvester's law of inertia then tells us that the number of positive, negative and 0 eigenvalues of both of those matrices is the same.
Proof: the value of a given bilinear form cannot change due to a change of basis, since the bilinear form is just a function, and does not depend on the choice of basis. The only thing that change is the matrix representation of the form. Therefore, we must have:and in the new basis:and so since:
See form.
Analogous to a linear form, a multilinear form is a Multilinear map where the image is the underlying field of the vector space, e.g. .
Subcase of symmetric multilinear map:
The most important example is the dot product, which is also a positive definite symmetric bilinear form.
symmetric bilinear maps that is also a bilinear form.
Like the matrix representation of a bilinear form, it is a matrix, but now the matrix has to be a symmetric matrix.
We can then immediately see that the matrix is symmetric, then so is the form. We have:But because is a scalar, we have:and:
The complex number analogue of a symmetric bilinear form.
The prototypical example of it is the complex dot product.
Note that this form is neither strictly symmetric, it satisfies:where the over bar indicates the complex conjugate, nor is it linear for complex scalar multiplication on the second argument.
Bibliography:
;
Multivariate polynomial where each term has degree 2, e.g.:is a quadratic form because each term has degree 2:but e.g.:is not because the term has degree 3.
More generally for any number of variables it can be written as:
There is a 1-to-1 relationship between quadratic forms and symmetric bilinear forms. In matrix representation, this can be written as:where contains each of the variabes of the form, e.g. for 2 variables:
Strictly speaking, the associated bilinear form would not need to be a symmetric bilinear form, at least for the real numbers or complex numbers which are commutative. E.g.:But that same matrix could also be written in symmetric form as:so why not I guess, its simpler/more restricted.
A positive definite matrix that is also a symmetric matrix.
Subcase of antisymmetric multilinear map:
Skew-symmetric bilinear map that is also a bilinear form.
Same value if you swap any input arguments.
Change sign if you swap two input values.
Implies antisymmetric multilinear map.
The definition of the "dot product" of a general space varies quite a lot with different contexts.
Most definitions tend to be bilinear forms.
We use the unqualified generally refers to the dot product of Real coordinate spaces, which is a positive definite symmetric bilinear form. Other important examples include:The rest of this section is about the case.
- the complex dot product, which is not strictly symmetric nor linear, but it is positive definite
- Minkowski inner product, sometimes called" "Minkowski dot product is not positive definite
The positive definite part of the definition likely comes in because we are so familiar with metric spaces, which requires a positive norm in the norm induced by an inner product.
The default Euclidean space definition, we use the matrix representation of a symmetric bilinear form as the identity matrix, e.g. in :so that:
Definition:
- odd permutation: -1
- even permutation: 1
- not a permutation: 0. This happens iff two more more indices are repeated
An Introduction to Tensors and Group Theory for Physicists by Nadir Jeevanjee (2011) shows that this is a tensor that represents the volume of a parallelepiped.
It takes as input three vectors, and outputs one real number, the volume. And it is linear on each vector. This perfectly satisfied the definition of a tensor of order (3,0).
Given a basis and a function that return the volume of a parallelepiped given by three vectors , .
Name origin: likely because it "determines" if a matrix is invertible or not, as a matrix is invertible iff determinant is not zero.
When it exists, which is not for all matrices, only invertible matrix, the inverse is denoted:
The set of all invertible matrices forms a group: the general linear group with matrix multiplication. Non-invertible matrices don't form a group due to the lack of inverse.
When it distributes it inverts the order of the matrix multiplication:
Since a matrix can be seen as a linear map , the product of two matrices can be seen as the composition of two linear maps:One cool thing about linear functions is that we can easily pre-calculate this product only once to obtain a new matrix, and so we don't have to do both multiplications separately each time.
No 2x2 examples please. I'm talking about large matrices that would be used in supercomputers.
For positive definite matrices only.
TODO application.
TODO speedup over algorithm for general matrices.
www.studentclustercompetition.us/ comments:
The HPCG benchmark uses a preconditioned conjugate gradient (PCG) algorithm to measure the performance of HPC platforms with respect to frequently observed but challenging patterns of computing, communication, and memory access. While HPL provides an optimistic performance target for applications, HPCG can be considered as a lower bound on performance. Many of the top 500 supercomputers also provide their HPCG performance as a reference.
math.stackexchange.com/questions/41706/practical-uses-of-matrix-multiplication/4647422#4647422 highlights deep learning applications.
- math.stackexchange.com/questions/23312/what-is-the-importance-of-eigenvalues-eigenvectors/3503875#3503875
- math.stackexchange.com/questions/1520832/real-life-examples-for-eigenvalues-eigenvectors
- matheducators.stackexchange.com/questions/520/what-is-a-good-motivation-showcase-for-a-student-for-the-study-of-eigenvalues
Set of eigenvalues of a linear operator.
Unlike the simple case of a matrix, in infinite dimensional vector spaces, the spectrum may be continuous.
The quintessential example of that is the spectrum of the position operator in quantum mechanics, in which any real number is a possible eigenvalue, since the particle may be found in any position. The associated eigenvectors are the corresponding Dirac delta functions.
Every invertible matrix can be written as:where:Note therefore that this decomposition is unique up to swapping the order of eigenvectors. We could fix a canonical form by sorting eigenvectors from smallest to largest in the case of a real number.
- is a diagonal matrix containing the eigenvalues of
- columns of are eigenvectors of
Intuitively, Note that this is just the change of basis formula, and so:
- changes basis to align to the eigenvectors
- multiplies eigenvectors simply by eigenvalues
- changes back to the original basis
The general result from eigendecomposition of a matrix:becomes:where is an orthogonal matrix, and therefore has .
The main interest of this theorem is in classifying the indefinite orthogonal groups, which in turn is fundamental because the Lorentz group is an indefinite orthogonal groups, see: all indefinite orthogonal groups of matrices of equal metric signature are isomorphic.
It also tells us that a change of basis does not the alter the metric signature of a bilinear form, see matrix congruence can be seen as the change of basis of a bilinear form.
The theorem states that the number of 0, 1 and -1 in the metric signature is the same for two symmetric matrices that are congruent matrices.
For example, consider:
The eigenvalues of are and , and the associated eigenvectors are:symPy code:and from the eigendecomposition of a real symmetric matrix we know that:
A = Matrix([[2, sqrt(2)], [sqrt(2), 3]])
A.eigenvects()
Now, instead of , we could use , where is an arbitrary diagonal matrix of type:With this, would reach a new matrix :Therefore, with this congruence, we are able to multiply the eigenvalues of by any positive number and . Since we are multiplying by two arbitrary positive numbers, we cannot change the signs of the original eigenvalues, and so the metric signature is maintained, but respecting that any value can be reached.
Note that the matrix congruence relation looks a bit like the eigendecomposition of a matrix:but note that does not have to contain eigenvalues, unlike the eigendecomposition of a matrix. This is because here is not fixed to having eigenvectors in its columns.
But because the matrix is symmetric however, we could always choose to actually diagonalize as mentioned at eigendecomposition of a real symmetric matrix. Therefore, the metric signature can be seen directly from eigenvalues.
What this does represent, is a general change of basis that maintains the matrix a symmetric matrix.
Related:
From effect of a change of basis on the matrix of a bilinear form, remember that a change of basis modifies the matrix representation of a bilinear form as:
So, by taking , we understand that two matrices being congruent means that they can both correspond to the same bilinear form in different bases.
This is the possibly infinite dimensional version of a Hermitian matrix, since linear operators are the possibly infinite dimensional version of matrices.
There's a catch though: now we don't have explicit matrix indices here however in general, the generalized definition is shown at: en.wikipedia.org/w/index.php?title=Hermitian_adjoint&oldid=1032475701#Definition_for_bounded_operators_between_Hilbert_spaces
A good definition is that the sparse matrix has non-zero entries proportional the number of rows. Therefore this is Big O notation less than something that has non zero entries. Of course, this only makes sense when generalizing to larger and larger matrices, otherwise we could take the constant of proportionality very high for one specific matrix.
Of course, this only makes sense when generalizing to larger and larger matrices, otherwise we could take the constant of proportionality very high for one specific matrix.
Forms a normal subgroup of the general linear group.
Forms a normal subgroup of the general linear group.
The matrix ring of degree n is the set of all n-by-n square matrices together with the usual vector space and matrix multiplication operations.
This set forms a ring.
Related terminology:
Members of the orthogonal group.
Complex analogue of orthogonal matrix.
Applications:
- in quantum computers programming basically comes down to creating one big unitary matrix as explained at: quantum computing is just matrix multiplication
Can represent a symmetric bilinear form as shown at matrix representation of a symmetric bilinear form, or a quadratic form.
The definition implies that this is also a symmetric matrix.
The dot product is a positive definite matrix, and so we see that those will have an important link to familiar geometry.
WTF is a skew? "Antisymmetric" is just such a better name! And it also appears in other definitions such as antisymmetric multilinear map.
where:
- : matrix in the old basis
- : matrix in the new basis
- : change of basis matrix
The change of basis matrix is the matrix that allows us to express the new basis in an old basis:
Mnemonic is as follows: consider we have an initial basis . Now, we define the new basis in terms of the old basis, e.g.:which can be written in matrix form as:and so if we set:we have:
The usual question then is: given a vector in the new basis, how do we represent it in the old basis?
That is the matrix inverse.
When we have a symmetric matrix, a change of basis keeps symmetry iff it is done by an orthogonal matrix, in which case:
Every vector space is defined over a field.
E.g. in , the underlying field is , the real numbers. And in the underlying field is , the complex numbers.
Any field can be used, including finite field. But the underlying thing has to be a field, because the definitions of a vector need all field properties to hold to make sense.
Elements of the underlying field of a vector space are known as scalar.
A member of the underlying field of a vector space. E.g. in , the underlying field is , and a scalar is a member of , i.e. a real number.
Because a tensor is a multilinear form, it can be fully specified by how it act on all combinations of basis sets, which can be done in terms of components. We refer to each component as:where we remember that the raised indices refer dual vector.
Tagged
A linear map can be seen as a (1,1) tensor because:is a number, . is a dual vector, and W is a vector. Furthermoe, is linear in both and . All of this makes fullfill the definition of a (1,1) tensor.
Bibliography:
has order
The Wikipedia page of this article is basically a masterclass why Wikipedia is useless for learning technical subjects. They are not even able to teach such a simple subject properly there!
Bibliography:
- www.maths.cam.ac.uk/postgrad/part-iii/files/misc/index-notation.pdf gives a definition that does not consider upper and lower indexes, it only counts how many times the indices appearTheir definition of the Laplacian is a bit wrong as only one appears in it, they likely meant to have written instead of , related:
TODO what is the point of them? Why not just sum over every index that appears twice, regardless of where it is, as mentioned at: www.maths.cam.ac.uk/postgrad/part-iii/files/misc/index-notation.pdf.
Vectors with the index on top such as are the "regular vectors", they are called covariant vectors.
Those in indices on bottom are called contravariant vectors.
It is possible to change between them by Raising and lowering indices.
The values are different only when the metric signature matrix is different from the identity matrix.
Then a specific metric is involved, sometimes we want to automatically add it to products.
E.g., in a context considering the common Minkowski inner product matrix where the 4x4 matrix and is a vector in which leads to the change of sign of some terms.
The Einstein summation convention works will with partial derivatives and it is widely used in particle physics.
In particular, the divergence and the Laplacian can be succintly expressed in this notation:
In order to expresse partial derivatives, we must use what Ciro Santilli calls the "partial index partial derivative notation", which refers to variales with indices such as , , , , and instead of the usual letters , and .
It is common to just omit the variables of the function, so we tend to just say:or equivalently when referring just to the operator:
Consider a real valued function of three variables:
It is common to just omit the variables of the function, so we tend to just say:or equivalently when referring just to the operator:
Given the function :the operator can be written in Planck units as:often written without function arguments as:Note how this looks just like the Laplacian in Einstein notation, since the D'alembert operator is just a generalization of the laplace operator to Minkowski space.
The Klein-Gordon equation can be written in terms of the D'alembert operator as:so we can expand the D'alembert operator in Einstein notation to:
Source: github.com/QBobWatson/ila.
Written in MathBook XML.