Ciro Santilli $$ Sponsor Ciro $$ 中国独裁统治 China Dictatorship 新疆改造中心、六四事件、法轮功、郝海东、709大抓捕、2015巴拿马文件 邓家贵、低端人口、西藏骚乱
Mathematics is a beautiful game played on strings, which mathematicians call "theorems".
Here is a more understandable description of the semi-satire that follows:
You start with a very small list of:
  • certain arbitrarily chosen initial strings, which mathematicians call "axioms"
  • rules of how to obtain new strings from old strings, called "rules of inference" Every transformation rule is very simple, and can be verified by a computer.
Using those rules, you choose a target string that you want to reach, and then try to reach it. Before the target string is reached, mathematicians call it a "conjecture".
Mathematicians call the list of transformation rules used to reach a string a "proof".
Since every step of the proof is very simple and can be verified by a computer automatically, the entire proof can also be automatically verified by a computer very easily.
Finding proofs however is undoubtedly an uncomputable problem.
Most mathematicians can't code or deal with the real world in general however, so they haven't created the obviously necessary: website front-end for a mathematical formal proof system.
The fact that Mathematics happens to be the best way to describe physics and that humans can use physical intuition heuristics to reach the NP-hard proofs of mathematics is one of the great miracles of the universe.
Once we have mathematics formally modelled, one of the coolest results is Gödel's incompleteness theorems, which states that for any reasonable proof system, there are necessarily theorems that cannot be proven neither true nor false starting from any given set of axioms: those theorems are independent from those axioms. Therefore, there are three possible outcomes for any hypothesis: true, false or independent!
Some famous theorems have even been proven to be independent of some famous axioms. One of the most notable is that the Continuum Hypothesis is independent from Zermelo-Fraenkel set theory! Such independence proofs rely on modelling the proof system inside another proof system, and forcing is one of the main techniques used for this.
Figure 1. The landscape of modern Mathematics comic by Abstruse Goose. Source. This comic shows that Mathematics is one of the most diversified areas of useless human knowledge.
Much of this section will be dumped at Section "Website front-end for a mathematical formal proof system" instead.
If Ciro Santilli ever becomes rich, he's going to solve this with: website front-end for a mathematical formal proof system, promise.
A proof in some system for the formalization of mathematics.
One of the first formal proof systems. This is actually understandable!
This is Ciro Santilli-2020 definition of the foundation of mathematics (and the only one he had any patience to study at all).
TODO what are its limitations? Why were other systems created?
When Ciro Santilli says set theory, he basically means. Zermelo-Fraenkel set theory.
It seems to implement Zermelo-Fraenkel set theory.
A set of axioms is consistent if they don't lead to any contradictions.
When a set of axioms is not consistent, false can be proven, and then everything is true, making the set of axioms useless.
A theorem is said to be independent from a set of axioms if it cannot be proven neither true nor false from those axioms.
It or its negation could therefore be arbitrarily added to the set of axioms.
An easy to prove theorem that follows from a harder to prove theorem.
Intuitively: unordered container where all the values are unique, just like C++ std::set.
More precisely for set theory formalization of mathematics:
  • everything is a set, including the elements of sets
  • string manipulation wise:
    • {} is an empty set. The natural number 0 is defined as {} as well.
    • {{}} is a set that contains an empty set
    • {{}, {{}}} is a set that contains two sets: {} and {{}}
    • {{}, {}} is not well formed, because it contains {} twice
The size of a set.
For finite sizes, the definition is simple, and the intuitive name "size" matches well.
But for infinity, things are messier, e.g. the size of the real numbers is strictly larger than the size of the integers as shown by Cantor's diagonal argument, which is kind of what justifies a fancier word "cardinality" to distinguish it from the more normal word "size".
The key idea is to compare set sizes with bijections.
A function that maps two sets to a third set.
A Cartesian product that carries over some extra structure of the input groups.
E.g. the direct product of groups carries over group structure on both sides.
Mnemonic: in means into. So we are going into a codomain that is large enough so that we can have a different image for every input.
Mnemonic: sur means over. So we are going over the codomain, and covering it entirely.
Vs: image: the codomain is the set that the function might reach.
The image is the exact set that it actually reaches.
E.g. the function:
could have:
  • codomain
  • image
Note that the definition of the codomain is somewhat arbitrary, e.g. could as well technically have codomain:
even though it will obviously never reach any value in .
The exact image is in general therefore harder to characterize.
A function where the domain is the same as the codomain.
We define this as the functional equation:
It is a bit like cauchy's functional equation but with multiplication instead of addition.
The differential equation that is solved by the exponential function:
with initial condition:
TODO find better name for it, "linear homogenous differential equation of degree one" almost fully constrainst it except for the exponent constant and initial value.
The Taylor series expansion is the most direct definition of the expontial as it obviously satisfies the exponential function differential equation:
  • the first constant term dies
  • each other term gets converted to the one before
  • because we have infinite many terms, we get what we started with!
The basic intuition for this is to start from the origin and make small changes to the function based on its known derivative at the origin.
More precisely, we know that for any base b, exponentiation satisfies:
  • .
  • .
And we also know that for in particular that we satisfy the exponential function differential equation and so:
One interesting fact is that the only thing we use from the exponential function differential equation is the value around , which is quite little information! This idea is basically what is behind the importance of the ralationship between Lie group-Lie algebra correspondence via the exponential map. In the more general settings of groups and manifolds, restricting ourselves to be near the origin is a huge advantage.
Now suppose that we want to calculate . The idea is to start from and then then to use the first order of the Taylor series to extend the known value of to .
E.g., if we split into 2 parts, we know that:
or in three parts:
so we can just use arbitrarily many parts that are arbitrarily close to :
and more generally for any we have:
Let's see what happens with the Taylor series. We have near in little-o notation:
Therefore, for , which is near for any fixed :
and therefore:
which is basically the formula tha we wanted. We just have to convince ourselves that at , the disappears, i.e.:
To do that, let's multiply by itself once:
and multiplying a third time:
TODO conclude.
Is the solution to a system of linear ordinary differential equations, the exponential function is just a 1-dimensional subcase.
Note that more generally, the matrix exponential can be defined on any ring.
The matrix exponential is of particular interest in the study of Lie groups, because in the case of the Lie algebra of a matrix Lie group, it provides the correct exponential map.
Video 1. How (and why) to raise e to the power of a matrix by 3Blue1Brown (2021) Source. mentions it always exists for all invertible complex matrices. But the real condition is more complicated. Notable counter example: -1 cannot be reached by any real .
The Lie algebra exponential covering problem can be seen as a generalized version of this problem, because
  • Lie algebra of is just the entire
  • we can immediately exclude non-invertible matrices from being the result of the exponential, because has inverse , so we already know that non-invertible matrices are not reachable
Most notable example: .
What do you prefer, 1 \times 10^{10} or 1E10.
A good definition is by using Dedekind cuts.
An ordered pair of two real numbers with the complex addition and multiplication defined.
Forms both a:
Constructs the quaternions from complex numbers, octonions from quaternions, and keeps doubling like this indefinitely.
Kind of extends the complex numbers.
Some facts that make them stand out:
Unlike the quaternions, it is non-associative.
sets are unordered, but we can use them to create ordered objects, which are of fundamental importance. Notably, they are used in the definition of functions.
This is the part of the formalization of mathematics that deals only with the propositions.
In some systems, e.g. including Metamath, modus ponens alone tends to be enough, everything else can be defined based on it.
Builds on top of propositional logic, adding notably existential quantification.
Models existence in the context of the formalization of mathematics.
Existence and uniqueness results are fundamental in mathematics because we often define objects by their properties, and then start calling them "the object", which is fantastically convenient.
But calling something "the object" only makes sense if there exists exactly one, and only one, object that satisfies the properties.
One particular context where these come up very explicitly is in solutions to differential equations, e.g. existence and uniqueness of solutions of partial differential equations.