Words: 4

Born: 1965

Died: 2010+-ish

Semiconductor device fabrication

Words: 1k Articles: 41

en.wikipedia.org/wiki/Semiconductor_device

This is the lowest level of abstraction computer, at which the basic gates and power are described.

At this level, you are basically thinking about the 3D layered structure of a chip, and how to make machines that will allow you to create better, usually smaller, gates.

Semiconductor research institute

Words: 39 Articles: 3

IMEC (1984-, Belgium)

Words: 35

Video 1.

imec: The Semiconductor Watering Hole by Asianometry (2022)

Source. A key thing they do is have a small prototype fab that brings in-development equipment from different vendors together to make sure the are working well together. Cool.

Computer research institute

Words: 4 Articles: 1

As mentioned at youtu.be/16BzIG0lrEs?t=397 from Video 4. "Applied Materials by Asianometry (2021)", originally the companies fabs would make their own equipment. But eventually things got so complicated that it became worth it for separate companies to focus on equipment, which then then sell to the fabs.

ASML Holding (1984-)

Words: 169 Articles: 1

As of 2020 leading makers of the most important fab photolithography equipment.

Video 2.

ASML: TSMC's Critical Supplier by Asianometry (2021)

Source.

Video 3.

How ASML Won Lithography by Asianometry (2021)

Source.

First there were dominant Elmer and Geophysics Corporation of America dominating the market.

Then a Japanese government project managed to make Nikon and Canon Inc. catch up, and in 1989, when Ciro Santilli was born, they had 70% of the market.

youtu.be/SB8qIO6Ti_M?t=240 In 1995, ASML had reached 25% market share. Then it managed the folloging faster than the others:

TwinScan, reached 50% market share in 2002
Immersion litography
EUV. There was a big split between EUV vs particle beams, and ASML bet on EUV and EUV won.
youtu.be/SB8qIO6Ti_M?t=459 they have an insane number of software engineers working on software for the machine, which is insanely complex. They are big on UML.
youtu.be/SB8qIO6Ti_M?t=634 they use ZEISS optics, don't develop their own. More precisely, the majority owned subsidiary Carl Zeiss SMT.
youtu.be/SB8qIO6Ti_M?t=703 IMEC collaborations worked well. Notably the ASML/Philips/ZEISS trinity

www.youtube.com/watch?v=XLNsYecX_2Q ASML: Chip making goes vacuum with EUV (2009) Self promotional video, some good shots of their buildings.

ASM International (1964)

Words: 3

Parent/predecessor of ASML.

Applied Materials (1967-)

Words: 6

Video 4.

Applied Materials by Asianometry (2021)

Source. They are chemical vapor deposition fanatics basically.

Power, performance and area (PPA)

Words: 199

en.wikichip.org/wiki/power-performance-area

This is the mantra of the semiconductor industry:

power and area are the main limiting factors of chips, i.e., your budget:
- chip area is ultra expensive because there are sporadic errors in the fabrication process, and each error in any part of the chip can potentially break the entire chip. Although there are
  The percentage of working chips is called the yield.
  In some cases however, e.g. if the error only affects single CPU of a multi-core CPU, then they actually deactivate the broken CPU after testing, and sell the worse CPU cheaper with a clear branding of that: this is called binning www.tomshardware.com/uk/reviews/glossary-binning-definition,5892.html
- power is a major semiconductor limit as of 2010's and onwards. If everything turns on at once, the chip would burn. Designs have to account for that.
performance is the goal.
Conceptually, this is basically a set of algorithms that you want your hardware to solve, each one with a respective weight of importance.
Serial performance is fundamentally limited by the longest path that electrons have to travel in a given clock cycle.
The way to work around it is to create pipelines, splitting up single operations into multiple smaller operations, and storing intermediate results in memories.

Wafer (electronics)

Articles: 1

Czochralski method

Semiconductor fabrication plant (foundry, Fab)

Words: 226 Articles: 11

They put a lot of expensive equipment together, much of it made by other companies, and they make the entire chip for companies ordering them.

Company with a semiconductor fabrication plant

Words: 200 Articles: 5

A list of fabs can be seen at: en.wikipedia.org/wiki/List_of_semiconductor_fabrication_plants and basically summarizes all the companies that have fabs.

 Tagged

Intel

Fairchild Semiconductor

Words: 5

Some nice insights at: Robert Noyce: The Man Behind the Microchip by Leslie Berlin (2006).

GlobalFoundries (2009, AMD spinout)

Words: 36

AMD just gave up this risky part of the business amidst the fabless boom. Sound like a wise move. They then fell more and more away from the state of the art, and moved into more niche areas.

Infineon Technologies (1999)

SMIC (Chinese TSMC)

Words: 4

Video 5.

SMIC, Explained by Asianometry (2021)

Source.

TSMC

Words: 138

One of the companies that has fabs, which buys machines from companies such as ASML and puts them together in so called "silicon fabs" to make the chips

As the quintessential fabless fab, there is on thing TSMC can never ever do: sell their own design! It must forever remain a fab-only company, that will never compete with its customers. This is highlighted e.g. at youtu.be/TRZqE6H-dww?t=936 from Video 31. "How Nvidia Won Graphics Cards by Asianometry (2021)".

Video 6.

How Taiwan Created TSMC by Asianometry (2020)

Source. Some points:

UCM failed because it focused too much on the internal market, and was shielded from external competition, so it didn't become world leading
one of TSMC's great advances was the fabless business model approach.
they managed to do large technology transfers from the West to kickstart things off
one of their main victories was investing early in CMOS, before it became huge, and winning that market

Semiconductor fabrication step

Articles: 4

Chemical vapor deposition

Photolithography

Articles: 2

Extreme ultraviolet lithography (EUV)

Photomask

Standard cell library

Words: 179 Articles: 1

Basically what register transfer level compiles to in order to achieve a real chip implementation.

After this is done, the final step is place and route.

They can be designed by third parties besides the semiconductor fabrication plants. E.g. Arm Ltd. markets its Artisan Standard Cell Libraries as mentioned e.g. at: web.archive.org/web/20211007050341/https://developer.arm.com/ip-products/physical-ip/logic This came from a 2004 acquisition: www.eetimes.com/arm-to-acquire-artisan-components-for-913-million/, obviously.

The standard cell library is typically composed of a bunch of versions of somewhat simple gates, e.g.:

AND with 2 inputs
AND with 3 inputs
AND with 4 inputs
OR with 2 inputs
OR with 3 inputs

and so on.

Each of those gates has to be designed by hand as a 3D structure that can be produced in a given fab.

Simulations are then carried out, and the electric properties of those structures are characterized in a standard way as a bunch of tables of numbers that specify things like:

how long it takes for electrons to pass through
how much heat it produces

Those are then used in power, performance and area estimates.

Open source standard cell library

Words: 18

Open source ones:

www.quora.com/Are-there-good-open-source-standard-cell-libraries-to-learn-IC-synthesis-with-EDA-tools/answer/Ciro-Santilli Are there good open source standard cell libraries to learn IC synthesis with EDA tools?

Electronic design automation (EDA)

Words: 165 Articles: 12

A set of software programs that compile high level register transfer level languages such as Verilog into something that a fab can actually produce. One is reminded of a compiler toolchain but on a lower level.

The most important steps of that include:

logic synthesis: mapping the Verilog to a standard cell library
place and route: mapping the synthesis output into the 2D surface of the chip

Electronic design automation phase

Words: 67 Articles: 4

Logic synthesis

Words: 28

Step of electronic design automation that maps the register transfer level input (e.g. Verilog) to a standard cell library.

The output of this step is another Verilog file, but one that exclusively uses interlinked cell library components.

Place and route

Words: 39 Articles: 2

Given a bunch of interlinked standard cell library elements from the logic synthesis step, actually decide where exactly they are going to go on 2D (stacked 2D) integrated circuit surface.

Sample output format of place and route would be GDSII.

Integrated circuit layout

Words: 6 Articles: 1

GDSII

Words: 6

EDA company

Words: 25 Articles: 4

The main ones as of 2020 are:

Mentor Graphics, which was bought by Siemens in 2017
Cadence Design Systems
Synopsys

Cadence Design Systems

Words: 11 Articles: 1

Alberto Sangiovanni-Vincentelli

Words: 11

Video 7.

The Italian PROFESSOR who founded 2 BILLION-DOLLAR Companies by Marcello Ascani

. Source.

Mentor Graphics

Synopsys

Open source EDA tool

Words: 21 Articles: 1

qflow

Words: 21

Cool looking open source EDA toolchain:

They apparently even produced a real working small RISC-V chip with the flow, not bad.

Semiconductor process node

Semiconductor device fabrication bibliography

Words: 109 Articles: 1

Asianometry

Words: 109

www.youtube.com/channel/UC1LpsuAUaKoMzzJSEt5WImw

Very good channel to learn some basics of semiconductor device fabrication!

Focuses mostly on the semiconductor industry.

youtu.be/aL_kzMlqgt4?t=661 from Video 5. "SMIC, Explained by Asianometry (2021)" from mentions he is of Chinese ascent, ancestors from Ningbo. Earlier in the same video he mentions he worked on some startups. He doesn't appear to speak perfect Mandarin Chinese anymore though based on pronounciation of Chinese names.

asianometry.substack.com/ gives an abbreviated name "Jon Y".

Video 8.

Reflecting on Asianometry in 2022 by Asianometry (2022)

Source. Mentions his insane work schedule: 4 hours research in the morning, then day job, then editing and uploading until midnight. Appears to be based in Taipei. Two videos a week. So even at the current 400k subs, he still can't make a living.

Integrated circuit (IC)

Words: 161 Articles: 3

It is quite amazing to read through books such as The Supermen: The Story of Seymour Cray by Charles J. Murray (1997), as it makes you notice that earlier CPUs (all before the 70's) were not made with integrated circuits, but rather smaller pieces glued up on PCBs! E.g. the arithmetic logic unit was actually a discrete component at one point.

The reason for this can also be understood quite clearly by reading books such as Robert Noyce: The Man Behind the Microchip by Leslie Berlin (2006). The first integrated circuits were just too small for this. It was initially unimaginable that a CPU would fit in a single chip! Even just having a very small number of components on a chip was already revolutionary and enough to kick-start the industry. Just imagine how much money any level of integration saved in those early days for production, e.g. as opposed to manually soldering point-to-point constructions. Also the reliability, size an weight gains were amazing. In particular for military and spacial applications originally.

Video 9.

A briefing on semiconductors by Fairchild Semiconductor (1967)

Source.

Uploaded by the Computer History Museum. There is value in tutorials written by early pioneers of the field, this is pure gold.

Shows:

photomasks
silicon ingots and wafer processing

Interconnect (integrated_circuits, Chip interconnect)

Application-specific integrated circuit (ASIC, Hardware acceleration)

System on a chip (SoC)

Register transfer level (RTL)

Words: 652 Articles: 11

The only two truly relevant RTL languages as of 2020 are: Verilog and VHDL. Everything else compiles to those, because that's all that EDA vendors support.

Much like a C compiler abstracts away the CPU assembly to:

increase portability across ISAs
do optimizations that programmers can't feasibly do without going crazy

Compilers for RTL languages such as Verilog and VHDL abstract away the details of the specific semiconductor technology used for those exact same reasons.

The compilers essentially compile the RTL languages into a standard cell library.

Examples of companies that work at this level include:

Intel. Intel also has semiconductor fabrication plants however.
Arm which does not have fabs, and is therefore called a "fabless" company.

High-level synthesis

Fabless manufacturing

Words: 97 Articles: 1

In the past, most computer designers would have their own fabs.

But once designs started getting very complicated, it started to make sense to separate concerns between designers and fabs.

What this means is that design companies would primarily write register transfer level, then use electronic design automation tools to get a final manufacturable chip, and then send that to the fab.

It is in this point of time that TSMC came along, and benefied and helped establish this trend.

The term "Fabless" could in theory refer to other areas of industry besides the semiconductor industry, but it is mostly used in that context.

Fabless semiconductor company

 Tagged

Cerebras

Logic gate

Articles: 1

Truth table

Verilog

Words: 322 Articles: 3

Examples under verilog, more details at Verilator.

Value change dump (VCD)

Verilator

Words: 315 Articles: 1

Verilog simulator that transpiles to C++.

One very good thing about this is that it makes it easy to create test cases directly in C++. You just supply inputs and clock the simulation directly in a C++ loop, then read outputs and assert them with assert(). And you can inspect variables by printing them or with GDB. This is infinitely more convenient than doing these IO-type tasks in Verilog itself.

Some simulation examples under verilog.

First install Verilator. On Ubuntu:

sudo apt install verilator

Tested on Verilator 4.038, Ubuntu 22.04.

Run all examples, which have assertions in them:

cd verilator
make run

File structure is for example:

verilog/counter.v: Verilog file
verilog/counter.cpp: C++ loop which clocks the design and runs tests with assertions on the outputs
verilog/counter.params: gcc compilation flags for this example
verilog/counter_tb.v: Verilog version of the C++ test. Not used by Verilator. Verilator can't actually run out _tb files, because they do in Verilog IO things that we do better from C++ in Verilator, so Verilator didn't bother implementing them. This is a good thing.

Example list:

verilog/negator.v, verilog/negator.cpp: the simplest non-identity combinatorial circuit!
verilog/counter.v, verilog/counter.cpp: sequential hello world. Synchronous active high reset with active high enable signal. Adapted from: www.asic-world.com/verilog/first1.html
verilog/subleq.v, verilog/subleq.cpp: subleq one instruction set computer with separated instruction and data RAMs

Verilator interactive example

Words: 114

The example under verilog/interactive showcases how to create a simple interactive visual Verilog example using Verilator and SDL.

https://raw.githubusercontent.com/cirosantilli/media/master/verilog-interactive.gif

You could e.g. expand such an example to create a simple (or complex) video game for example if you were insane enough. But please don't waste your time doing that, Ciro Santilli begs you.

The example is also described at: stackoverflow.com/questions/38108243/is-it-possible-to-do-interactive-user-input-and-output-simulation-in-vhdl-or-ver/38174654#38174654

Usage: install dependencies:

sudo apt install libsdl2-dev verilator

then run as either:

make run RUN=and2
make run RUN=move

Tested on Verilator 4.038, Ubuntu 22.04.

File overview:

In those examples, the more interesting application specific logic is delegated to Verilog (e.g.: move game character on map), while boring timing and display matters can be handled by SDL and C++.

VHDL

Words: 113 Articles: 1

Examples under vhdl, more details at: GHDL.

GHDL

Words: 106

github.com/ghdl/ghdl

Examples under vhdl.

First install GHDL. On Ubuntu:

sudo apt install verilator

Tested on Verilator 1.0.0, Ubuntu 22.04.

Run all examples, which have assertions in them:

cd vhdl
./run

Files:

Examples
- Basic
  - vhdl/hello_world_tb.vhdl: hello world
  - vhdl/min_tb.vhdl: min
  - vhdl/assert_tb.vhdl: assert
- Lexer
  - vhdl/comments_tb.vhdl: comments
  - vhdl/case_insensitive_tb.vhdl: case insensitive
  - vhdl/whitespace_tb.vhdl: whitespace
  - vhdl/literals_tb.vhdl: literals
- Flow control
  - vhdl/procedure_tb.vhdl: procedure
  - vhdl/function_tb.vhdl: function
- vhdl/operators_tb.vhdl: operators
- Types
  - vhdl/integer_types_tb.vhdl: integer types
  - vhdl/array_tb.vhdl: array
  - vhdl/record_tb.vhdl.bak: record. TODO fails with "GHDL Bug occurred" on GHDL 1.0.0
  - vhdl/generic_tb.vhdl: generic
- vhdl/package_test_tb.vhdl: Packages
  - vhdl/standard_package_tb.vhdl: standard package
  - textio
    * vhdl/write_tb.vhdl: write
    * vhdl/read_tb.vhdl: read
  - vhdl/std_logic_tb.vhdl: std_logic
- vhdl/stop_delta_tb.vhdl: --stop-delta
Applications
- Combinatoric
  - vhdl/adder.vhdl: adder
  - vhdl/sqrt8_tb.vhdl: sqrt8
- Sequential
  - vhdl/clock_tb.vhdl: clock
  - vhdl/counter.vhdl: counter
Helpers
* vhdl/template_tb.vhdl: template

Microarchitecture

Words: 326 Articles: 8

 Tagged

CPU architecture

Microarchitectural benchmark

Words: 326 Articles: 7

CPU microbenchmark

Words: 326 Articles: 6

Some examples:

c/inc_loop.c

Words: 162

Ubuntu 25.04 GCC 14.2 -O0 x86_64 produces a horrendous:

11c8:       48 83 45 f0 01          addq   $0x1,-0x10(%rbp)
11cd:       48 8b 45 f0             mov    -0x10(%rbp),%rax
11d1:       48 3b 45 e8             cmp    -0x18(%rbp),%rax
11d5:       72 f1                   jb     11c8 <main+0x7f>

To do about 1s on P14s we need 2.5 billion instructions:

time ./inc_loop.out 2500000000

and:

time ./inc_loop.out 2500000000

gives:

          1,052.22 msec task-clock                       #    0.998 CPUs utilized             
                23      context-switches                 #   21.858 /sec                      
                12      cpu-migrations                   #   11.404 /sec                      
                60      page-faults                      #   57.022 /sec                      
    10,015,198,766      instructions                     #    2.08  insn per cycle            
                                                  #    0.00  stalled cycles per insn   
     4,803,504,602      cycles                           #    4.565 GHz                       
        20,705,659      stalled-cycles-frontend          #    0.43% frontend cycles idle      
     2,503,079,267      branches                         #    2.379 G/sec                     
           396,228      branch-misses                    #    0.02% of all branches

With -O3 it manages to fully unroll the loop removing it entirely and producing:

    1078:       e8 d3 ff ff ff          call   1050 <strtoll@plt>
}
    107d:       5a                      pop    %rdx
    107e:       c3                      ret

to is it smart enough to just return the return value from strtoll directly as is in rax.

c/inc_loop.c

#include <stdlib.h>

int main(int argc, char **argv) {
    unsigned long long max;
    if (argc > 1) {
        max = strtoll(argv[1], NULL, 0);
    } else {
        max = 1;
    }
    unsigned long long ret;
    for (ret = 0; ret < max; ret++) {}
    return ret;
}

c/inc_loop_asm.c

Words: 47

This is the only way that we've managed to reliably get a single inc instruction loop, by using inline assembly, e.g. on we do x86:

loop:
  inc %[i];
  cmp %[max], %[i];
  jb loop;

For 1s on P14s Ubuntu 25.04 GCC 14.2 -O0 x86_64 we need about 5 billion:

time ./inc_loop_asm.out 5000000000

c/inc_loop_asm.c

#include <stdlib.h>
#include <stdint.h>

int main(int argc, char **argv) {
    uint64_t max, i;
    if (argc > 1) {
        max = strtoll(argv[1], NULL, 0);
    } else {
        max = 1;
    }
    i = max;
#if defined(__x86_64__) || defined(__i386__)
    __asm__ (
        "loop:"
        "dec %[i];"
        "jne loop;"
        : [i] "+r" (i)
        :
        :
    );
#endif
    return i;
}

c/inc_loop_asm_n.sh

Words: 115

This is a quick Microarchitectural benchmark to try and determine how many functional units our CPU has that can do an inc instruction at the same time due to superscalar architecture.

The generated programs do loops like:

loop:
  inc %[i0];
  inc %[i1];
  inc %[i2];
  ...
  inc %[i_n];
  cmp %[max], %[i0];
  jb loop;

with different numbers of inc instructions.

Figure 2.
c/inc_loop_asm_n.sh results for a few CPUs
.
Quite clearly:
AMD 7840U can run INC on 4 functional units
Intel i7-7820HQ can run INC on 2 functional units
and both have low instruction count effects that destroy performance, AMD at 3 and Intel at 3 and 5. TODO it would be cool to understand those better.
Data from multiple CPUs manually collated and plotted manually with c/inc_loop_asm_n_manual.sh.

Announced at:

c/inc_loop_asm_n.sh was not rendered because it is too large (> 2000 bytes)

c/mul_loop_asm.c

c/mul_loop_asm.c

#include <stdlib.h>
#include <stdint.h>

int main(int argc, char **argv) {
    uint64_t max, i, x0;
    if (argc > 1) {
        max = strtoll(argv[1], NULL, 0);
    } else {
        max = 1;
    }
    i = max;
    x0 = 1;
#if defined(__x86_64__) || defined(__i386__)
    __asm__ (
        "mov %[x0], %%rax;"
        "mov $2, %%rbx;"
        ".align 64;"
        "loop:"
        "mul %%rbx;"
        "dec %[i];"
        "jne loop;"
        "mov %%rax, %[x0];"
        : [i] "+r" (i),
          [x0] "+r" (x0)
        :
        : "rax",
          "rbx",
          "rdx"
    );
#endif
    return x0;
}

c/mul_loop_asm_2.c

c/mul_loop_asm_2.c

#include <stdlib.h>
#include <stdint.h>

int main(int argc, char **argv) {
    uint64_t max, i, x0, x1;
    if (argc > 1) {
        max = strtoll(argv[1], NULL, 0);
    } else {
        max = 1;
    }
    i = max;
    x0 = 1;
    x1 = 1;
#if defined(__x86_64__) || defined(__i386__)
    __asm__ (
        "mov $2, %%rbx;"
        ".align 64;"
        "loop:"

        "mov %[x0], %%rax;"
        "mul %%rbx;"
        "mov %%rax, %[x0];"

        "mov %[x1], %%rax;"
        "mul %%rbx;"
        "mov %%rax, %[x1];"

        "dec %[i];"
        "jne loop;"
        : [i] "+r" (i),
          [x0] "+r" (x0),
          [x1] "+r" (x1)
        :
        : "rax",
          "rbx" ,
          "rdx" 
    );
#endif
    return x0 + x1;
}

c/mul_loop_asm_n.sh

c/mul_loop_asm_n.sh was not rendered because it is too large (> 2000 bytes)

Computer hardware component type

Words: 8k Articles: 158

Processor (computing)

Words: 7k Articles: 101

Instruction set architecture (ISA)

Words: 6k Articles: 62

The main interface between the central processing unit and software.

Assembly language

Words: 18 Articles: 2

A human readable way to write instructions for an instruction set architecture.

One of the topics covered in Ciro Santilli's Linux Kernel Module Cheat.

Assembler (computing)

Articles: 1

GNU Assembler (GNU GAS)

Calling convention

List of instruction set architectures

Words: 6k Articles: 57

List of instruction set architecture.

One instruction set computer (OISC)

stackoverflow.com/questions/3711443/minimal-instruction-set-to-solve-any-problem-with-a-computer-program/38523869#38523869

ARM architecture family

Words: 174

This ISA basically completely dominated the smartphone market of the 2010s and beyond, but it started appearing in other areas as the end of Moore's law made it more economical logical for large companies to start developing their own semiconductor, e.g. Google custom silicon, Amazon custom silicon.

It is exciting to see ARM entering the server, desktop and supercomputer market circa 2020, beyond its dominant mobile position and roots.

Ciro Santilli likes to see the underdogs rise, and bite off dominant ones.

The excitement also applies to RISC-V possibly over ARM mobile market one day conversely however.

Basically, as long as were a huge company seeking to develop a CPU and able to control your own ecosystem independently of Windows' desktop domination (held by the need for backward compatibility with a billion end user programs), ARM would be a possibility on your mind.

in 2020, the Fugaku supercomputer, which uses an ARM-based Fujitsu designed chip, because the number 1 fastest supercomputer in TOP500: www.top500.org/lists/top500/2021/11/
It was later beaten by another x86 supercomputer www.top500.org/lists/top500/2022/06/, but the message was clearly heard.
2012 hackaday.com/2012/07/09/pedal-powered-32-core-arm-linux-server/ pedal-powered 32-core Arm Linux server. A publicity stunt, but still, cool.
AWS Graviton

PowerPC

RISC-V

Words: 169 Articles: 10

The leading no-royalties options as of 2020.

China has been a major RISC-V potential user in the late 2010s, since the country is trying to increase its semiconductor industry independence, especially given economic sanctions imposed by the USA.

E.g. a result of this, the RISC-V Foundation moved its legal headquarters to Switzerland in 2019 to try and overcome some of the sanctions.

 Tagged

WebRISC-V

RISC-V International (RISC-V Foundation)

RISC-V vendor

Words: 41 Articles: 3

Codasip

SiFive

Words: 15

Leading RISC-V consultants as of 2020, they are basically trying to become the Red Hat of the semiconductor industry.

SiPearl

Words: 26

Risky name with the Si prefix, too close to SiFive. Both a reference to silicon no doubt, but still. If they stick they will one day rename.

RISC-V timer

Words: 73 Articles: 1

riscv/timer.S

Words: 73

TODO: the interrupt is firing only once:

www.reddit.com/r/RISCV/comments/ov4vhh/timer_interrupt/

Adapted from: danielmangum.com/posts/risc-v-bytes-timer-interrupts/

Tested on Ubuntu 23.10:

sudo apt install binutils-riscv64-unknown-elf qemu-system-misc gdb-multiarch
cd riscv
make

Then on shell 1:

qemu-system-riscv64 -machine virt -cpu rv64 -smp 1 -s -S -nographic -bios none -kernel timer.elf

and on shell 2:

gdb-multiarch timer.elf -nh -ex "target remote :1234" -ex 'display /i $pc' -ex 'break *mtrap' -ex 'display *0x2004000' -ex 'display *0x200BFF8'

GDB should break infinitel many times on mtrap as interrupts happen.

riscv/timer.S

/* Adapted from: https://danielmangum.com/posts/risc-v-bytes-timer-interrupts/ */

.option norvc
.section .text
.global _start
_start:
    /* MSTATUS.PRIV = 0 */
    li t0, (0b11 << 7)
    csrs mstatus, t0

    /* MTVEC = mtrap
      Where to jump after each timer interrupt. */
    la t0, mtrap
    csrw mtvec, t0

    /* setup timer */
    /* mtime */
    li t1, 0x200BFF8
    lw t0, 0(t1)
    li t2, 50000
    add t0, t0, t2
    /* mtimecmp */
    li t1, 0x2004000
    sw t0, 0(t1)

    /* MSTATUS.MIE = 1 */
    li t0, (1 << 3)
    csrs mstatus, t0

    /* MIE.MTIE = 1 */
    li t0, (1 << 7)
    csrs mie, t0
spin:
    j spin

mtrap:
    /* setup timer */
    /* mtime */
    li t1, 0x200BFF8
    ld t0, 0(t1)
    li t2, 50000
    add t0, t0, t2
    /* mtimecmp */
    li t1, 0x2004000
    sd t0, 0(t1)

    j spin

RISC-V priviledged ISA

Articles: 2

RISC-V MSTATUS register

Articles: 1

RISC-V MSTATUS.MIE field

x86

Words: 6k Articles: 41

x86 Paging Tutorial

Words: 4k Articles: 39

This section is present in another page, follow this link to view it.

x86 custom instructions

Words: 58

Intel is known to have created customized chips for very large clients.

This is mentioned e.g. at: www.theregister.com/2021/03/23/google_to_build_server_socs/

Intel is known to do custom-ish cuts of Xeons for big customers.

Those chips are then used only in large scale server deployments of those very large clients. Google is one of them most likely, given their penchant for Google custom hardware.

TODO better sources.

Y86

Words: 15

esolangs.org/wiki/Y86 mentions:

Y86 is a toy RISC CPU instruction set for education purpose.

One specification at: web.cse.ohio-state.edu/~reeves.92/CSE2421sp13/PracticeProblemsY86.pdf

 Tagged

y86.js.org

Type of processor

Words: 723 Articles: 37

Central processing unit (CPU)

Words: 469 Articles: 20

Arithmetic logic unit

Microcontroller

Words: 59 Articles: 2

As of 2020's, it is basically a cheap/slow/simple CPU used in embedded system applications.

 Tagged

Raspberry Pi Pico

MicroPython

Words: 47

It is interpreted. It actually implements a Python (-like ?) interpreter that can run on a microcontroller. See e.g.: Compile MicroPython code for Micro Bit locally.

As a result, it is both very convenient, as it does not require a C toolchain to build for, but also very slow and produces larger images.

 Tagged

Program Raspberry Pi Pico W with C
Program Raspberry Pi Pico W with MicroPython

Microcontroller vs CPU

CPU architecture

Words: 82 Articles: 11

Superscalar processor

Articles: 1

CPU functional unit

Instruction pipelining

Words: 82 Articles: 8

The first thing you must understand is the Classic RISC pipeline with a concrete example.

Educational CPU microarchitecture simulator

Words: 70 Articles: 4

freess

JavaScript CPU microarchitecture simulator

Words: 70 Articles: 2

y86.js.org

Words: 57

The good:

slick UI! But very hard to read characters, they're way too small.
attempts to show state diffs with a flash. But it goes by too fast, would be better if it were more permanent
Reverse debugging

The bad:

educational ISA
unclear what flags mean from UI, no explanation on hover. Likely the authors assume knowledge of the Y86 book.

WebRISC-V

Words: 13

webriscv.dii.unisi.it/

The good:

Reverse debugging
circuit diagram

The bad:

Clunky UI
circuit diagram doesn't show any state??

Hazard (computer architecture)

Articles: 1

Pipeline stall

Classic RISC pipeline

Microprocessor

Words: 5

Basically a synonym for central processing unit nowadays: electronics.stackexchange.com/questions/44740/whats-the-difference-between-a-microprocessor-and-a-cpu

CPU feature

Words: 323 Articles: 2

Trusted execution environment

Words: 323 Articles: 1

Software Guard Extensions (Intel SGX)

Words: 323

The hole point of Intel SGX is to allow users to be certain that a certain code was executed in a remove server that they rent but don't own, like AWS. Even if AWS wanted to be malicious, they would still not be able to modify your read your input, output nor modify the program.

The way this seems to work is as follows.

Each chip has its own unique private key embedded in the chip. There is no way for software to read that private key, only the hardware can read it, and Intel does not know that private key, only the corrsponding public one. The entire safety of the system relies on this key never ever leaking to anybody, even if they have the CPU in their hands. A big question is if there are physical forensic methods, e.g. using electron microscopes, that would allow this key to be identified.

Then, using that private key, you can create enclaves.

Once you have an enclave, you can load a certain code to run into the enclave.

Then, non-secure users can give inputs to that enclave, and as an output, they get not only the output result, but also a public key certificate based on the internal private key.

This certificates states:

given input X
program Y
produced output Z

and that can then be verified online on Intel's website, since they keep a list of public keys. This service is called attestation.

So, if the certificate is verified, you can be certain that a your input was ran by a specific code.

Additionally:

you can public key encrypt your input to the enclave with the public key, and then ask the enclave to send output back encrypted to your key. This way the hardware owner cannot read neither the input not the output
all data stored on RAM is encrypted by the enclave, to prevent attacks that rely on using a modified RAM that logs data

Field-programmable gate array (FPGA)

Words: 171 Articles: 2

It basically replaces a bunch of discrete digital components with a single chip. So you don't have to wire things manually.

Particularly fundamental if you would be putting those chips up a thousand cell towers for signal processing, and ever felt the need to reprogram them! Resoldering would be fun, would it? So you just do a over the wire update of everything.

Vs a microcontroller: same reason why you would want to use discrete components: speed. Especially when you want to do a bunch of things in parallel fast.

One limitation is that it only handles digital electronics: electronics.stackexchange.com/questions/25525/are-there-any-analog-fpgas There are some analog analogs, but they are much more restricted due to signal loss, which is exactly what digital electronics is very good at mitigating.

Video 10.

First FPGA experiences with a Digilent Cora Z7 Xilinx Zynq by Marco Reps (2018)

Source. Good video, actually gives some rationale of a use case that a microcontroller wouldn't handle because it is not fast enough.

Video 11.

FPGA Dev Board Tutorial by Ben Heck (2016)

Source.

Video 12.

The History of the FPGA by Asianometry (2022)

Source.

FPGA company

Articles: 1

Xilinx (1984-2022)

Graphics processing unit (GPU)

Words: 66 Articles: 8

General-purpose computing on graphics processing units (GPGPU)

Words: 66 Articles: 7

Open source GPU compute benchmark

Words: 4

github.com/ekondis/mixbench GPL
github.com/ProjectPhysX/OpenCL-Benchmark custom non-commercial, non-military license

GPU compute library

Words: 62 Articles: 5

CUDA

Words: 1 Articles: 1

CUDA hello world

Words: 1

Example: github.com/cirosantilli/cpp-cheat/blob/d18a11865ac105507d036f8f12a457ad9686a664/cuda/inc.cu

OpenCL

ROCm

Words: 61 Articles: 1

Official hello world: github.com/ROCm/HIP-Examples/blob/ff8123937c8851d86b1edfbad9f032462c48aa05/HIP-Examples-Applications/HelloWorld/HelloWorld.cpp

ROCm on Ubuntu

Words: 58

Tested on Ubuntu 23.10 with P14s:

sudo apt install hipcc
git clone https://github.com/ROCm/HIP-Examples
cd HIP-Examples/HIP-Examples-Applications/HelloWorld
make

TODO fails with:

/bin/hipcc -g   -c -o HelloWorld.o HelloWorld.cpp
clang: error: cannot find ROCm device library for gfx1103; provide its path via '--rocm-path' or '--rocm-device-lib-path', or pass '-nogpulib' to build without ROCm device library
make: *** [<builtin>: HelloWorld.o] Error 1

Generic Ubuntu install bibliograpy:

AI accelerator

Words: 17 Articles: 3

Video 13.

The Coming AI Chip Boom by Asianometry (2022)

Source.

Amazon AI accelerator silicon

Words: 10

2020: Traininum in 2020, e.g. techcrunch.com/2020/12/01/aws-launches-trainium-its-new-custom-ml-training-chip/
2018: AWS Inferentia, mentioned at en.wikipedia.org/wiki/Annapurna_Labs

Tensor Processing Unit (TPU, 2015, Google AI accelerator)

Tesla Dojo (2022)

I/O device

Words: 471 Articles: 55

 Tagged

Computer keyboard
Computer mouse

Punched card

Words: 76 Articles: 1

Served as both input, output and storage system in the eary days!

Video 14.

1964 IBM 029 Keypunch Card Punching Demonstration by CuriousMarc (2014)

Source.

Video 15.

Using Punch Cards by Bubbles Whiting (2016)

Source. Interview at the The Centre for Computing History.

Video 16.

Once Upon A Punched Card by IBM (1964)

Source. Goes on and on a bit too long. But cool still.

 Tagged

Jacquard machine

Hollerith tabulating machine

Words: 26

Video 17.

The 1890 US Census and the history of punchcard computing by Stand-up Maths (2020)

Source. It was basically a counting machine! Shows a reconstruction at the Computer History Museum.

Computer input device

Computer data storage

Words: 178 Articles: 25

Computer data storage software

Articles: 6

Filesystem

Articles: 5

Clustered file system

Articles: 2

9P (protocol)

Network File System

Computer file

Articles: 1

File signature

 Tagged

JPEG file signature

Computer data storage hardware

Words: 178 Articles: 17

Tape drive (1950s-)

Words: 51

One of the most enduring forms of storage! Started in the 1950s, but still used in the 2020s as the cheapest (and slowest access) archival method. Robot arms are needed to load and read them nowadays.

Video 18.

Web camera mounted insite an IBM TS4500 tape library by lkaptoor (2020)

Source. Footage dated 2018.

Volatile memory

Words: 20 Articles: 6

Random-access memory (RAM)

Words: 20 Articles: 5

In conventional speech of the early 2000's, is basically a synonym for dynamic random-access memory.

Static random-access memory (SRAM)

Dynamic random-access memory (DRAM)

Words: 7 Articles: 2

DRAM is often shortened to just random-access memory.

Synchronous dynamic random-access memory (SDRAM)

Articles: 1

DDR SDRAM (DDR SDRAM)

Magnetoresistive RAM (MRAM)

Non-volatile memory

Words: 77 Articles: 6

The opposite of volatile memory.

 Tagged

Magnetoresistive RAM

Disk storage

Articles: 2

Disk read-and-write head

Articles: 1

Magnetoresistive disk head

Optical storage

Solid-state storage (SSD)

Words: 73 Articles: 1

Erase SSD securely

Words: 73

You can't just shred individual sSD files because SSD writes only at large granularities, so hardware/drivers have to copy stuff around all the time to compact it. This means that leftover copies are left around everywhere.

What you can do however is to erase the entire thing with vendor support, which most hardware has support for. On hardware encrypted disks, you can even just erase the keys:

TODO does shredding the

Solid-state drive (SSD)

Words: 30 Articles: 1

Flash memory

Words: 30

Video 19.

The Engineering Puzzle of Storing Trillions of Bits in your Smartphone / SSD using Quantum Mechanics by Branch Education (2020)

Source. Nice animations show how quantum tunnelling is used to set bits in flash memory.

Peripheral

Words: 217 Articles: 25

Computer mouse

Computer keyboard

Words: 18 Articles: 6

Keyboard layout

Words: 7 Articles: 2

QWERTY

Dvorak keyboard layout

Words: 7

Dvorak users will automatically go to Heaven.

Computer keyboard model

Words: 11 Articles: 2

Kinesis Advantage keyboard

Kinesis Advantage 2 keyboard

Words: 11

kinesis-ergo.com/shop/advantage2/

For Ciro Santilli, this is not a computer keyboard. It is a fetish.

Display device

Words: 138 Articles: 6

 Tagged

Punched card

Blinkenlights

E Ink

Words: 121 Articles: 3

Electronic Ink such as that found on Amazon Kindle is the greatest invention ever made by man.

Once E Ink reaches reasonable refresh rates to replace liquid crystal displays, the world will finally be saved.

It would allow Ciro Santilli to spend his entire life in front of a screen rather in the real world without getting tired eyes, and even if it is sunny outside.

Ciro stopped reading non-code non-news a while back though, so the current refresh rates are useless, what a shame.

OMG, this is amazing: getfreewrite.com/

Amazon Kindle

Words: 5

PDF table of contents feature requests: twitter.com/cirosantilli/status/1459844683925008385

Remarkable (tablet)

Words: 29 Articles: 1

Remarkable 2 is really, really good. Relatively fast refresh + touchscreen is amazing.

No official public feedback forum unfortunately:

PDF table of contents could be better: twitter.com/cirosantilli/status/1459844683925008385

Remarkable 2

Words: 6

Display size: 10.3 inches. Perfect size

Teleprinter

Words: 17

Way, way before instant messaging, there was... teletype!

Video 20.

Using a 1930 Teletype as a Linux Terminal by CuriousMarc (2020)

Source.

Webcam

Peripheral interface

Words: 61 Articles: 8

PCI

Words: 61 Articles: 4

Video 21.

PCIe computer explained by ExplainingComputers (2018)

Source.

PCIe

lspci

Words: 56 Articles: 2

lspci is the name of several versions of CLI tools used in UNIX-like systems to query information about PCI devices in the system.

On Ubuntu 23.10, it is provided by the pciutils package, which is so dominant that when we say "lspci" without qualitication, that's what we mean.

pciutils

Words: 5

Sotware project that provides lspci.

Get vendor and device ID for each PCI device

Words: 8

stackoverflow.com/questions/59010671/how-to-get-vendor-id-and-device-id-of-all-pci-devices

grep PCI_ID /sys/bus/pci/devices/*/uevent

lspci is missing such basic functionality!

USB

Articles: 2

USB Micro-B

USB-C

Computer form factor

Words: 2k Articles: 73

 Tagged

Embedded system

Distributed computing

Words: 173 Articles: 5

Fog computing

Words: 173 Articles: 4

Our definition of fog computing: a system that uses the computational resources of individuals who volunteer their own devices, in which you give each of the volunteers part of a computational problem that you want to solve.

Folding@home and SETI@home are perfect example of that definition.

 Tagged

NuNET

Charity Engine

Folding@home

SETI@home

Is fog computing more efficient than cloud computing?

Words: 129

Advantages of fog: there is only one, reusing hardware that would be otherwise idle.

Disadvantages:

in cloud, you can put your datacenter on the location with the cheapest possible power. On fog you can't.
on fog there is some waste due to network communication.
you will likely optimize code less well because you might be targeting a wide array of different types of hardware, so more power (and time) wastage. Furthermore, some of the hardware used will not not be optimal for the task, e.g. CPU instead of GPU.

All of this makes Ciro Santilli doubtful if it wouldn't be more efficient for volunteers simply to donate money rather than inefficient power usage.

Bibliography:

greenfoldingathome.com/2018/05/28/is-foldinghome-a-waste-of-electricity/: useless article, does not compare to centralize, asks if folding the proteins is worth the power usage...

Mainframe computer

Cloud computing

Words: 1k Articles: 33

Hyperscale computing

Words: 40

Basically means "company with huge server farms, and which usually rents them out like Amazon AWS or Google Cloud Platform

Figure 5.
Global electricity use by data center type: 2010 vs 2018
. Source. The growth of hyperscaler cloud vs smaller cloud and private deployments was incredible in that period!

Cloud computing platform

Words: 894 Articles: 24

Amazon Web Services

Words: 894 Articles: 21

 Tagged

AWS Elastic Beanstalk

aws-cli

AWS service

Words: 894 Articles: 19

Amazon Athena

Words: 1

Google BigQuery alternative.

Amazon Redshift

Amazon S3

Words: 9 Articles: 1

Browse S3 bucket on web browser

Words: 9

They can't even make this basic stuff just work!

stackoverflow.com/questions/16784052/access-files-stored-on-amazon-s3-through-web-browser

Amazon Elastic Compute Cloud (Amazon EC2)

Words: 884 Articles: 14

Amazon EC2 HOWTO

Words: 707 Articles: 2

Amazon EC2 hello world

Words: 142

Let's get SSH access, instal a package, and run a server.

As of December 2023 on a t2.micro instance, the only one part of free tier at the time with advertised 1 vCPU, 1 GiB RAM, 8 GiB disk for the first 12 months, on Ubuntu 22.04:

$ free -h
               total        used        free      shared  buff/cache   available
Mem:           949Mi       149Mi       210Mi       0.0Ki       590Mi       641Mi
Swap:             0B          0B          0B
$ nproc
1
$ df -h /
Filesystem      Size  Used Avail Use% Mounted on
/dev/root       7.6G  1.8G  5.8G  24% /

To install software:

sudo apt update
sudo apt install cowsay
cowsay asdf

Once HTTP inbound traffic is enabled on security rules for port 80, you can:

while true; do printf "HTTP/1.1 200 OK\r\n\r\n`date`: hello from AWS" | sudo nc -Nl 80; done

and then you are able to curl from your local computer and get the response.

Amazon EC2 GPU

Words: 565

As of December 2023, the cheapest instance with an Nvidia GPU is g4nd.xlarge, so let's try that out. In that instance, lspci contains:

00:1e.0 3D controller: NVIDIA Corporation TU104GL [Tesla T4] (rev a1)

so we see that it runs a Nvidia T4 GPU.

Be careful not to confuse it with g4ad.xlarge, which has an AMD GPU instead. TODO meaning of "ad"? "a" presumably means AMD, but what is the "d"?

Some documentation on which GPU is in each instance can seen at: docs.aws.amazon.com/dlami/latest/devguide/gpu.html (archive) with a list of which GPUs they have at that random point in time. Can the GPU ever change for a given instance name? Likely not. Also as of December 2023 the list is already outdated, e.g. P5 is now shown, though it is mentioned at: aws.amazon.com/ec2/instance-types/p5/

When selecting the instance to launch, the GPU does not show anywhere apparently on the instance information page, it is so bad!

Also note that this instance has 4 vCPUs, so on a new account you must first make a customer support request to Amazon to increase your limit from the default of 0 to 4, see also: stackoverflow.com/questions/68347900/you-have-requested-more-vcpu-capacity-than-your-current-vcpu-limit-of-0, otherwise instance launch will fail with:

You have requested more vCPU capacity than your current vCPU limit of 0 allows for the instance bucket that the specified instance type belongs to. Please visit aws.amazon.com/contact-us/ec2-request to request an adjustment to this limit.

When starting up the instance, also select:

image: Ubuntu 22.04
storage size: 30 GB (maximum free tier allowance)

Once you finally managed to SSH into the instance, first we have to install drivers and reboot:

sudo apt update
sudo apt install nvidia-driver-510 nvidia-utils-510 nvidia-cuda-toolkit
sudo reboot

and now running:

nvidia-smi

shows something like:

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 525.147.05   Driver Version: 525.147.05   CUDA Version: 12.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Tesla T4            Off  | 00000000:00:1E.0 Off |                    0 |
| N/A   25C    P8    12W /  70W |      2MiB / 15360MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

If we start from the raw Ubuntu 22.04, first we have to install drivers:

From there basically everything should just work as normal. E.g. we were able to run a CUDA hello world just fine along:

nvcc inc.cu
./a.out

One issue with this setup, besides the time it takes to setup, is that you might also have to pay some network charges as it downloads a bunch of stuff into the instance. We should try out some of the pre-built images. But it is also good to know this pristine setup just in case.

We then managed to run Ollama just fine with:

curl https://ollama.ai/install.sh | sh
/bin/time ollama run llama2 'What is quantum field theory?'

which gave:

0.07user 0.05system 0:16.91elapsed 0%CPU (0avgtext+0avgdata 16896maxresident)k
0inputs+0outputs (0major+1960minor)pagefaults 0swaps

so way faster than on my local desktop CPU, hurray.

After setup from: askubuntu.com/a/1309774/52975 we were able to run:

head -n1000 pap.txt | ARGOS_DEVICE_TYPE=cuda time argos-translate --from-lang en --to-lang fr > pap-fr.txt

which gave:

77.95user 2.87system 0:39.93elapsed 202%CPU (0avgtext+0avgdata 4345988maxresident)k
0inputs+88outputs (0major+910748minor)pagefaults 0swaps

so only marginally better than on P14s. It would be fun to see how much faster we could make things on a more powerful GPU.

Amazon Machine Image (AMI)

Words: 29 Articles: 2

List of AWS AMIs

Words: 29 Articles: 1

AWS Deep Learning Base GPU AMI (Ubuntu 20.04)

Words: 29

These come with pre-installed drivers, so e.g. nvidia-smi just works on them out of the box, tested on g5.xlarge which has an Nvidia A10G GPU. Good choice as a starting point for deep learning experiments.

Amazon Elastic Block Store

Words: 36 Articles: 1

Laucnh Amazin EC2 with existing EBS volume (Amazon EBS)

Words: 36

Not possible directly without first creating an AMI image from snapshot? So annoying!

The hot and more expensive sotorage for Amazon EC2, where e.g. your Ubuntu filesystem will lie.

The cheaper and slower alternative is to use Amazon S3.

EC2 instance store volume

Words: 39

Large but ephemeral storage for EC2 instances. Predetermined by the EC2 instance type. Stays in the local server disk. Not automatically mounted.

docs.aws.amazon.com/AWSEC2/latest/UserGuide/InstanceStorage.html (archive) notably highlights what it persists, which is basically nothing
serverfault.com/questions/433703/how-to-use-instance-store-volumes-storage-in-amazon-ec2 mentions that you have to mount it

vCPU

EC2 instance type

Words: 73 Articles: 3

Amazon's informtion about their own intances is so bad and non-public that this was created: instances.vantage.sh/

g4ad.xlarge

Words: 5

AMD GPUs as mentioned at: aws.amazon.com/ec2/instance-types/g4/

g4nd.xlarge

Words: 48

1 Nvidia T4 GPU, 4 vCPUs.

Mentioned at: aws.amazon.com/ec2/instance-types/g4/

TODO meaning of "nd"? "n" presumably means Nvidia, but what is the "d"? Compare it g4ad.xlarge which has AMD GPUs. aws.amazon.com/ec2/instance-types/g4/ mentions:

G4 instances are available with a choice of NVIDIA GPUs (G4dn) or AMD GPUs (G4ad).

Price:

2025-03-10: 0.526 USD / Hour

g5.xlarge

Words: 5

1 Nvidia A10G GPU, 4 vCPUs.

Alibaba Cloud

Google Cloud Platform (GCP)

Type of cloud computing

Words: 386 Articles: 5

Infrastructure as a service (IaaS)

Words: 32

You SSH into a an OS like Ubuntu and do whatever you want from there. E.g. Amazon EC2.

The OS is usually virualized, and you get only a certain share of the CPU by default.

Platform as a service (PaaS)

Words: 354 Articles: 3

Highly managed, you don't even see the Docker images, only some higher level JSON configuration file.

These setups are really convenient and cheap, and form a decent way to try out a new website with simple requirements.

 Tagged

Amazon Elastic Compute Cloud

AWS Elastic Beanstalk

Heroku

Words: 319 Articles: 1

This feels good.

One problem though is that Heroku is very opinionated, a likely like other PaaSes. So if you are trying something that is slightly off the mos common use case, you might be fucked.

Another problem with Heroku is that it is extremely difficult to debug a build that is broken on Heroku but not locally. We needed a way to be able to drop into a shell in the middle of build in case of failure. Otherwise it is impossible.

Deployment:

git push heroku HEAD:master

View stdout logs:

heroku logs --tail

PostgreSQL database, it seems to be delegated to AWS. How to browse database: stackoverflow.com/questions/20410873/how-can-i-browse-my-heroku-database

heroku pg:psql

Drop and recreate database:

heroku pg:reset --confirm <app-name>

All tables are destroyed.

Restart app:

heroku restart

Send free emails from Heroku

Words: 196

Arghh, why so hard... tested 2021:

SendGrid: this one is the first one I got working on free tier!
Mailgun: the Heroku add-on creates a free plan. This is smaller than the flex plan and does not allow custom domains, and is not available when signing up on mailgun.com directly: help.mailgun.com/hc/en-us/articles/203068914-What-Are-the-Differences-Between-the-Free-and-Flex-Plans- And without custom domains you cannot send emails to anyone, only to people in the 5 manually whitelisted list, thus making this worthless. Also, gmail is not able to verify the DNS of the sandbox emails, and they go to spam.
Mailgun does feel good otherwise if you are willing to pay. Their Heroku integration feels great, exposes everything you need on environment variables straight away.
CloudMailin: does not feel as well developed as Mailgun. More focus on receiving. Tried adding TXT xxx._domainkey.ourbigbook.com and CNAME mta.ourbigbook.com entires with custom domain to see if it works, took forever to find that page... www.cloudmailin.com/outbound/domains/xxx Domain verification requires a bit of human contact via email.
They also don't document their Heroku usage well. The envvars generated on Heroku are useless, only to login on their web UI. The send username and password must be obtained on their confusing web ui.

High performance computing

Words: 286 Articles: 21

Job scheduler

Words: 223 Articles: 12

Borg (cluster manager)

IBM Spectrum LSF (LSF)

Words: 223 Articles: 10

LSF get version

Words: 13

Most/all commands have the -V option which prints the version, e.g.:

bsub -V

LSF command

Words: 210 Articles: 8

bsub

Words: 190 Articles: 4

Submit a new job. The most important command!

Docs: www.ibm.com/docs/en/spectrum-lsf/10.1.0?topic=bsub-options

bsub get job stdout and stderr

Words: 142

By default, LSF only sends you an email with the stdout and stderr included in it, and does not show or store anything locally.

One option to store things locally is to use:

bsub -oo stdout.log -eo stderr.log 'echo myout; echo myerr 1>&2'

as documented at:

Or to use files with the job id in them:

bsub -oo %J.out -eo %J.err 'echo myout; echo myerr 1>&2'

By default bsub -oo:

also contains the LSF metadata in addition to the actual submitted process stdout
prevents the completion email from being sent

To get just the stdout to the file, use bsub -N -oo which:

stores only stdout on the file
re-enables the completion email

as mentioned at:

Another option is to run with the bsub -I option:

bsub -I 'echo a;sleep 1;echo b;sleep 1;echo c'

This immediately prints stdout and stderr to the terminal.

bsub on foreground

Words: 39

Run bsub on foreground, show stdout on host stdout live with an interactive with the bsub -I option:

bsub -I 'echo a;sleep 1;echo b;sleep 1;echo c'; echo done

Ctrl + C kills the job on remote as well as locally.

Bibliography:

superuser.com/questions/46312/wait-for-one-or-all-lsf-jobs-to-complete

bsub option

bsub `-I` option

www.ibm.com/docs/en/spectrum-lsf/10.1.0?topic=options-i

bpeek

Words: 10

View stdout/stderr of a running job.

Documented at: www.ibm.com/docs/en/spectrum-lsf/10.1.0?topic=reference-bpeek

Documented at:

www.bsc.es/support/LSF/9.1.2/lsf_command_ref/index.htm?bpeek.1.html~main

bkill

Words: 4

Kill jobs.

Documented at: www.ibm.com/docs/en/spectrum-lsf/10.1.0?topic=reference-bkill

bkill all jobs

Words: 6

By the current user:

bkill 0

Slurm Workload Manager (SLURM)

Supercomputer

Words: 63 Articles: 6

Some good insights on the earlier history of the industry at: The Supermen: The Story of Seymour Cray by Charles J. Murray (1997).

Exascale computing

Words: 15

The scale where human brain simulation becomes possible according to some estimates.

First publicly reached by Frontier.

TOP500

Supercomputer by owner

Articles: 2

Oak Ridge supercomputer

Articles: 1

Frontier (supercomputer)

Personal computer

Words: 68 Articles: 7

Laptop

Desktop computer

Mobile phone

Words: 68 Articles: 4

History of mobile phone

The first application of mobile phones was in motor vehicles

Words: 68

Early models were heavy and not practical for people to carry them, so the main niche they initially filled was being carried in motor vehicles, notably trucks where drivers are commercially driving all day long.

It also helps in the case of trucks that you only need to cover a one-dimensional region of the main roads.

For example, this niche was the original entry point of companies such as:

Smartphone

 Tagged

Fingerprint imaging with smartphone

Mobile app

Workstation

 Tagged

Sun Microsystems

Computer manufacturer

Words: 1k Articles: 27

This section is about companies that integrate parts and software from various other companies to make up fully working computer systems.

Dell

Lenovo

Words: 330 Articles: 2

Their websites a bit shitty, clearly a non cohesive amalgamation of several different groups.

E.g. you have to create several separate accounts, and different regions have completely different accounts and websites.

The Europe replacement part website for example is clearly made by a third party called flex.com/ and has Flex written all over it, and the header of the home page has a slightly broken but very obviously broken CSS. And you can't create an account without a VAT number... and they confirmed by email that they don't sell to non-corporate entities without a VAT number. What a bullshit!

ThinkPad

Words: 233 Articles: 1

This is Ciro Santilli's favorite laptop brand. He's been on it since the early 2010's after he saw his then-girlfriend-later-wife using it.

Ciro doesn't know how to explain it, but ThinkPads just feel... right. The screen, the keyboard, the lid, the touchpad are all exactly what Ciro likes.

The only problem with ThinkPad is that it is owned by Lenovo which is a Chinese company, and that makes Ciro feel bad. But he likes it too much to quit... what to do?

Ciro is also reassured to see that in every enterprise he's been so far as of 2020, ThinkPads are very dominant. And the same when you see internal videos from other big tech enterprises, all those nerds are running... Ubuntu on ThinkPads! And the ISS.

Those nerds like their ThinkPads so much, that Ciro has seen some acquaintances with crazy old ThinkPad machines, missing keyboard buttons or the like. They just like their machines that much.

ThinkPads are are also designed for repairability, and it is easy to buy replacement parts, and there are OEM part replacement video tutorials: www.youtube.com/watch?v=vseFzFFz8lY No visible planned obsolescence here! With the caveat that the official online part stores can be shit as mentioned at Section "Lenovo".

Further more, in 2020 Lenovo is announced full certification for Ubuntu www.forbes.com/sites/jasonevangelho/2020/06/03/lenovos-massive-ubuntu-and-red-hat-announcement-levels-up-linux-in-2020/#28a8fd397ae0 which fantastic news!

The only thing Ciro never understood is the trackpoint: superuser.com/questions/225059/how-to-get-used-of-trackpoint-on-a-thinkpad Why would you use that with such an amazing touchpad? And vimium.

ThinkPad series

www.reddit.com/r/thinkpad/comments/crw08i/series_differences_t_vs_x_vs_p_vs_e_vs_etc/

Raspberry Pi Foundation

Words: 651 Articles: 22

Raspberry Pi Foundation project

Words: 651 Articles: 21

Raspberry Pi OS

Words: 11

Change password without access:

raspberrypi.stackexchange.com/questions/24770/change-reset-password-without-monitor-keyboard

Enable SSH on boot:

sudo touch /boot/ssh

Raspberry Pi (2012)

Words: 640 Articles: 19

Raspberry Pi 1

Raspberry Pi 2

Words: 6

Model B V 1.1.

SoC: BMC2836

www.raspberrypi.org/products/raspberry-pi-2-model-b/

Raspberry Pi 3

Words: 12

Model B V 1.2.

SoC: BCM2837

Serial from cat /proc/cpuinfo: 00000000c77ddb77

Raspberry Pi Pico (2021)

Words: 622 Articles: 15

Some key specs:

SoC:
- name: RP2040. Custom designed by Raspberry Pi Foundation, likely the first they make themselves rather than using a Broadcom chip. But the design still is closed source, likely wouldn't be easy to open source due to the usage of closed proprietary IP like the ARM
- dual core ARM Cortex-M0+
- frequency: 2 kHz to 133 MHz, 125 MHz by default
- memory: 264KB on-chip SRAM
GPIO voltage: 3.3V

Datasheet: datasheets.raspberrypi.com/pico/pico-datasheet.pdf

Raspberry Pi Pico variant (2022)

Words: 558 Articles: 14

Raspberry Pi Pico H

Words: 17

Has Serial wire debug debug. Why would you ever get one without unless you are a clueless newbie like Ciro Santilli?!?!

Raspberry Pi Pico W (2022)

Words: 541 Articles: 12

Datasheet: datasheets.raspberrypi.com/picow/pico-w-datasheet.pdf

Raspberry Pi Pico W UART

Words: 62

You can connect form an Ubuntu 22.04 host as:

screen /dev/ttyACM0 115200

When in screen, you can Ctrl + C to kill main.py, and then execution stops and you are left in a Python shell. From there:

Ctrl + D: reboots
Ctrl + A K: kills the GNU screen window. Execution continues normally

but be aware of: Raspberry Pi Pico W freezes a few seconds after after screen disconnects from UART.

Other options:

ampy run command, which solves How to run a MicroPython script from a file on the Raspberry Pi Pico W from the command line?

Program Raspberry Pi Pico W with MicroPython

Words: 281 Articles: 9

How to run a MicroPython script from a file on the Raspberry Pi Pico W from the command line?

Words: 31

The first/only way Ciro could find was with ampy: stackoverflow.com/questions/74150782/how-to-run-a-micropython-host-script-file-on-the-raspbery-pi-pico-from-the-host/74150783#74150783 That just worked and it worked perfectly!

python3 -m pip install --user adafruit-ampy
ampy --port /dev/ttyACM0 run blink.py

TODO: possible with rshell?

MicroPython connection tool

Words: 23 Articles: 3

ampy

Words: 11

Source: github.com/scientifichackers/ampy

Install on Ubuntu 22.04:

python3 -m pip install --user adafruit-ampy

Bibliography:

www.digikey.co.uk/en/maker/projects/micropython-basics-load-files-run-code/fb1fcedaf11e4547943abfdd8ad825ce

rshell

Words: 12 Articles: 1

github.com/dhylands/rshell

How to exit from repl in rshell?

Words: 12

Ctrl + X. Documented by running help repl from the main shell.

Raspberry Pi Pico W freezes a few seconds after after screen disconnects from UART

Program Raspberry Pi Pico W with MicroPython code from the command line

Words: 3

stackoverflow.com/questions/66183596/how-can-you-make-a-micropython-program-on-a-raspberry-pi-pico-autorun/74078142#74078142

Examples at: Raspberry Pi Pico W MicroPython example.

Program the Raspberry Pi Pico W with MicroPython from Thonny

Words: 3

stackoverflow.com/questions/66183596/how-can-you-make-a-micropython-program-on-a-raspberry-pi-pico-autorun/74078142#74078142

Examples at: Raspberry Pi Pico W MicroPython example.

Raspberry Pi Pico W MicroPython example

Words: 221

An upstream repo at: github.com/raspberrypi/pico-micropython-examples

Our examples at: rpi-pico-w/upython.

The examples can be run as described at Program Raspberry Pi Pico W with MicroPython.

rpi-pico-w/upython/blink.py: blink on-board LED. Note that they broke the LED hello world compatibility from non-W to W for God's sake!!!
rpi-pico-w/upython/led_on.py: turn on-board LED on and leave it on forever
rpi-pico-w/upython/uart.py: has automatic UART via USB. Any print() command ends up on the Raspberry Pi Pico W UART! Is is just like with Micro Bit, must be a standard Micro Python thing. The onboard LED is blinked as a heartbeat.
rpi-pico-w/upython/blink_gpio.py: toggle GPIO pin 0 on and off twice a second. Also toggle the on-board LED and print to UART for correlation. You can see this in action e.g. by linking an LED between pin 0 and one of the GND pins of the Pi, and the LED will blink.
rpi-pico-w/upython/pwm.py: pulse width modulation. Using the same circuit as the rpi-pico-w/upython/blink_gpio.py example, you will now see the external LED go from dark to bright continuously and then back
rpi-pico-w/upython/adc.py: analog-to-digital converter. The program prints to the UART the value of the ADC on GPIO 26 once every 0.2 seconds. The onboard LED is blinked as a heartbeat. The hello world is with a potentiometer: extremes on GND and VCC pins of the Pi, and middle output on pin 26, then as you turn the knob, the uart value goes from about 0 to about 64k.

Program Raspberry Pi Pico W with C

Words: 197

Ubuntu 22.04 build just worked, nice! Much feels much cleaner than the Micro Bit C setup:

sudo apt install cmake gcc-arm-none-eabi libnewlib-arm-none-eabi libstdc++-arm-none-eabi-newlib

git clone https://github.com/raspberrypi/pico-sdk
cd pico-sdk
git checkout 2e6142b15b8a75c1227dd3edbe839193b2bf9041
cd ..

git clone https://github.com/raspberrypi/pico-examples
cd pico-examples
git checkout a7ad17156bf60842ee55c8f86cd39e9cd7427c1d
cd ..

export PICO_SDK_PATH="$(pwd)/pico-sdk"
cd pico-exampes
mkdir build
cd build
# Board selection.
# https://www.raspberrypi.com/documentation/microcontrollers/c_sdk.html also says you can give wifi ID and password here for W.
cmake -DPICO_BOARD=pico_w ..
make -j

Then we install the programs just like any other UF2 but plugging it in with BOOTSEL pressed and copying the UF2 over, e.g.:

cp pico_w/blink/picow_blink.uf2 /media/$USER/RPI-RP2/

Note that there is a separate example for the W and non W LED, for non-W it is:

cp blink/blink.uf2 /media/$USER/RPI-RP2/

Also tested the UART over USB example:

cp hello_world/usb/hello_usb.uf2 /media/$USER/RPI-RP2/

You can then see the UART messages with:

screen /dev/ttyACM0 115200

TODO understand the proper debug setup, and a flash setup that doesn't require us to plug out and replug the thing every two seconds. www.electronicshub.org/programming-raspberry-pi-pico-with-swd/ appears to describe it, with SWD to do both debug and flash. To do it, you seem need another board with GPIO, e.g. a Raspberry Pi, the laptop alone is not enough.

Semiconductor industry

Words: 752 Articles: 66

Semiconductor industry bibliography

Articles: 1

Crystal Fire: The Birth of the Information Age (1997)

Film about the semiconductor industry

Words: 10 Articles: 1

Halt and Catch Fire (TV series, 2014-2017)

Words: 10

Season 1 was amazing. The others fell off a bit.

Semiconductor company

Words: 736 Articles: 60

This section is about companies that design semiconductors.

For companies that manufature semiconductors, see also: company with a semiconductor fabrication plant.

 Tagged

FPGA company

Acorn Computers

AMD (1969)

Words: 199 Articles: 20

Video 22.

How AMD went from nearly Bankrupt to Booming by Brandon Yen (2021)

Source.

youtu.be/Rtb4mjIACTY?t=118 Buldozer series CPUs was a disaster
youtu.be/Rtb4mjIACTY?t=324 got sued for marketing claims on number of cores vs number of hyperthreads
youtu.be/Rtb4mjIACTY?t=556 Ryzen first gen was rushed and a bit buggy, but it had potential. Gen 2 fixed those.
youtu.be/Rtb4mjIACTY?t=757 Ryzen Gen 3 surpased single thread performance of Intel. Previously Gen 2 had won multicore.

AMD product

Words: 81 Articles: 16

AMD CPU

Words: 65 Articles: 6

They have been masters of second sourcing things for a long time! One can ony imagine the complexity of the Intel cross licensing deals.

Ryzen

Words: 42 Articles: 4

This was the CPU architecure that saved AMD in the 2010's, see also: Video 22. "How AMD went from nearly Bankrupt to Booming by Brandon Yen (2021)"

Ryzen 7

Words: 29 Articles: 3

en.wikichip.org/wiki/amd/ryzen_7

Ryzen 7 microarchitecture

Words: 29 Articles: 2

Each microarchitecture appears to fully specify all core parameters, it feels likely that they just reuse most of all of the RTL, or even pre-synthesize core blobs.

Zen 4

Words: 2 Articles: 1

en.wikichip.org/wiki/amd/microarchitectures/zen_4

AMD 7840U (3.30 GHz - 5.10 GHz, 8 cores - 16 threads, 2023)

Words: 2

Official page: www.amd.com/en/products/processors/laptop/ryzen/7000-series/amd-ryzen-7-7840u.html

Epyc

AMD GPU

Words: 16 Articles: 8

AMD GPU driver

Words: 8 Articles: 1

AMDGPU

Words: 8

Bibliography:

wiki.archlinux.org/title/AMDGPU
gitlab.freedesktop.org/drm/amd an issue tracker
github.com/ROCm/ROCK-Kernel-Driver TODO vs the GitLab?

RDNA

Words: 8 Articles: 2

RDNA 3 (2022)

Words: 8 Articles: 1

gfx1103

Words: 8

Mentioned e.g. at: videocardz.com/newz/amd-begins-rdna3-gfx11-graphics-architecture-enablement-for-llvm-project as being part of RDNA 3.

Radeon

AMD Instinct

ATI Technologies (1985-2006)

AMD employee

Words: 57 Articles: 2

Jerry Sanders (AMD co-founder and CEO until 2002)

Words: 57

Video 23.

AMD Founder Jerry Sanders Interview (2002)

Source. Source: exhibits.stanford.edu/silicongenesis/catalog/hr396zc0393. Fun to watch.

youtu.be/HqWWoaA8pIs?t=779 Newton Minow mandated UHF on all television sets in 1961, and the oscillator needed for the tuner was one of the first major non-military products from Fairchild, the 28918 (?).
youtu.be/HqWWoaA8pIs?t=1053 Fairchild had won the first round of a Minuteman contract, but lost the second one due to poor management

Lisa Su

Arm (company)

Words: 115 Articles: 6

Video 24.

Arm 30 Years On: Episode One by Arm Ltd. (2022)

Source.

Video 25.

Arm 30 Years On: Episode Two by Arm Ltd. (2022)

Source.

Video 26.

Arm 30 Years On: Episode Three by Arm Ltd. (2022)

Source. This one is boring US expansion. Other two are worth it.

 Tagged

ARM architecture family

Allen Wu

Words: 80

www.linkedin.com/in/allenxwu

This situation is the most bizarre thing ever. The dude was fired in 2020, but he refused to be fired, and because he has the company seal, they can't fire him. He is still going to the office as of 2022. It makes one wonder what are the true political causes for this situation. A big warning sign to all companies tring to setup joint ventures in China!

2022 www.reuters.com/technology/arm-china-says-its-ousted-ceo-wu-is-refusing-pack-up-2022-05-05/

Video 27.

ARM Fired ARM China’s CEO But He Won’t Go by Asianometry (2021)

Source.

Arm product

Articles: 4

Arm Artisan

ARM CPU

Articles: 2

ARM Cortex-M

Articles: 1

ARM Cortex-M0+

Broadcom

Cerebras (2015-)

Words: 39

For some reason they attempt to make a single chip on an entire wafer!

They didn't care about MLperf as of 2019: www.zdnet.com/article/cerebras-did-not-spend-one-minute-working-on-mlperf-says-ceo/

2023: www.eetimes.com/cerebras-sells-100-million-ai-supercomputer-plans-8-more/ Cerebras Sells $100 Million AI Supercomputer, Plans Eight More

Video 28.

Cerebras Architecture Deep Dive by Sean Lie

. Source. 2022.

Graphcore

Intel (1968-)

Words: 21 Articles: 13

 Tagged

Intel supercomputer market share

Intel employee

Articles: 2

Intel employee grade

Articles: 1

Intel fellow

 Tagged

Marc Verdiell

Intel hardware

Words: 14 Articles: 7

Intel CPU

Words: 2 Articles: 1

Intel i7-7820HQ (Q1'17, $378.00)

Words: 2

Official page: www.intel.com/content/www/us/en/products/sku/97496/intel-core-i77820hq-processor-8m-cache-up-to-3-90-ghz/specifications.html

Intel GPU

Words: 12 Articles: 4

Intel discrete GPU

Words: 12 Articles: 2

Intel Xe

Intel Arc

Words: 12

Video 29.

Worst We've Tested: Broken Intel Arc GPU Drivers by Gamers Nexus (2022)

Source.

Intel Graphics Technology (Intel integrated GPUs)

Intel department

Words: 7 Articles: 1

Intel Research (Intel Research Lablets)

Words: 7

"Intel Research Lablets", that's a terrible name.

Nvidia

Words: 187 Articles: 11

Open source driver/hardware interface specification??? E.g. on Ubuntu, a large part of the nastiest UI breaking bugs Ciro Santilli encountered over the years have been GPU related. Do you think that is a coincidence??? E.g. ubuntu 21.10 does not wake up from suspend.

Video 30.

Linus Torvalds saying "Nvidia Fuck You" (2012)

Source.

Video 31.

How Nvidia Won Graphics Cards by Asianometry (2021)

Source.

Doom was the first killer app of personal computer 3D graphics! As opposed to professional rendering e.g. for CAD as was supported by Silicon Graphics
youtu.be/TRZqE6H-dww?t=694 they bet on Direct3D
youtu.be/TRZqE6H-dww?t=749 they wrote their own drivers. At the time, most drivers were written by the computer manufacturers. That's insane!

Video 32.

How Nvidia Won AI by Asianometry (2022)

Source.

Software developed by Nvidia

Articles: 1

 Tagged

nvidia-smi

Nvidia GPU

Words: 98 Articles: 8

The list: en.wikipedia.org/wiki/List_of_Nvidia_graphics_processing_unit

Nvidia GPU feature

Words: 1 Articles: 1

Nvidia tensor core

Words: 1

Bibliography:

developer.nvidia.com/blog/programming-tensor-cores-cuda-9/

Nvidia compute GPU

Words: 95 Articles: 5

This section is about Nvidia GPUs that are focused on compute rather than rendering.

Until 2020 these were branded as Nvidia Tesla, but then Nvidia dropped that brand due to confusion with the Tesla Inc. the car maker.^[ref].

Nvidia Tesla (2007-2020)

List of Nvidia compute GPUs

Words: 61 Articles: 3

Nvidia T4 (2018, 65 TFLOPS, 16 GB mem)

Words: 23

Official page: www.nvidia.com/en-gb/data-center/tesla-t4/

According to wccftech.com/nvidia-drops-tesla-brand-to-avoid-confusion-with-tesla/ this was the first card that semi-dropped the "Nvidia Tesla" branding, though it is still visible in several places.

Nvidia A10

Words: 38 Articles: 1

Official page: www.nvidia.com/en-gb/data-center/products/a10-gpu/

Nvidia A10G

Words: 36

According to www.baseten.co/blog/nvidia-a10-vs-a10g-for-ml-model-inference/ the Nvidia A10G is a variant of the Nvidia A10 created specifically for AWS. As such there isn't much information publicly available about it.

the A10 prioritizes tensor compute, while the A10G has a higher CUDA core performance

Qualcomm

Words: 9

Ciro Santilli has always had a good impression of these people.

Silicon Graphics (1981-2009)

Words: 150

This company is a bit like Sun Microsystems, you can hear a note of awe in the voice of those who knew it at its peak. This was a bit before Ciro Santilli's awakening.

Those people created OpenGL for God's sake! Venerable.

Both of them and Sun kind of died in the same way, unable to move from the workstation to the personal computer fast enough, and just got killed by the scale of competitors who did, notably Nvidia for graphics cards.

Some/all Nintendo 64 games were developed on it, e.g. it is well known that this was the case for Super Mario 64.

Also they were a big UNIX vendor, which is another kudos to the company.

Chinese semiconductor industry

Words: 6

Video 34.

China's Making x86 Processors by Asianometry (2021)

Source.

 Tagged

SMIC