Ciro Santilli OurBigBook.com $£ Sponsor €¥ 中国独裁统治 China Dictatorship 新疆改造中心、六四事件、法轮功、郝海东、709大抓捕、2015巴拿马文件 邓家贵、低端人口、西藏骚乱
ourbigbook-com.bigb
= OurBigBook.com
{c}
{numbered}
{scope}
{tag=Personal knowledge base software}

https://ourbigbook.com is a <website> created by <Ciro Santilli>.

The website is the reference instance of <OurBigBook Web>, which is part of the <OurBigBook Project>, the other main part of the project are software that users can run locally to publish their content such as the <OurBigBook CLI>.

The source code for ourbigbook.com is present at: https://github.com/ourbigbook/ourbigbook/tree/master/web

The project documentation is present at: https://docs.ourbigbook.com#ourbigbook-web-user-manual

This page contains further information about the project's rationale, motivation and planning.

\Video[https://www.youtube.com/watch?v=BR2dXeR5jt8]
{title=Intro to the <OurBigBook Project>}
{height=500}

= How the website works
{parent=OurBigBook.com}

See: https://docs.ourbigbook.com/#ourbigbook-web-user-manual

= Alternatives
{parent=OurBigBook.com}

These are websites that offer somewhat overlapping services, many of which served inspirations, and why we think something different is needed to achieve our goals.

Notably, OurBigBook is the result of <Ciro Santilli>'s experiences with:
* <Wikipedia>
* <GitHub>
* <Stack Exchange> (or as non techies might point out, <Urban Dictionary>, or <Quora> before it was such an incomprehensible shitshow)
OurBigBook could be seen as a cross between those three websites.

Quick mentions:
* https://handwiki.org/wiki/HandWiki:About[]: technically the same as <Wikipedia>, but with more aligned moderation policies
* https://ecotext.co/ similar goals. Their website seems quite broken now though as of 2021, can't see text properly. Crunchbase entry: https://www.crunchbase.com/organization/ecotext says they are from Durham, New Hampshire, United States. Cannot see how to publish, curated material only? Twitter: https://twitter.com/ecotextinc?lang=en One of the founders: https://twitter.com/BigNel_21 | https://www.linkedin.com/in/ecotextnelsonthomas/[]. Their LinkedIn: https://www.linkedin.com/company/ecotext/people/
* https://fiveable.me/ bad: separates students and teachers, as a student I don't see where to create my content. Good: focus on teaching university level stuff to people outside of university via <Advanced Placement>. Bad: Lots of video content. Bad: Can't see the issue tracker attached to each page.
* <LessWrong>: their website system does have some similar feature sets to what we want. Reputation, Q&A sections, links between articles most likely, sort by upvote everywhere.
* https://crowdpub.org collaborative writing website, somehow goes to paragraph level, TODO how they reconcile different authors? Closed beta as of writing, so hard to be sure. From quick presentation on beta website, appears to attempt to share revenue to authors proportionally to the size of their contribution. Some <blockchain>-based reputation. Meh.
* TODO migrate all from: https://github.com/booktree/booktree/blob/master/alternatives.md
* https://studynotes.ie/[]. Admin approval on everything. No ToC. Fixed tag list for <university entry exams> topics.
* https://mindstone.com[]: there appears to be no sharing focus? File upload basesd? Not sure.
* <EverybodyWiki>
* looking for <open source> <Confluence (software)>-alternatives is an interesting way to go:
  * lists:
    * https://opensource.com/article/20/9/open-source-alternatives-confluence
    * https://www.nuclino.com/alternatives/confluence-alternative#ynaw
  * <BookStack>:
    * fixed 3-level page hierarchy
    * writen in <PHP>
    * <Markdown> support: https://www.bookstackapp.com/docs/user/markdown-editor/
    * no source-level import-export apparently: https://www.bookstackapp.com/docs/admin/backup-restore/[], https://youtu.be/WUvtzJfCAKE?t=904
    * <WYSIWYG>: https://www.bookstackapp.com/docs/user/wysiwyg-editor/[] via <TinyMCE>
    * page content repeating: https://www.bookstackapp.com/docs/user/reusing-page-content/ (will be useful for course modelling)
* https://github.com/shuding/nextra converts <Markdown> links to Next.js links. We should look into how it works.
* https://zettelkasten.de/the-archive/ "The Archive" from zettelkasten.de/. Closed source. By German software engineer Christian Tietze https://twitter.com/ctietze?lang=en
* <LLM generated wiki> e.g.:
  * <Kinnu>
* https://docs.tigyog.app/cli beautiful website, but doesn't achieve much. Has a Markdown upload mechanism. Ah, those newbs who think the average user will care about markup upload to DB... Oh, wait...
* https://www.stuvia.com/en-gb/school/uk/oxford-university/physics[]. PDF uploads. In theory you have to own copyright: https://www.stuvia.com/en-gb/copyright/guidelines but it feels unlikely that most material was uploaded by the copyright owners. If those people are up, then why can't we? Maybe... Registred in the UK. People: some Dutch dudes:
  * https://www.linkedin.com/in/jaapvannes
  * https://www.linkedin.com/in/hugo-kuijzer-bb81a625
* <Project Xanadu>: crazy overlaps, though that project is <vaporware> apparently?
  \Q[Administrators of Project Xanadu have declared it superior to the World Wide Web, with the mission statement: "Today's popular software simulates paper. The World Wide Web (another imitation of paper) trivialises our original hypertext model with one-way ever-breaking links and no management of version or contents.]

<Static website>-only alternatives:
* https://quarto.org/
* https://vitepress.dev[]. https://vitepress.dev/guide/markdown unmanaged internal links. Sample website: https://wiki.nikiv.dev/[].

Conceptual:
* <The Final Encyclopedia (Paul Allen)>: <science fiction> concept, but the name was reused by <Paul Allen> in a research project
* <second brain>
* <collective intelligence>

Markup editors:
* https://www.zettlr.com Good local Markdown editor with some extensions, including cross file links. Good mixture of <WYSIWYG> and markup code, e.g. images show up.
  * Internal cross references extension are not crisp though, open up a search apparently, and no error checking, and don't render correctly on HTML?

TODO:
* https://pressbooks.directory/

= Wikipedia
{c}
{parent=Alternatives}

* you don't get any/sufficient recognition for your contributions. The closest they have to upvotes and reputation is the incredibly obscure "thank" feature which is only visible to the receiver itself: https://en.wikipedia.org/wiki/Help:Notifications/Thanks
* <deletionism> is a tremendous problem on Wikipedia, for two main causes:
  * tutorial-like subjectivity
  * notability
  The stuff you wrote can be deleted anytime by some random admin/opposing editor, examples at: <deletionism on Wikipedia>{full}.

  This also possibly leads to <edit wars> in the case of sub-page content (full page deletion is more clearly arbitrated).
* Scope too limited, and politics defined. Everything has to sound encyclopedic and be notable enough. This basically excludes completely good tutorials.
* Insane impossible to use <markup language>-base talk pages instead of issue trackers?! Ridiculous!!! That change alone could make Wikipedia so much more amazing. Wikipedia could become a Stack Exchange killer by doing that alone + some basic reputation system. Some work on that is being done at: https://www.mediawiki.org/wiki/Extension:DiscussionTools[], already in Beta as of 2022.
* <Edit wars>

= Stack Exchange
{c}
{parent=Alternatives}

<Stack Exchange> solves to a good extent the use  cases:
* I have a very specific question, type it on <Google>, find top answers
* I have an answer, and I put it here because it has a much greater chance of being found due to the larger <PageRank> than my <personal web page> will ever have
points of view. It is a big open question if we can actually substantially improve it.

Major shortcoming are mentioned at <idiotic Stack Overflow policies>:
* Scope restrictions can lead to a lot of content deletion: <closing questions as off-topic>
  * <closing questions as off-topic>
  * <Stack Overflow content deletion>
  * <stack Overflow link-only answer policy>
  * <stack Overflow no duplicate answers policy>
  This greatly discourages new users, who might still have added value to the project.

  On our website, anyone can post anything that is legal in a given country. No one can ever delete your content if it is legal, no matter their reputation.
* Although you can answer your own question, there's no way to write an organized multi-page book with Stack Exchange due to shortcomings such as no table of contents, 30k max chars on answer, huge risk of deletion due to "too broad"
* Absolutely no algorithmic attempt to overcome the fastest gun in the West problem (early answers have huge advantage over newer ones): https://meta.stackoverflow.com/questions/404535/closing-an-old-upvoted-question-as-duplicate-of-new-unvoted-questions/404567#404567
* Native reputation system:
  * if the living ultimate <God> of `C++` upvotes you, you get `10` reputation
  * if the first-day newb of `Java` upvotes you, you also get `10` reputation
* Randomly split between sites like Stack Overflow vs Super User, with separate user reputations, but huge overlaps, and many questions that appears as dupes on both and never get merged.
* Possible edit wars, just like <Wikipedia>, but these are much less common since content ownership is much clearer than in Wikipedia however

Bibliography:
* https://dev.to/codemouse92/has-stackoverflow-become-an-antipattern-3icb (http://web.archive.org/web/20191021090247/https://dev.to/codemouse92/has-stackoverflow-become-an-antipattern-3icb[archive])

= Blogs
{parent=Alternatives}

Where <blog> is taken in a wide sense, including e.g. <Medium (website)>, <WordPress>, <Facebook>, <Twitter>, etc., etc.

The main shortcoming of blogs is the lack of topic convergence across blogs. Each blog is a moderated castle. So who is the best user for a given topic, or the best content for a given tag, across the entire website?

The only reasonable free material we have for advanced subjects nowadays are <university lecture notes>.

While some of those are awesome, when writing a large content, no one can keep quality high across all sections, there will always be knowledge that you don't have which is enlightening. And <Googlers> are more often than not interested only in specific sections of your content.

Our website aims to make smaller subjects vertically curated across horizontal single author tutorials.

``
MIT calculus course             UCLA calculus course

* Calculus                <---> * Calculus
  * Limit                 <--->   * Limit
    * Limit of a function
    * Limit of a series   <--->     * Limit of a series
  * Derivative            <--->   * Derivative
                                    * L'Hôpital's rule
  * Integral              <--->   * Integral
``

Some more links:
* https://prose.sh/ multiblog, the only feature is easy of publishing from CLI

= University lecture notes
{parent=Alternatives}

Basically everything that applies to the <blogs> section also applies here, but university lecture notes are so important to us that they deserve a bit more talk.

It is arguable that this is currently the best way to learn any university subject, and that it can already be used to learn any subject.

We basically just want to make the process more efficient and enjoyable, by making it easier:
* to find what you want based on an initial subject hit across the best version of any author
* and to publish your own stuff with one click, and get feedback if people like it or not, and improvement suggestions like you do you GitHub

One major problem with lecture notes is that, as the name suggests, they are merely a complement to the lecture, and don't contain enough detail for you to really learn solely from them without watching the lecture.

The only texts that generally teach in enough depth are actual books, which are almost always commercial.

So in a sense, this project can be seen as a path to upgrade free lecture notes into full blown free books, from which you can learn from scratch without any external material.

And a major way in which we believe this can be done is through the reuse of sections of lecture notes by from other universities, which greatly reduces the useless effort of writing things from scratch.

The intended mental picture is clear: the topics feature https://docs.ourbigbook.com/#ourbigbook-web-topics will is intended to act as the missing horizontal topic integration across lecture notes of specific universities, e.g:

``
MIT calculus course             UCLA calculus course

* Calculus                <---> * Calculus
  * Limit                 <--->   * Limit
    * Limit of a function
    * Limit of a series   <--->     * Limit of a series
  * Derivative            <--->   * Derivative
                                    * L'Hôpital's rule
  * Integral              <--->   * Integral
``

\Image[https://raw.githubusercontent.com/ourbigbook/ourbigbook-media/master/Fundamental_theorem_of_calculus_topic_page.png]
{title=Example topics page of OurBigBook.com}
{height=700}
{border}

One important advantage of lecture notes is that since they are written by the teacher, they should match exactly what "students are supposed to learn to get good grades", which <students must have a flexible choice of what to learn>[unfortunately is a major motivation for student's learning] weather we want it or not.

One big open question for this project is to what extent notes written for lectures at one university will be relevant to the lectures at another university?

Is it possible to write notes in a way that they are naturally reusable?

It is our gut feeling that this is possible. But it almost certainly requires an small intentional effort on the part of authors.

The question then becomes whether the "become famous by getting your content viewed in other universities" factor is strong enough to attract users.

And we believe that it might, it just might be.

= How to convince teachers to use CC BY-SA
{parent=University lecture notes}

A major difficulty of getting such this to work is that may university teachers want to retain closed copyright of their work because they:
* want to publish a book later and get paid. Yes, the root problem is that teachers get paid way too little and have way too little job security for the incredibly important and difficult extremely difficult job they are doing, and we have to <vote> to change that
* are afraid that if amazing material is made freely available, then they would not be needed and lose their jobs.  Once again, job security issue.
* believe that if anyone were allowed to touch their precious content, those people would just "screw it up" and make it worse
* don't even want to publish their notes online because "someone will copy it and take their credit". What a mentality! In order to prevent a theft, you are basically guaranteeing that your work will be completely forgotten!
* don't want students to read the notes and skip class, because spoken word has magic properties and imparts knowledge that cannot otherwise conveyed by a book
* are afraid that mistakes will be found in their material. Reputation is of course everything in academia, since there is no money.

  So it's less risky to have closed, more buggy notes, than open, more correct ones.

  This can be seen clearly for example on <Physics Stack Exchange>, and most notably in <particle physics> (well, which is basically the only subject that really gets asked, since anything more experimental is going to be blocked off by patents/interlab competition), where a large proportion incredibly amazing users have anonymous profiles.

  They prefer to get no reputation gains from their amazing contributions, due to the fear that a single mistake will ruin their career.

  This is in stark contrast for example to <Stack Overflow>, where almost all top users are not anonymous:

  List of top users: https://physics.stackexchange.com/users?tab=Reputation&filter=all and some notable anonymous ones:
  * https://physics.stackexchange.com/users/2451/qmechanic
  * https://physics.stackexchange.com/users/50583/acuriousmind
  * https://physics.stackexchange.com/users/43351/profrob
  * https://physics.stackexchange.com/users/84967/accidentalfouriertransform
  * https://physics.stackexchange.com/users/56997/curiousone
  * https://physics.stackexchange.com/users/139781/probably-someone
  * https://physics.stackexchange.com/users/206691/chiral-anomaly

Therefore the only way is to find teachers who are:
* enlightened to use such licenses
* forced by their organizations to use such licenses
The forced option therefore seems like a more bulk efficient starting point for searches.

No matter how much effort a single person puts into writing perfect tutorials, they will never beat 1000x people + an algorithm.

It is not simply a matter of how much time you have. The fundamental reason is that each person has a different background and different skills. Notably the <there is value in tutorials written by beginners>[young students have radically different understanding than that of the experienced teacher].

Therefore, those that refuse to contribute to such platforms, or at least license their content with open licenses, will inevitably have their work forgotten in favor of those that have contributed to the more open platform, which will eventually dominate everything.

Perhaps <OurBigBook.com> is not he killer platform that will make this happen. Perhaps the world is not yet ready for it. But Ciro believes that this will happen, sooner or later, inevitable, and he wants to give it a shot.

Also worth checking:
* https://jornal.usp.br/universidade/usp-de-sao-carlos-oferece-aulas-de-graduacao-em-matematica-e-estatistica-abertas-ao-publico/ "Open Classroom" program from the <University of São Paulo>. We should Google for "Open Classroom" a bit more actually.
* https://open.ed.ac.uk/about/[]: talk only

\Image[https://web.archive.org/web/20220120132903if_/https://phdcomics.com/comics/archive/phd100610s.gif]
{title=The <postgraduate education>[Grad Student] <Brain> by <PhD Comics> (2010)}
{height=500}
{description=Convincing academics that their tutorial are not always perfect is one of blocking points to the acceptance of solutions such as <OurBigBook.com>. To thrive in the competition of academia, those people are amazing at publishing novel results. Explaining to beginners however, not necessarily so.}
{source=https://phdcomics.com/comics/archive.php?comicid=1379}

= Existing data sources
{parent=Alternatives}

Some possible/not possible sources that could be used to manually bootstrap content:
* <LibreTexts>. Good project. "Teacher-only-content" unfortunately as usual. But besides that fundamental flaw, they do exactly what we want to do in a sense.
* <OpenStax>: <CC BY>. This could be a great entry point, as they already have some university integration going on, and might be interested in this project.
* https://physics.stackexchange.com/questions/6157/list-of-freely-available-physics-books "List of freely available physics books" explicitly asks for:
  \Q[a list of physics books with open-source licenses, like Creative Commons, GPL]
  but the thread <closing questions as off-topic>[was locked], and basically none of the sources in the answers have free licenses, nor do they note it. It just seems that the <physicists> don't know what a <free license> is.
* <MIT OpenCourseWare>: <CC BY-NC-SA>, so not really usable
* https://github.com/certik/theoretical-physics[]: <MIT License>. https://opensource.stackexchange.com/questions/324/are-permissive-licenses-mit-bsd-zlib-compatible-with-cc-by[Workable] but wonky.
* https://subwiki.org/[]: <wiki> with some upper graduate <math> subjects presumably by this <Indian> dude: https://www.linkedin.com/in/vipul-naik-0ab1898/[]. Description on his homepage: https://vipulnaik.com/subwiki/[]. He's also got other interesting but not so relevant projects:
  * pro freer immigration laws: https://vipulnaik.com/openborders/
  * https://vipulnaik.com/cognito-mentoring/ free mentoring project for interested students
  He's also into <Stack Overflow>, <Quora> and <Wikipedia> editing. That's a cool dude. He's into in <LessWrong> it seems.
* massive <mathematics> books
  * <Infinite Napkin>.<CC BY-SA> <mathematics> infinite book: https://github.com/vEnhance/napkin/issues/77[]. Very similar type of content to what we want in this project!
  * <Stacks Project>

Existing lecture notes by students:
* https://github.com/mb2g17/NotesNetworkArchive <Google Docs>-based: https://docs.google.com/document/d/1OIcQ8dJ_FAhdkirU94M29-ZbNZ4oQs1LbWF3Nz-mq_U/edit#heading=h.vehxib58w1iw[]. An actual student uploading tons of lecture notes in one coherent system. <CC By-NC-SA> unfortunately.
* https://academia.stackexchange.com/questions/148261/do-you-keep-your-study-notes-publicly-available mentions:
  * Cambridge Mathematics Lecture Notes by Dexter Chua (2014-2018)
    * http://dec41.user.srcf.net/notes/
    * https://github.com/dalcde/cam-notes
  Related: https://academia.stackexchange.com/questions/40381/how-common-is-it-that-professors-have-their-students-write-textbooks

Lecture note upload website:
* https://nexusnotes.com likely illegal reuploads of PDFs from teachers
* https://www.studocu.com/en-gb Paywall. PDF uploads. Unclear if simple teacher reuploads or actual novel notes.
* https://www.studydrive.net/
* Chinese GitHub repos. Some of these are very advanced in terms of content quantity and organizational quality! The Chinese are miles ahead in this area:
  * https://github.com/PKUanonym/REKCARC-TSC-UHT Guidance for courses in Department of Computer Science and Technology, Tsinghua University. Chinese. Appears to try and store all past exams.
  * https://github.com/lib-pku/libpku
  * https://github.com/openwhu/OpenWHU[]: Wuhan University
  * https://github.com/USTC-Resource/USTC-Course[]: USTC
  * https://github.com/Zeal-L/UNSW[]: <UNSW> from <Australia>, but by a Chinese dude
  * https://github.com/apachecn/mit-18.06-linalg-notes[]: translation of <MIT> course to Chinese
  * https://github.com/chenyang1999/MyComputerCollegeCourses[]: TODO which univeresity
  * https://github.com/elder-frog/OpenCourseCatalog[]: nothing to do with this project, but since I'm making a list, this dude is copying <YouTube> videos to <Bilibili>. And he's edgy anti-CCP on Twitter, what a legend.
  * https://github.com/TheBloodthirster/BUAA_Course_Sharing[]: https://en.wikipedia.org/wiki/Beihang_University

Exams uploads:
* https://questions.tripos.org/part-ib/all/ <University of Cambridge> Mathematics past examinations

<Personal knowledge base> recommendation threads:
* https://news.ycombinator.com/item?id=23942342
* https://www.reddit.com/r/selfhosted/comments/l37w0j/personal_knowledge_base/
* https://www.reddit.com/r/selfhosted/comments/esso3u/personal_knowledgebase/
* https://www.youtube.com/watch?v=XRpHIa-2XCE
  * Joplin, TODO

Personal knowledge bases:
* Forester
  * https://www.jonmsterling.com/tfmt-0001.xml

= Knowledge graph editors
{parent=Alternatives}

TODO gotta research those more:
* https://roamresearch.com/ no public graphs
* https://nesslabs.com/roam-research-alternatives a bunch of open source alternatives to it
  * <Trillium Notes>. Notable project! Pun unintended!
  * Stroll https://giffmex.org/stroll/stroll.html[]. How to publish? How to see tree?
  * tiddlyroam https://joekroese.github.io/tiddlyroam/ graph rather than text searchable ToC. Public instance? Multiuser?
  * Athens https://github.com/athensresearch/athens rudimentary <WYSIWYG>
  * Logseq https://github.com/logseq/logseq no web interface/centralized server?
  * Zim https://zim-wiki.org/ Local only. <WYSIWYG>.
* https://itsfoss.com/obsidian-markdown-editor[]. <Closed source>. They have an OK static website publication mechanism: https://publish.obsidian.md/ram-rachum-research/Public/Agents+aren't+objective+entities%2C+they're+a+model but leaf node view only for now, no cross source page render. They are committed to having plaintext source which is cool: https://twitter.com/kepano/status/1675626836821409792
* https://www.notion.so/[]. Possible to publish pages: https://www.notion.so/help/public-pages-and-web-publishing

  But non-paid plan currently disables "Search engine indexing" of that sharing, so it's useless. There's an "Allow duplicate as template" button though which is nice.

  URLs are horrendous however, e.g.: https://lofty-flower-be4.notion.site/aa-2274c59a06124d5b974b781a67340670 Only the `aa` in that came from us. They don't even have the guts for a fixed subdomain.

  Also it does not work without JavaScript, no SSR, everything is dynamic.

  They don't show multiple input pages on the same render, e.g.: https://lofty-flower-be4.notion.site/aa-2274c59a06124d5b974b781a67340670 does not contain the child https://lofty-flower-be4.notion.site/bb-45df7212a2e14e04b3f9604035c7acf4[] as already implemented on https://docs.ourbigbook.com/ourbigbook-web-dynamic-article-tree[OurBigBook Web Dynamic Article Tree].

  Cross page links to work fine. But you don't link to explicit IDs, only internal hidden IDs. This can be even slightly confusing to users as multiple identical options can show up when you start creating a link. They do try to disambiguate with the parent page however.

  So this is a reasonable single-person publishing platform for your notes.

  But no upvotes/topics. Favorites could be used as upvotes, but it does not seem like you can favorite other people's work. It's just not their focus, will never be.

  They have an education push: https://www.notion.so/product/notion-for-education but it's likely hopeless.
Major downsides that most of those personal knowledge databases have:
* very little/no focus on public publishing, which is the primary focus of OurBigBook.com
* either limited or no multiuser features, e.g. edit protection and cross user topics
* graph based instead of tree based. For books we need a single clear ordering of a tree. Graph should come as a secondary thing through tags.

= Learning management systems
{parent=Alternatives}

This website basically aims to be a </learning management system>, allowing in particular a teacher to focus his help on students that he is legally obliged to help due to their job. But it will have the following unusual characteristics in current LMS solutions:
* public first, to allow reuse across universities, rather than paywalled as is the case for most top universities
* students can create material just like teachers, both are on equal footing. Students/teachers will see an indicator "this is your teacher"/"this is your student for this/past semester", but that is the only difference between their interfaces.

= GitHub
{c}
{parent=Alternatives}

If <Ciro Santilli> were to write a book about <quantum mechanics> as of 2020 (before <OurBigBook.com> went live), he would upload an <OurBigBook Markup> website to <GitHub Pages>.

But there is one major problem with that: the entry barrier for new contributors is very large.

If they submit a pull request, Ciro has to review it, otherwise, no one will ever see it.

Our amazing website would allow the reader to add his own example of, say, The <uncertainty principle>, whenever they wants, under the appropriate section.

Then, people who want to learn more about it, would click on the "defined tag" by the article, and our amazing analytics would point them to the best such articles.

= Other projects
{parent=Alternatives}

* <HyperCard>: we are kind of a "multiuser" version of HyperCard, trying to tie up cards made by different users. It is worth noting that <HyperCard> was one of the inspirations for <WikiWikiWeb>, which then inspired <Wikipedia>
* https://en.wikipedia.org/wiki/Semantic_Web[Semantic Web]
* <NLab>
* https://physicstravelguide.com/ Nice manifesto: https://physicstravelguide.com/about by <Jakob Schwichtenberg>.
* <OpenStax>
* https://www.ft.com/content/5515ec3e-0040-4d90-85a9-df19d6e3ebd2 (https://archive.ph/nlr1h[archive]) Twilio’s Jeff Lawson: an evangelist for software developers
  \Q[
  As a student at the University of Michigan, he started a company that made lecture notes available free online, drawing a large audience of Midwestern college students and, soon enough, advertisers. At the height of the dotcom bubble, he dropped out of college, raised \$10m from the venture firm Venrock and moved the company to Silicon Valley.

  His start-up drew interest from an acquirer that was planning to go public early in 2000. They closed the acquisition but missed their IPO window as the market plunged, and by August the company had filed for bankruptcy. Stock that Lawson and investors in his start-up received from the sale became worthless.
  ]
  You can never be first. But you can have the correct business model. That company's website must have gone into <intellectual property>[IP] <Purgatory>, and could never be released as an open source website.

  This project won't make a lot of <money>. <Open source> and <not-for-profit> seems like the way to go.

  The website was called https://stubhub.com/[], as of 2021 the domain had been sold to an unrelated website.

  He might actually be interested in donating to <OurBigBook.com> if it move forward now that he's a billionaire.
* <Knol>: basically the exact same thing by <Google> but 14 years earlier and declared a failure. Quite ominous:
  \Q[Any contributor could create and own new Knol articles, and there could be multiple articles on the same topic with each written by a different author.]
* <leanpub>: similar goals, <markdown>-based, but the usual "you own your book copyright and you are trying to sell your book" approach
* <nature Scitable>

OK, just going random now:
* https://twitter.com/jwmares/status/1136273581455958017

= Action plan
{parent=OurBigBook.com}

The steps are sorted in roughly chronological order. The project might fail at any point, and some steps may be carried in parallel:
* make <OurBigBook Markup> good enough, to the point that it allows to create a static version of the website, which is used to prototype certain ideas, and for Ciro to start writing test content.

  Status March 2022: reached a point that it is already highly usable. The following website may continue.
* create a basic implementation of the website, without advanced features like PageRank sorting and WYSIWYG. This is not much more than a blog with some extra metadata, so it is definitely achievable with constrained resources.
* find a university teacher would would like to try it out.

  Ciro would like to volunteer to work for free for this teacher and students to help the students learn.

  He would like act like a "super student" who has a lot of free time and motivation.

  Ciro would start by mapping the headers of the lecture notes onto the website, and then slowly adding content as he feels the need to improve certain explanations.

  Finding teachers willing to allow this will be a major roadblock: <how to convince teachers to use CC BY-SA>.

  If such enlightened teacher is found, it will allow for the initial validation of the website, to decide what kind of tweaking the idea might need, and start uploading quality technical content to the site.
* once some level of validation as been done, Ciro will start looking for charitable <charitable grant opportunities> more aggressively
* if things seem to be working, start adding more advance features: <PageRank-like ranking> sorting and WYSIWYG editing

  The recommendation algorithms notably is left for a second stage because it needs real world data to be tested. And at the beginning, before <Eternal September> kicks in, there would be few posts written by well educated university students, so a simple sort by upvote would likely be good enough.

Ciro decided to start with a decent <markup language> with a decent implementation: <OurBigBook Markup>. Once that gets reasonable, he will move on to another attempt at the website itself.

The project description was originally at: https://github.com/cirosantilli/write-free-science-books-to-get-famous-website but being migrated here. The original working project name was "Write free books to get famous website", until Ciro decided to settle for `OurBigBook.com` and fixed the domain name.

= Philosophy
{parent=OurBigBook.com}

= Desired social impact
{parent=Philosophy}

Crush the current grossly inefficient educational system, replace today's students + teachers + researchers with unified "online content creators/consumers".

Gamify them, and pay the best creators so they can work it full time, until some company hires for more them since they are so provenly good.

Destroy useless <exams>, the only metrics of society are either:
* how much money you make
* how high is your educational content creator reputation score

Reduce the entry barrier to education, like Uber has done for taxis.

Help create much greater <equal opportunity> to talented poor students as described at <free gifted education>.

Give the students <students must have a flexible choice of what to learn>[a flexible choice of what to learn], which basically implies that a much large proportion of students get a de-facto <gifted education>.

In some ways, Ciro wants the website to feel like a <video game>, where you fluidly interact with headers, comments and their metadata. If game developers can achieve impressively complicated game engines, why can't we achieve a decent amazing elearning website? :-)

Related:
* https://www.edsurge.com/news/2021-08-17-it-s-time-to-blur-the-boundaries-between-high-school-college-and-work

= Motivation
{parent=Philosophy}

Many subjects have changed very little in the last hundred years, and so it is mind-blowing that people have to pay for books that teach them!

If <video A computer is the equivalent of a bicycle for our minds by Steve Jobs (1980)>[computers are bicycles for the mind], Ciro wants this website to be the Ferrari of the mind.

Since <Ciro Santilli> was young, he has been bewildered by the natural sciences and mathematics <Ciro Santilli's bad old event memory>[due to his bad memory].

The beauty of those subjects has always felt like intense sunlight in a fresh morning to Ciro. Sometimes it gets covered by clouds and obscured by less important things, but it always comes back again and again, weaker or stronger with its warmth, guiding Ciro's life path.

As a result, he has always suffered a lot at school: his grades were good, but he wasn't really learning those beautiful things that he wanted to learn!

School, instead of helping him, was just wasting his time with superficial knowledge.

First, before <university>, school organization had only one goal: put you into the best universities, to make a poster out of you and get publicity, so that more parents will be willing to pay them <money> to put their kids into good university.

Ciro once asked a <chemistry> teacher some "deeper question" after course was over, related to the superficial vision of the topic they were learning to get grades in university entry <exams>. The teacher replied something like:
\Q[You remind me of a friend of mine. He always wanted to understand the deeper reason for things. He now works at NASA.]
Ciro feels that this was one of the greatest compliments he has ever received in his life. This teacher, understood him. Funny how some things stick, <Ciro Santilli's bad old event memory>[while all the rest fades].

Another interesting anecdote is how <Ciro Santilli's mother> recalls that she always found out about exams in the same way: when the phone started ringing as Ciro's friends started asking for help with the subjects just before the exam. Sometimes it was already too hopelessly late, but Ciro almost always tried. Nothing shows how much better you are than someone than teaching them.

Then, after entering university, although things got way better because were are able to learn things that are borderline useful.

Ciro still felt a strong emotion of nostalgia when after university his mother asked if she could throw away his high school books, and Ciro started tearing them all down for recycling. Such is life.

University teachers were still to a large extent researchers who didn't want to, know how to and above all have enough time and institutional freedom to teach things properly and make you see their beauty, some good relate articles:
* <The Purpose of Harvard is Not to Educate People by Sean Carroll (2008)>
* <How To Get Tenure at a Major Research University by Sean Carroll (2011)>

The very fact that you had very little choice of what to learn so that a large group can get a "Diploma", makes it impossible for people to deeply learn what the really want.

This is especially true because Ciro was in <Brazil>, a third world country, where the opportunities are comparatively extremely limited to the <what poor countries have to do to get richer>[first world].

Also extremely frustrating is how you might have to wait for years to get to the subject you really want. For example, on a <physics> course, <quantum mechanics> is normally only taught on the third year! While there is value to knowing the pre-requisites, holding people back for years is just too sad, and Ciro much prefers <backward design>. And just like the university entry exams, this creates an entry barrier situation where you might in the end find that "hey, that's not what I wanted to learn after all", see also: <students must have a flexible choice of what to learn>.

We've created a system where people just wait, and wait, and wait, never really doing what they really want. They wait through school to get into university. They wait through university to get to masters. They wait through masters to get to <PhD>. They wait through <PhD> to become a PI. And for the minuscule fraction of those that make it, they become fund proposal writers. And if you make any wrong choice along the, it's all over, you can't continue anymore, the cost would be too great. So you just become software engineer or a consultant. Is this the society that we really want?

And all of this is considering that he was very lucky to not be in a poor family, and was already in some of the best educational institutions locally available already, and had comparatively awesome teachers, without which he wouldn't be where he is today if he hadn't had such advantages in the first place.

But no matter how awesome one teacher is, no single person can overcome a system so large and broken. Without technological innovation that is.

The key problem all along the way is the Society's/Government's belief that everyone has to learn the same things, and that grades in exams mean anything.

Ciro believes however, that <exams> are useless, and that there are only two meaningful metrics:
* how much <money> you make
* fame for doing for doing useful work for society without earning money, which notably includes creating new or better free knowledge such as in <academic papers>, either novel or <review paper>[review]

Even if you wanted to really learn natural sciences and had the time available, it is just too hard to find good resources to properly learn it. Even attending university courses are hit and miss between amazing and mediocre teachers.

If you go into a large book shop, the science section is tiny, and useless popular science books dominate it without <videos of all key physics experiments>[precise experiment descriptions]. And then, the only few "serious" books are a huge list of formulas without any experimental motivation.

And if you are lucky to have access to an university library that has open doors, most books are likely to be old and boring as well. <Googling> for <PDFs> from university courses is the best bet.

Around 2012 however, he finally saw the light, and started his path to <Ciro Santilli's Open Source Enlightenment>. University was not needed anymore. He could learn whatever he wanted. A vision was born.

To make things worse, for a long time he was tired of <social inequality>[seeing poor people begging on the streets every day and not doing anything about it]. He thought:
\Q[He who teaches one thousand, saves one million.]
which like everything else is likely derived subconsciously from something else, here https://en.wikiquote.org/wiki/Talmud[Schindler's list possibly adapted quote from the Talmud]:
\Q[He who saves the life of one man saves the entire world.]

So, by the time he left University, instead of pursuing a PhD in theoretical Mathematics or Physics just for the beauty of it as he had once considered, he had new plans.

We needed a new educational system. One that would allow people to fulfill their potential and desires, and truly <unconditional basic income>[improve society as a result], both in rich and poor countries.

And he found out that programming and applied mathematics could also be fun, so he might as well have some fun while doing this! ;-)

So he started https://github.com/booktree/booktree[Booktree] in 2014, a <GitLab> fork, worked on it for an year, noticed the https://github.com/booktree/booktree/blob/master/blog/2015-01-why-ciro-stopped-working-on-booktree.md[approach was dumb], and a few years later started building this new version. The repo https://github.com/booktree/booktree is a small snapshot of Ciro's 2014 brain on the area, there were quite a few similar projects at the time, and most have died.

Ciro is basically a librarian at heart, and wants to be the next:
* https://en.wikipedia.org/wiki/Jimmy_Wales[Jimmy Wales]
* https://en.wikipedia.org/wiki/Brewster_Kahle[Brewster Kahle]
* https://en.wikipedia.org/wiki/Tim_Berners-Lee[Tim Berners Lee]
* <Tim O'Reilly>, who once brilliantly described <O'Reilly Media> as "a lifestyle business that got out of control" https://www.inc.com/magazine/20100501/the-oracle-of-silicon-valley.html{ref}
* <Aaron Swartz>. Minus <suicide> hopefully.

\Video[https://youtube.com/watch?v=WQR0gx0QBZ4]
{title="<Jimmy Wales>: How a ragtag band created <Wikipedia>" 2005 <TED (conference)> talk}
{description=Original source: https://www.ted.com/talks/jimmy_wales_the_birth_of_wikipedia[].}

\Video[https://youtube.com/watch?v=pXoHC2D15hM]
{title="Brewster Kahle: A digital library, free to the world." 2007 <TED (conference)> Talk}
{description=Talks about the <Internet Archive> which he created.}

\Video[https://www.youtube.com/watch?v=-MTRxRO5SRA]
{title=<Sal Khan> from <Khan Academy> 2016 <TED (conference)> talk}
{description=Ciro is not a big fan of the "basis on top of basis focus" because of his obsession with <backward design>, but "learn to mastery at your own pace" and "everyone can be a world class innovator" are obviously good.}

= Manifesto
{parent=Philosophy}

Education has become an expensive bureaucratic exercise, completely dissociated from reality and usefulness.

It completely rejects what the individual wants to achieve, and instead attempts to mass homogenize and test people through endless hours of boredom.

And the only goals it achieves are testing student's resilience to stress, and facilitating the finding of sexual partners. True learning is completely absent.

Teachers only teach because they have to do it to get paid, not for passion. Their only true incentive is co-authoring papers.

We reject this bullshit.

Education is meant to help us, the students, achieve our goals through passionate learning.

And, we, the students, are individuals, with different goals and capabilities.

The way we protest is to publish the knowledge from University for free, on the Internet, so that anyone can access it.

And we do this is a law-abiding way, without copyright infringement, so that no one can legally take it down.

We come to our courses just for the useless roll calls. But we already know all the subject better than the "teacher" on the very first day.

And we are already more famous than the "teacher" online, and through the Internet have already taught more way way more people than they ever will.

The effect of this is to demoralize the entire school system at all levels, until only one conclusion is possible: implosion.

And from the ashes of the old system, we will build a new one, which does only what matters with absolute efficiency: help the individual students achieve their goals.

A system in which the only reason why university exist will be to <the only reason for universities to exist should be the laboratories>[allow the most knowledgeable students to access million dollar laboratory equipment], and to pay the most prolific content creators so they can continue content creating.

No more useless courses. No more useless tests. Only passion, usefulness and focus.

= Feature ideas
{parent=OurBigBook.com}

In this section we will gather some more advanced ideas besides the basic features described at <how the website works>.

= PageRank-like ranking
{c}
{parent=Feature ideas}

It would be really cool to have a <PageRank>-link algorithm that answers the key questions:
* what is the best content for subject X.

  For example, if you are reading `cirosantilli/riemann-integral` and it is crap, you would be able to click the button
  \Q[Versions by other authors]
  which leads you to the URL: http://ourbigbook.com/subject/mathematics[]. This URL then contains a list of all pages people have written about the subject `mathematics`, sorted by some algorithm, containing for example:
  * http://ourbigbook.com/johnsmith/riemann-integral
  * http://ourbigbook.com/cirosantilli/riemann-integral

  This URL would also contain a list of issues/comments that are related to the subject.
* who knows the most about subject X. This can be found by visiting: http://ourbigbook.com/users/mathematics[] "Top Mathematics users", which would contain the list of users sorted by the algorithm:
  * http://ourbigbook.com/johnsmith
  * http://ourbigbook.com/cirosantilli
However, Ciro has decided to leave this for phase two <action plan>, because it is impossible to tune such an algorithm if you have no users or test data.

Perhaps it is also worth looking into <ExpertRank>, they appear to do some kind of "expert in this area", but with <clustering> (unlike us, where the clustering would be more explicit).

Other dump of things worth looking into:
* https://en.wikipedia.org/wiki/Hilltop_algorithm

= User acquisition
{parent=OurBigBook.com}

The general and ideal user acquisition is of course organic Googling:
* user does not understand his teacher's explanation of a subject
* user <Googles> into rare specific subject
* looks around, then login/create account with <OAuth> to leaves a comment or upvote
* notice that you can fork anything
* <mind blown>[mind = KABOOM]

However, before that point, it is very likely that Ciro will have to physically do some very hard and specific user acquisition work at some University. Maybe there is a more virtual way of achieving this.

This work will involve going through some open set of <university lecture notes>, and creating a superior version of them on OurBigBook.com, and somehow getting students to notice it and use it as a superior alternative to their crappy lecture notes.

Another very promising route is publishing the answers to old examination questions on the website. It is likely that we will be able to overcome any copyright issues by uploading only the answers to numbered questions. There is a minor risk that these would be considered <derivative works> of the copyrighted questions. But universities would have to be very anal to enforce a DMCA for that!!!
* https://www.reddit.com/r/learnmath/comments/lj12pg/problem_sets_and_solutions_equivalent_to/

Getting in contact with students is an epic challenge, as an incredibly deep chasm separates us:
* it is basically impossible to try and approach teachers: <how to convince teachers to use CC BY-SA>
* and on the other hand, how will you get university students to trust you are not a pedophile and that you actually want to help them?

  The missing aspect is how to join their main "class communication group", e.g. a <WhatsApp> or <Discord> chat they have. That would be the perfect entry point to communicate with the end users. But that entry point is also generally closed exclusively for students, and sometimes lecturers, and will not accept anyone external.

  Perhaps Ciro would be able to do something with one of the two Universities he attended in the past: <École Polytechnique> or <University of São Paulo>. But there was no clear channel in those institutions for that. There is either an "infinitely noisy Facebook with everyone that bothered" or silence, deathly silence and isolation of no contact. The key hard part is getting a per-course granularity chat. <Discord> Student Hubs are a fantastic initiative in that area. Shame that Discord is an unusable mess with zero ways to select which notifications you care about: <Discord email notifications>{full}!

  One approach method that shows some promise is to follow the <Student societies>, which often host open events of interest outside of work hours.

Walking with advertisement t-shirts mentioning specific course names in some university location is something Ciro seriously considers, that's how desperate things are. Watch out: https://docs.ourbigbook.com/#public-relations for T-shirt news!

= Funding
{parent=OurBigBook.com}

Ciro is looking for:
* university teachers who might be interested in trying it out as described at <action plan>{full}, especially <how to convince teachers to use CC BY-SA>[those who already use open licenses for their lecture notes]
* <funding>[funding possibilities] for this project, including donations as mentioned at <sponsor>{full} and contracts

The initial incentive for the creators is to make them famous and allow them to get more fulfilling jobs more easily, although Ciro also wants to add <knowledge market>[money transfer] mechanisms to it later on.

We <how to convince teachers to use CC BY-SA>[can't rely on teachers writing materials, because they simply don't have enough incentive]: publication count is all that matters to their careers. The students however, are desperate to prove themselves to the world, and becoming famous for amazing educational content is something that some of them might want to spend their times on, besides grinding for useless <grade (exam)>.

= Why it is hard to make money from this website
{parent=Funding}

There is basically only one scalable business model in education as of the 2020's: helping teenagers pass <university entry exams>. And nothing else. Everything else is a "waste" of time.

Perhaps there is a little bit of publicity incentive to helping them win <knowledge olympiads> as well, but it is tiny in comparison, and almost certainly not a scalable investment. This may also depend on whether universities consider anything but exams, which varies by country.

That marked is completely saturated, and <Ciro Santilli> refuses to participate in it for <motivation>[moral reasons].

Beyond that, there is no scalable investment. Other non-scalable investments that could allow one to make a <lifestyle business> are:
* extra-curricular initiatives to get younger children interested in science. These may have some money stream coming from the parents of the children. This happens because for young children, the parents are more in control, and the parents, unlike the students, have some money to spend. An example: https://www.littlehouseofscience.com/

  The space is also further crowded by several not-for profits.

  This business model is possible because experiments for young children may be cheap to realize, unlike any experiment that would matter to a teenager or adult.
* creating a private university, for profit or not. Of course, at this point, you would be either:
  * competing against the reputation and funding of century old universities
  * or be offering more boring, lower tech or techless courses, to (God forbid the phrasing) "worse students", i.e. at a "worse university"

Teenagers and young adults:
* don't have money to give you if you want to "help them learn for real"
* are somewhat forced to obtain their "reputable university" reputation to kickstart their careers

It is this perfect storm that places this specific section of education in such a bad shape that it is today.

This project is likely to fail. It could become the <TempleOS> of <wikis>. The project' <autism> score is quite high. It might be an impossible attempt at a <lifestyle business>. But Ciro is beyond caring now. It must be done. Other things that come to mind:
* https://www.youtube.com/playlist?list=PLibNZv5Zd0dzvoxXrjA9xNHLpdgLhTkZz "Obsessed" playlist by Wired. Helps Ciro feel better about himself.
* https://en.wikipedia.org/wiki/Don_Quixote[Don Quixote]
* https://en.wiktionary.org/wiki/pipe_dream[pipe dream]
* <video Don't Try - The Philosophy of Charles Bukowski by Pursuit of Wonder (2019)>

Dangerous combination:
\Q[One man with a laptop and a dream.]
and for any crazy person who might wish to join: <Men Wanted for Hazardous Journey>.

\Video[https://www.youtube.com/watch?v=4b2IOOhJmxw]
{title=One Man's Dream - Ken Fritz Documentary (2021)}
{description=
In some ways, Ciro was reminded of <OurBigBook.com> by this documentary. Ken built his ultimate audio system without regard to money and time, to enjoy until he dies. Ciro is doing something similar. There is one fundamental difference however: everyone can enjoy a website all over the world.

A bit ominous though that the whole thing was eventually sold off for a fraction of the building cost: https://www.washingtonpost.com/style/interactive/2024/ken-fritz-greatest-stereo-auction-cost/[].
}

= Crowdfunding
{parent=Funding}

See: <sponsor>{full}.

= Charitable grant opportunities
{parent=Funding}

Once the ball starts rolling, these are people who should be contacted.

Basically anything under <educational charitable organization> counts.

It is also worth having a look under the <Wikipedia> page for <open educational resources>: https://en.wikipedia.org/wiki/Open_educational_resources

= Consulting
{parent=Funding}

Start with consulting for universities to get some cash flowing.

Help teachers create perfect courses.

At the same time, develop the website, and use the generated content to bootstrap it.

Choose a domain of knowledge, generate perfect courses for it, and find all teachers of the domain in the world who are teaching that and help them out.

Then expand out to other domains.

TODO: which domain of knowledge should we go for? The more precise the better.
* maths is perfect because it "never" changes. But does not make money.
* computer science might be good, e.g. machine learning.

= Knowledge market
{parent=Funding}

If enough people use it, we could let people sell knowledge content through us.

Teachers have the incentive of making open source to get more students.

Students pay when they want help to learn something.

We take a cut of the transactions.

However this goes a bit against our "open content" ideal.

Forced <sponsorware> would be a possibility.

Would be a bit like <Fiverr>. Hmmm, maybe this is not a good thing ;-)

= Advertisement
{parent=Funding}

Don't like this very much, but if it's the only way...

Maybe focus on job ads like </Stack Overflow>.

Then:
* like <YouTube>, pay creators proportionally to views/metrics
* paid subscription to remove ads from site

= Association with innovative schools
{parent=Funding}

Maybe we should talk to <innovative schools>, as they might be more open to such use of technology.

= Venture capital
{parent=Funding}

Not a fun of giving up control for such a low-maintenance cost venture... but keeping a list just in case...
* https://twitter.com/Borthwick/status/1533511605081845761 Betaworks