26.1.3.7.3. Wayback Machine (web.archive.org, 网站时光机)
Their Wayback Machine tool archive snapshots of webpages and allows youto browse the history of all archives:
By the Internet Archive organization:
This Chrome extension helps to quickly archive pages: https://chrome.google.com/webstore/detail/wayback-machine/fpnmgdkabkmnadcjpehmlllkndpkmiak
But remember that if you archive too many in a very quick succession before the previous ones have been archived, even if manually through that plugin, your account/IP might get blocked, so just give a few seconds for the current archive to terminate before starting new ones.
This happened to Ciro in 2020-06-14, but then he emailed the admins as mentioned at: https://help.archive.org/hc/en-us/articles/360016379432-Accounts-Tips-Troubleshooting- and they re-enabled it.
As of 2021, it annoys Ciro very much that you have to wait for a few minutes for the archived page to become visible. That’s an eternity when you are writting a text and need to link to an archive.
It is also extremelly annoying that they sometimes change image URLs randomly after they’ve become visible. Why is that? E.g. Ciro saw breaks between different timestamps:
and aso betweeen im
vs if
as in:
so it is just not reliable enough to use generally.
It is possible to request social media pages you own to be removed from the archive in some circumstances: https://webapps.stackexchange.com/questions/143529/is-it-possible-to-request-to-remove-page-snapshots-from-a-personal-social-media
Some website specific techniques:
-
YouTube videos: they do seem to download them!
-
But as as of 2020, they seem to have some serious bugs failing with "Sorry the Wayback Machine does not have this video (<id>) archived (or not indexed yet)." on most videos: https://webapps.stackexchange.com/questions/149933/why-does-the-archive-org-of-most-youtube-videos-fail-with-sorry-the-wayback-mac
-
And the interface may be very broken, possibly due to the arabic youtube interface, you might just have to click around sometimes. So the best thing is to just always use the magic direct video link: of form: https://web.archive.org/web/2oe_/http://wayback-fakeurl.archive.org/yt/<video-id>
-
E.g. a deleted pair from Aaron DeWitt (The Chinese Tea):
-
It is possible to get a raw mp4 link with as mentioned at https://forum.videohelp.com/threads/391951-How-to-download-a-YouTube-video-archived-by-Wayback-Machine: https://web.archive.org/web/2oe_/http://wayback-fakeurl.archive.org/yt/<video-id> e.g.: https://web.archive.org/web/2oe_/http://wayback-fakeurl.archive.org/yt/c6uFGGunRxc, the download did work on 2020-12-31. It redirects to
-
Google Docs: visit the page while logged off (e.g. via browser private mode), click the print button, and then archive the resulting PDF URL
How to search by prefix/pattern: https://webapps.stackexchange.com/questions/146661/how-to-search-the-internet-archives-wayback-machine-by-url-prefix-or-pattern
How to archive with an API:
-
https://stackoverflow.com/questions/33811582/how-to-access-wayback-machine-programmatically
-
-
https://blog.archive.org/2019/10/23/the-wayback-machines-save-page-now-is-new-and-improved/
And, yes, of course SPN has a brand new API that you can use to automate a range of Web archiving projects. Please write to us at info@archive.org if you would like to learn more about the API.
No publicly documented API? Docs by email only? Ridiculous, you make me laugh.
One downside of Archive.org is that is is age restricted by some ISPs for containing adult containing, e.gn. notably mobile ISPs. Those idiots, sacrifice freedom of learning for a bit of innefective Pornography ban (色情禁令) protection.
Their Wayback Machine tool archive snapshots of webpages and allows youto browse the history of all archives:
By the Internet Archive organization:
This Chrome extension helps to quickly archive pages: https://chrome.google.com/webstore/detail/wayback-machine/fpnmgdkabkmnadcjpehmlllkndpkmiak
But remember that if you archive too many in a very quick succession before the previous ones have been archived, even if manually through that plugin, your account/IP might get blocked, so just give a few seconds for the current archive to terminate before starting new ones.
This happened to Ciro in 2020-06-14, but then he emailed the admins as mentioned at: https://help.archive.org/hc/en-us/articles/360016379432-Accounts-Tips-Troubleshooting- and they re-enabled it.
As of 2021, it annoys Ciro very much that you have to wait for a few minutes for the archived page to become visible. That’s an eternity when you are writting a text and need to link to an archive.
It is also extremelly annoying that they sometimes change image URLs randomly after they’ve become visible. Why is that? E.g. Ciro saw breaks between different timestamps:
and aso betweeen im
vs if
as in:
so it is just not reliable enough to use generally.
It is possible to request social media pages you own to be removed from the archive in some circumstances: https://webapps.stackexchange.com/questions/143529/is-it-possible-to-request-to-remove-page-snapshots-from-a-personal-social-media
Some website specific techniques:
-
YouTube videos: they do seem to download them!
-
But as as of 2020, they seem to have some serious bugs failing with "Sorry the Wayback Machine does not have this video (<id>) archived (or not indexed yet)." on most videos: https://webapps.stackexchange.com/questions/149933/why-does-the-archive-org-of-most-youtube-videos-fail-with-sorry-the-wayback-mac
-
And the interface may be very broken, possibly due to the arabic youtube interface, you might just have to click around sometimes. So the best thing is to just always use the magic direct video link: of form: https://web.archive.org/web/2oe_/http://wayback-fakeurl.archive.org/yt/<video-id>
-
E.g. a deleted pair from Aaron DeWitt (The Chinese Tea):
-
It is possible to get a raw mp4 link with as mentioned at https://forum.videohelp.com/threads/391951-How-to-download-a-YouTube-video-archived-by-Wayback-Machine: https://web.archive.org/web/2oe_/http://wayback-fakeurl.archive.org/yt/<video-id> e.g.: https://web.archive.org/web/2oe_/http://wayback-fakeurl.archive.org/yt/c6uFGGunRxc, the download did work on 2020-12-31. It redirects to
-
-
Google Docs: visit the page while logged off (e.g. via browser private mode), click the print button, and then archive the resulting PDF URL
How to search by prefix/pattern: https://webapps.stackexchange.com/questions/146661/how-to-search-the-internet-archives-wayback-machine-by-url-prefix-or-pattern
How to archive with an API:
-
https://stackoverflow.com/questions/33811582/how-to-access-wayback-machine-programmatically
-
https://blog.archive.org/2019/10/23/the-wayback-machines-save-page-now-is-new-and-improved/
And, yes, of course SPN has a brand new API that you can use to automate a range of Web archiving projects. Please write to us at info@archive.org if you would like to learn more about the API.
No publicly documented API? Docs by email only? Ridiculous, you make me laugh.
One downside of Archive.org is that is is age restricted by some ISPs for containing adult containing, e.gn. notably mobile ISPs. Those idiots, sacrifice freedom of learning for a bit of innefective Pornography ban (色情禁令) protection.