Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

If you try to manually crawl the post with Internet Archive's "liveweb" project (which inserts stuff into Wayback Machine for live lookups) you'll get:

"Page cannot be crawled or displayed due to robots.txt."

So no, the Internet Archive will most likely not archive public Facebook posts - because of Facebooks robots.txt. The IA respects and honours the domains robots.txt.

I've taken a snapshot of the post with my own script which will put it into a IA Wayback Machine friendly format (WARC).



Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: