Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The robots.txt time makes it at times easier to scrape a target by the info that a website can reveal in it (e.g. allow a specific bot to scrape all). Their sitemaps are another gem.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: