AI companies are violating a basic social contract of the web and ignoring robots.txt
www.theverge.com
Andy Reid@lemmy.world to Technology@lemmy.world · English · 2 years ago
Pretzilla@lemmy.world · 2 years ago
Just thought of a nasty hack the browser makers (or hackers) could use to scrape unlisted sites: surreptitiously logging user browser history for a crawl list.
Spotlight7573@lemmy.world · 2 years ago
While there are some extensions that do this, last I saw Google didn’t use Chrome for populating Search: https://blogs.perficient.com/2017/03/15/does-google-use-chrome-to-discover-new-urls-for-crawling/