Ever more websites are using #Cloudflare to block #AI scrapers. Cloudflare is still a man-in-the-middle #MITM attack on the web, but I do think people should have the ability to block the AI crap. So I now have some sympathies for using Cloudflare. What if we had real gov #eID that could be used for captchas? This requires privacy-respecting services that only see the data they need, e.g. "are you a human with an eID? yes/no". There are concerns with eIDs but in implementations not the core idea
@eighthave but a client side scrupt could automate the interaction witha real id card then automate scrappung using this id.
And no doubt people will rent the use of their id by scrapping companies...
@tuxicoman sounds like a well known problem with known solutions, for example, APIs with rate limiting, tokens, etc.
@eighthave you mean throttling usage per "detected" same request origin (same id here)?
@tuxicoman yeah, I think putting legal obligations on AI companies is really something that should happen. It is funny, after all these years of trying to reduce the usage of #copyright, I think it is a good idea that AI companies have to respect copyright. The law clearly has a key role to play here.