Ever more websites are using to block scrapers. Cloudflare is still a man-in-the-middle attack on the web, but I do think people should have the ability to block the AI crap. So I now have some sympathies for using Cloudflare. What if we had real gov that could be used for captchas? This requires privacy-respecting services that only see the data they need, e.g. "are you a human with an eID? yes/no". There are concerns with eIDs but in implementations not the core idea

@eighthave but a client side scrupt could automate the interaction witha real id card then automate scrappung using this id.

And no doubt people will rent the use of their id by scrapping companies...

@tuxicoman sounds like a well known problem with known solutions, for example, APIs with rate limiting, tokens, etc.

@eighthave you mean throttling usage per "detected" same request origin (same id here)?

@eighthave

It looks you want to do the same as copyright holders. Control usage of published content. DRM in a sense.

There could be watermarks in your content you could retrieve in the AI outputs but it's fragile.

There could be legal obligations by AI companies to justify the source of their materials (we have it for food, intellectual property) and show proof of rights of use.

How can AI comapnies train of anna's archive and it's fine for them....only.

Follow

@tuxicoman yeah, I think putting legal obligations on AI companies is really something that should happen. It is funny, after all these years of trying to reduce the usage of , I think it is a good idea that AI companies have to respect copyright. The law clearly has a key role to play here.

Sign in to participate in the conversation
Librem Social

Librem Social is an opt-in public network. Messages are shared under Creative Commons BY-SA 4.0 license terms. Policy.

Stay safe. Please abide by our code of conduct.

(Source code)

image/svg+xml Librem Chat image/svg+xml