I noticed something upsetting(?) in my search results yesterday. Search engine listings for a Wiktionary article had some weird text sort-of-summarizing the page. This text is *not* present in the page itself.

Is this from an LLM? If so, which party is generating the summary?

I don't *think* it's DDG, or Wiktionary. (More in thread.)

There *is* a way to set page descriptions in MediaWiki that isn't visible in the normal editing view. I don't know how it works, and couldn't find it. So I can't rule it out for sure.
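One way to narrow down the page-description possibility is to look at the rendered HTML rather than the wikitext. Here is a rough sketch, assuming that a page-level description (however MediaWiki sets it) would surface as a `<meta>` description tag in the HTML a crawler sees; I haven't confirmed that. The names `DescriptionFinder` and `check_snippet` are made up for this example, and the sample page is invented, not the real Wiktionary markup.

```python
from html.parser import HTMLParser


class DescriptionFinder(HTMLParser):
    """Collect description-like <meta> tags and all text nodes of a page."""

    # Meta keys that commonly carry a page summary (an assumption, not a full list).
    KEYS = {"description", "og:description", "twitter:description"}

    def __init__(self):
        super().__init__()
        self.descriptions = {}
        self.text_parts = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        key = (attr.get("name") or attr.get("property") or "").lower()
        if key in self.KEYS and attr.get("content"):
            self.descriptions[key] = attr["content"]

    def handle_data(self, data):
        # Includes script/style text too; good enough for a substring check.
        self.text_parts.append(data)


def normalize(text):
    """Lowercase and collapse whitespace so line breaks don't hide a match."""
    return " ".join(text.lower().split())


def check_snippet(html, snippet):
    """Report whether `snippet` appears in meta descriptions or in page text."""
    finder = DescriptionFinder()
    finder.feed(html)
    finder.close()
    needle = normalize(snippet)
    return {
        "descriptions": finder.descriptions,
        "in_meta": any(needle in normalize(d) for d in finder.descriptions.values()),
        "in_text": needle in normalize(" ".join(finder.text_parts)),
    }


# Invented sample page, standing in for the saved HTML of the real article.
sample = (
    '<html><head><meta name="description" content="Learn how to chew.">'
    "</head><body><p>To chew thoroughly.</p></body></html>"
)
result = check_snippet(sample, "learn how to")
print(result["in_meta"], result["in_text"])
```

To try it on the real page, save the article's HTML (for example with "view source") and pass it in along with the odd snippet text. If the snippet shows up in neither place, the page itself is an unlikely source, though this can't say which party generated it.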

It reads a little bit like the output of a cheeky editor going a little silly with metaphorical meanings of the word (all present in the quotation list in the page). But it reads more like an LLM's dumb-ass summary. Particularly that "learn how to": very LLM-flavored.

I remember something about the Wikimedia Foundation trying to push LLMs (for summaries), but I think that was nixed after, erm, strong community pushback.

Other search engine experiments:

- Kagi shows this weird shit too, as does Bing (though truncated).
- DDG shows a *normal* excerpt if I search for `fletcherize site:en.wiktionary.org`.
- Google shows a normal excerpt.

My best guess at this point is that one of the search engine sources that DDG and Kagi both use (perhaps Bing?) is LLM-summarizing crawl entries.

I'm not sure how to verify this. Any ideas?

I filed feedback with DDG and Kagi.

reddit.com/r/duckduckgo/commen -- people are saying that yeah, this is coming from Bing.

kagifeedback.org/d/7814-unwant -- official response is that this is unwanted behavior, although they haven't specifically identified the source (but implied that it's a search provider).

@varx Bing wouldn't be much of a surprise, as it is a Microsoft product, and Microsoft has been pushing LLMs hard.

Librem Social

Librem Social is an opt-in public network. Messages are shared under Creative Commons BY-SA 4.0 license terms. Policy.