Post by @emilymbender@dair-community.social

TagHunt

@TagHunt@infosec.exchange · 3 hours ago

@emilymbender
Good thing they block adblockers then aye?

Sherri W (SyntaxSeed)

@syntaxseed@phpc.social · 5 hours ago

@emilymbender These companies desperately want to stop us from finding & enjoying each other's content. Walled gardens of social media & mega platforms weren't enough, now the search directories will be the ultimate walled garden.

All built on stealing the work of everyone who's ever published anything online.

Why would anyone create anything anymore? I already feel my own desire to blog is waning.

Bruce Simpson, Ph.D.

@bms48@mastodon.social · 5 hours ago

@emilymbender My bigger worry is if they start ensloppifying scholar.google.com. Zotero Connector eats it right up. Semantic Scholar isn't a patch on it for grabbing bibliographic records and tracking references; I have seen noticeable distortion in its results. Their pipeline is somewhat ML driven but they do use LLM summaries. This is expected given the origins of the service. Provenance is everything.

Jessie Kirk • 🏳️‍⚧️ :ace:

@thejessiekirk@ohai.social · 6 hours ago

@emilymbender Kagi 👍

Casdeiro

@casdeiro@15-15-15.social · 6 hours ago

@emilymbender #FuckAI #BoycottGoogle 'Nuff said! 😉

Álvaro G. Molinero 👉 @KimeraGupta

@KimeraGupta@todon.eu · 6 hours ago

@emilymbender Six alternatives to Google Search hosted in Europe

https://european-alternatives.eu/alternative-to/google-search

I usually use Qwant and I'm very happy

European Alternatives

European alternatives to Google Search | European Alternatives

Google is the biggest search engine in the world, from the USA-based company Alphabet.

ℂ𝕖𝕝𝕖𝕤𝕥𝕖@world: /# ▯

@rogue_cells@chaos.social · 7 hours ago

@emilymbender I want my printed encyclopedia back...

Allan

@allancavanagh@mastodon.social · 9 hours ago

@emilymbender so… search is now Google Wave but with robots.

Roadskater, Ph.D.

@roadskater@mastodon.social · 10 hours ago

@emilymbender I saw someone toot about that Tuesday afternoon, and my reaction to the article was, "So TechCrunch is rewriting Google press releases?" Not a critical thought expressed in the article. IIRC, no quotes from anyone at all, positive or negative. Just a list of supposedly useful features and enhancements.

Feh.

Gerhard D.

@GerhardD@olching.social · 10 hours ago

@emilymbender The good news: nobody needs #google. I've blocked all their MTAs, and resisted using their apps for quite some time. No problems so far.

Rob

@Eurospoofer@mastodon.social · 10 hours ago

@emilymbender 2.5 billion users a month they claim.
Having an AI summary forced into every search doesn't make the searcher a user of the tech. How many people simply ignore the summary?

Kristoffer Lawson

@Setok@attractive.space · 10 hours ago

@emilymbender yeah, even for someone who is pro-tech and interested in AI developments, this @TechCrunch piece read like a press release. It was journalistically extremely weak to the point I felt dumber for reading it.

Tommi 🤯

@tommi@pan.rent · 11 hours ago

@emilymbender

SEO is going to be so dead.

To me, it’s Search Engines Ostracism now.

Akash Mondal

@theakashmondal@mstdn.social · 13 hours ago

@emilymbender I use #DuckDuckGo

d@nny disc@ mc²

@hipsterelectron@circumstances.run · 13 hours ago

@emilymbender i'm fucking weeping

Information-gathering agents are an evolution of Google

i have never observed google demonstrate any behavior that struck me as evidence of them gathering any form of information

Links will become an afterthought

that reminds me how they own the w3c and use it to ensure no one who has ever created a webpage will ever be able to show it to anyone without exposing them to the most openly broken cryptography i've ever seen

There’s little time left for publishers to adapt.

openly gloating

which will eventually be free

that's right. we will all be free. that's a cryptographic guarantee

Toni Aittoniemi

@gimulnautti@mastodon.green · 11 hours ago

@hipsterelectron @emilymbender Seems more like we are free to give everything to Google for free and not get any traffic in return. 🤔

Prof. Emily M. Bender(she/her)

@emilymbender@dair-community.social · 14 hours ago

5 years ago (2021) Google researchers Metzler et al put out a preprint talking about how LLMs would change information access ("Rethinking Search"). It was full of TERRIBLE ideas, and Chirag Shah and I wrote a reply ("Situating Search"):

https://dl.acm.org/doi/10.1145/3498366.3505816

>>


Bruce Simpson, Ph.D.

@bms48@mastodon.social · 7 hours ago

@emilymbender The point that their search product was already, to use the vernacular, crap, at this point, must have been lost on them, and Sundar Pichai in particular is responsible for this, as Ed Zitron has neatly elaborated upon elsewhere.

Prof. Emily M. Bender(she/her)

@emilymbender@dair-community.social · 14 hours ago

We followed a couple of years later with further arguments about, inter alia, protecting the information ecosystem:
https://dl.acm.org/doi/10.1145/3649468

While Nora Lindemann was writing about similar ideas:
https://link.springer.com/article/10.1007/s00146-024-01944-w

>>

SpringerLink

Chatbots, search engines, and the sealing of knowledges - AI & SOCIETY

In 2023, online search engine provider Microsoft integrated a language model that provides direct answers to search queries into its search engine Bing. Shortly afterwards, Google also introduced a similar feature to its search engine with the launch of Google Gemini. This introduction of direct answers to search queries signals an important and significant change in online search. This article explores the implications of this new search paradigm. Drawing on Donna Haraway’s theory of Situated Knowledges and Rainer Mühlhoff’s concept of Sealed Surfaces, I introduce the term Sealed Knowledges to draw attention to the increasingly difficult access to the plurality of potential answers to search queries through the output of a singular, authoritative, and plausible text paragraph. I argue that the integration of language models for the provision of direct answers into search engines is based on a de-situated and disembodied understanding of knowledge and affects the subjectivities of its users. At the same time, the sealing of knowledges can lead to an increasing spread of misinformation and may make marginalized knowledge increasingly difficult to find. The paper concludes with an outlook on how to resist the increasing sealing of knowledges in online search.


d@nny disc@ mc²

@hipsterelectron@circumstances.run · 11 hours ago

@emilymbender i want you to know your repeated scientific deconstruction of google's ideological warfare under the guise of a search interface has enabled me to extrapolate at great length how the entire formalisms of automata theory have been constructed to exclude any investigation that could ever produce a paper with more than 1% performance improvement over their state of the art. parsing and formal languages has been dead for several decades.

google calls it a "parser confusion vulnerability" when python maintainers use features of the zip file format in their own published releases that make it impossible for specifically google to insert a cryptographic backdoor onto their users' machines (because google owns pypi), while at the same time the python METADATA file format actively right now supports an "ambiguity" intentionally invisible to human reviewers but instructs the standard packaging software to download and execute code that won't show up in the output.

just as you said:

We revisit foundational work related to information behavior, information seeking, information retrieval, information filtering, and information access to resurface what we know about these fundamental questions and what may be missing.

i very recently realized these questions can be quantified in the field of operating system design, in a really drastic sense that led me to switch my research focus because i'm confident i can convince people every computer should work this way.

just this weekend i realized (quite by mistake) that a fact i'd known since 2019, when google made mozilla and twitter lay off their teams of scientists who had just publicly demonstrated that google chrome and bazel products were neither "fast" nor "correct", in ways that are easily and obviously quantifiable, also represented a shocking and obvious failure in the entire theory of operating system design. not just that computers are slow and fail to protect the user, but how to demonstrate a thrilling counterexample

which is to say, after an intensive literature review (including "standards" for "portable" behavior that were neither), i'm confident i've identified a novel property for computer security which results from a computer performance property that i could already prove 12 ways from sunday. and i now know how to construct a system that achieves both.

i found a dissertation from the single person who tried something pretty close https://circumstances.run/@hipsterelectron/116602585443289491 but otherwise i have performed sufficient literature review to be confident i can express this result in a way that will convince anyone familiar with the field that there's a whole other field they'd been missing this whole time.

and it will run on a computer or phone, to protect people from harm. it demonstrates how the temporal and spatial structure of computer memory in response to user input can be described as a correctness property of the operating system. like i did to google's bazel, it will defeat them on their own terms

thank you

Prof. Emily M. Bender(she/her)

@emilymbender@dair-community.social · 14 hours ago

But all the academic papers in the world showing why something is a bad idea won't stop companies from doing it, if it's profitable and/or fits into their quasi-religious beliefs that "AI" is the future, alas.

So let's look at what Google is up to now, or at least says they are, via TechCrunch as stenographer:

>>

Michael Moore

@mrmoore@mstdn.social · 4 hours ago

@emilymbender Sometimes it feels like these people are trying to create a world from science fiction stories. For example, in Star Trek when they ask the computer something and it just gives them the answer. They don't understand that plenty of science fiction has no basis in reality and/or is terrible for how our brains actually work (like you've pointed out).

That or they are just greedy bastards that are looking to exploit anyone and everyone for a profit.

Bruce Simpson, Ph.D.

@bms48@mastodon.social · 7 hours ago

@emilymbender I'll see you that and raise you: https://www.asc.upenn.edu/news-events/news/largest-quantitative-synthesis-date-reveals-what-predicts-human-behavior-and-how-change-it Secondary reference from the APNIC blog about why IPv6 uptake has failed, which, ironically, is a part of the Internet relatively free from unwanted LLM scraper incursion, so far. The neat Hilbert curve based heat maps for IPv4 address spaces do not map so neatly to IPv6 because of the massively increased address space.

Largest Quantitative Synthesis to Date Reveals What Predicts Human Behavior and How to Change It

Prof. Emily M. Bender(she/her)

@emilymbender@dair-community.social · 14 hours ago

Not satisfied to cut people off from the important sense-making of looking at information in its context and finding and navigating different perspectives (what "AI overviews" do), Google also wants to tell you what to search for:

>>

Screencap from linked article:
With the revamped Search experience, the new search box simply expands to accommodate longer, more conversational queries, rather than making you decide what type of search experience or mode you want to choose at the start of your query. It will also have a new AI-powered query suggestion system that goes beyond autocomplete to help people craft more complex and nuanced queries, Google says.

Prof. Emily M. Bender(she/her)

@emilymbender@dair-community.social · 14 hours ago

How infantilizing --- you thought you were looking to find something that someone else wrote on the web. But woah! Now you've been "dropped into" an "interactive experience". Yeah, Google can just fuck right off with that.

>>

Screencap from linked article:
Instead of returning a simple list of links, Google Search will drop users into AI-powered interactive experiences at times. Google is also introducing tools that can dispatch “information agents” to gather information on a user’s behalf, along with tools that let users build personalized mini apps tailored to their needs.

Prof. Emily M. Bender(she/her)

@emilymbender@dair-community.social · 14 hours ago

Look, I hate pointy-clicky interfaces as much as the next Gen-Xer (let me use the keyboard, dammit) but it is so weird to reduce the important, and importantly effortful, work of navigating the information ecosystem to the apparent drudgery of clicking on links that are (*shudder*) blue!!!

>>

Screencap from linked article:
Google is also introducing agentic capabilities and AI-powered interactive features into the search experience. This means people will spend even less time clicking the traditional blue links that Google Search used to return.

Screencap from linked article:
This shift means that “searching the web” will increasingly be performed by AI agents rather than humans. Instead, people will focus more on acting on the information those agents provide instead of manually clicking links.

Prof. Emily M. Bender(she/her)

@emilymbender@dair-community.social · 14 hours ago

Here is where it really starts to show that this journalist is just lightly paraphrasing a press release. "Links will become an afterthought," will they? What is your evidence for that confident statement about the future?

>>

Screencap from linked article:
Links will become an afterthought with the coming changes to the Search results experience, which builds on Google’s earlier launches of AI search features, like its short summaries known as AI Overviews and its conversational search, AI Mode.

Prof. Emily M. Bender(she/her)

@emilymbender@dair-community.social · 14 hours ago

Spot the magical thinking here. No, the "AI" isn't making sense of anything. It's making papier-mache of the input, and preventing the user from doing the sense-making.

Also, is that the Pokemon sense of "evolution"?

>>

Screencap from linked article:
In 2003, Google launched Google Alerts, a change-detection service that emailed users when new web results matched their search terms. The web was smaller and more manageable then, of course, so this became a part of many information workers’ toolsets. (That service still exists in some form, but is no longer the way most web users go about acquiring new information.)

Information-gathering agents are an evolution of Google Alerts. Beyond spotting changes, they can make sense of them, too.

Prof. Emily M. Bender(she/her)

@emilymbender@dair-community.social · 14 hours ago

To expand just a little bit: the point of a Google Alert was to gain access to things that people were saying about a topic that you were tracking, which you otherwise might not turn up. And every (blue, even!) link that you clicked on brought you to a web page you could examine to get a sense of who was writing, in what context, and why.

>>

Prof. Emily M. Bender(she/her)

@emilymbender@dair-community.social · 14 hours ago

More stenography here. Google started shoving the "AI Overviews" into query results as an opt-out situation. That is, you have to take action to have them not pop up. I don't doubt they are *shown to* 2.5 billion monthly users, but that doesn't mean they are used by as many or desired by them.

>>

Screencap from linked article:
AI Overviews are now used by more than 2.5 billion monthly users; meanwhile, its conversational search mode, launched last year, now tops 1 billion monthly users. (ChatGPT, for comparison, has 900 million weekly active users, as of earlier this year. This suggests that ChatGPT is now seeing more frequent engagement, with users coming back repeatedly throughout the week, while Google has more total unique people touching its AI features over the course of a month.)

Qybat

@Qybat@batchats.net · 11 hours ago

@emilymbender Priority is on maximising usage. Sure, they lose money on every user, but they will make it up on scale.