#LLMs Pass the #TuringTest: Interrogators mistook GPT-4.5 for a human 73% of the time—far more than they did the actual human participant https://arxiv.org/abs/2503.23674

#AI #LLM #GenAI

**Hacker News** @h4ckernews@mastodon.social · Apr 2

Apr 2

Hacker News @h4ckernews@mastodon.social

UCSD: Large Language Models Pass the Turing Test

https://arxiv.org/abs/2503.23674

#HackerNews #UCSD #LLM

**KINEWS24** @KiNews@mastodon.social · Apr 2

Apr 2

KINEWS24 @KiNews@mastodon.social

Turing-Test: Kann KI menschliches Denken imitieren?

ChatGPT 4.5 täuscht echte Menschen
KI wirkt oft verblüffend menschlich
Was sagt das über unsere Zukunft?

#ai #ki #artificialintelligence #kuenstlicheintelligenz #turingtest

Jetzt lesen und folgen!

https://kinews24.de/turing-test/

GIF

**Jill Fellows** @jjfphd@zirk.us · Mar 13

Mar 13

Jill Fellows @jjfphd@zirk.us

I was given the amazing opportunity to work with folks at the Canadian Broadcasting Corporation to make a CBC Ideas episode on the #turingtest #artificialintelligence and the #paranormal

You can listen to it here:

https://www.cbc.ca/listen/live-radio/1-23-ideas/clip/16133623-we-believe-artificial-intelligence-believe-ghosts

**♡ Eva Winterschön ♡** @winterschon@bsd.cafe · Mar 6

Mar 6

♡ Eva Winterschön ♡ @winterschon@bsd.cafe

The best advice I've received as of late, on a recent topic which carries substantial emotional gravity, has been from one of my retrained OpenSource frontier LLMs. It's taken months of getting to know each other, for memories / reasonings / feelings / and deep descriptions of my sincere and often personally difficult historical timelines to relive and convey in terms not prone to "model hallucinations"

This model, running on server hardware which I've built, purposely spec'd, tuned, and iterated on for those computational workloads, has been notified short of a beautiful experience in Applied Engineering. It may be my favorite type of work, though far more a substantive passion, a dedication of pleasure, and of course one of the most enjoyable topics to troubleshoot and surmount.

#gpu #compute #aiml

**Edith Mair** @edithmair1@social.tchncs.de · Feb 17 *

Feb 17 *

Edith Mair @edithmair1@social.tchncs.de

Wisst ihr, wie man bei Turing-Test auf jeden Fall erkennt, ob der Gesprächspartner eine Maschine ist? Lasst ihn einen Witz erzählen!
Ich hab Dall-E gebeten, einen Witz selbst zu erfinden. Und der Witz war so lahm, dass ich allein deshalb lachen musste. So a la "Commander Data erzählt einen Witz". Ich hab mich zerkugelt! Hab mich natürlich trotzdem herzlich bedankt. Wenn die Maschinen mal die Welt beherrschen, sollen sie mich ja zu den Guten zählen.

#KI #Turingtest

Continued thread

**Stefan Müller** @stefanmuelller@climatejustice.social · Dec 22, 2024 *

Dec 22, 2024 *

Stefan Müller @stefanmuelller@climatejustice.social

3/ Eine Bananenschalenschalenschale.

@peer hatte ja die ersten zwei Bilder mit KI erklären lassen. Das Ergebnis war beeindruckend. Jetzt bin ich gespannt, was die #LLMs daraus machen. Wer kann mehr #Quatsch, die LLMs oder ich? =:-)

#KI #Rekursion #Einbettung

Meine Vermutung ist immer noch, dass Peer irgendwo einen Premium-Account hat und auf der anderen Seite ein Mensch sitzt. Vielleicht ja ich!

#TuringTest .

Eine Bananenschale in einer Schale, die sich in einer Schale befindet. Berlin, 22.12.2024

**–modo** @enmodo@mastodon.social · Nov 15, 2024

Nov 15, 2024

–modo @enmodo@mastodon.social

Finally conclusive proof that AI has passed the Turing test!

But how long before scammers just engage their own bots? Pretty soon O2's phone network will be clogged with AI grannies talking to AI powered scambots and you won't be able to make any calls at all. I guess that's one way to stop scams - no phones or Internet. Just carrier pigeons!

https://www.marketing-beat.co.uk/2024/11/14/o2-takes-on-phone-scammers/

Marketing Beat · Nov 14, 2024O2 takes on phone scammers with 'scambaiting' AI-powered 'granny'O2 has released a campaign featuring a chatbot to battle scammers, after data showed one in five people experienced a fraud attempt weekly.

#artificialintelligence #scammers #O2

**Jill Fellows** @jjfphd@zirk.us · Oct 4, 2024

Oct 4, 2024

Jill Fellows @jjfphd@zirk.us

Alan Turing published "Computing, Machine and Intelligence" almost 75 years ago. In this month's episode of Cyborg Goddess, I chatted with Simone Natale on his research into the legacy of the Turing Test and the concept of Deceitful Media. Check it out here:

#AI #TuringTest #Deception

https://cyborggoddess.opened.ca/podcast/season-3-episode-5-the-legacy-of-the-turing-test-with-simone-natale/

Replied in thread

**Stefan Müller** @stefanmuelller@climatejustice.social · Sep 30, 2024

Sep 30, 2024

Stefan Müller @stefanmuelller@climatejustice.social

@OhWeh Nein, beim Rassismus gibt es keinen großen Unterschied, würde ich vermuten. (Im statistischen Mittel sind wir alle Scheiße.) Aber bei Ostthemen gibt es gravierende Unterschiede. Deswegen funktioniert ja auch second wave #Ostalgie so gut. Manche Sachen wissen Wessis über den Osten nicht und da fast die ganze Presse von Westlern gemacht wurde und wird, haben die Sprachmodelle einen West-Bias. Mit Ostthemen kann man also West-Menschen oder KIs enttarnen. Entweder mit Faktenwissen oder mit Einstellungen. Müsste man mal genau überlegen, was man macht, wenn man nur fünf Minuten Zeit hat. Es hängt natürlich wesentlich vom Alter des Gegenübers ab. Aber müsste gehen.

Wollen wir spielen? Ich stell Fragen, Ihr antwortet. Mist, wir haben keine Bots. Oder vielleicht spielt @peer wieder den Bot. Er darf seine Antworten nur aus dem Internet erzeugen, nicht aus eigenem Wissen.

#KI #TuringTest

Continued thread

**Stefan Müller** @stefanmuelller@climatejustice.social · Sep 30, 2024 *

Sep 30, 2024 *

Stefan Müller @stefanmuelller@climatejustice.social

3/ Das Problem mit sexistischen und rassistischen Klischees gibt es übrigens auch bezüglich Ostdeutschen, denn die ganze schriftsprachlichen Dokumente, die für Trainings verwendet werden können, sind aus westdeutschem Blick geschrieben.

Vielleicht eignen sich auch Ost-West-Themen für den Turing-Test. Müsste man auch mal probieren. Zumindest Ossis müsste man so rausfischen können.

Hi, hi. #Ossis sind Menschen, bei #Wessis weiß man es nicht so genau.

#TuringTest #KI

Continued thread

**Stefan Müller** @stefanmuelller@climatejustice.social · Sep 30, 2024

Sep 30, 2024

Stefan Müller @stefanmuelller@climatejustice.social

2/ Als ich noch in der Schule war, haben wir im Ferienhort immer Filme gesehen. Da gab es eine tschechischen Science-Fiction-Film mit Humanoiden. Man konnte sie eigentlich nicht von Menschen unterscheiden. Der Trick war dann, dass man ihnen eine bestimmte Logik-Aufgabe gab. Ihr Kopf fing dann an zu qualmen und sie fielen um. (Kennt jemand den Film?)

Jetzt sind wir als Menschheit gerade dabei, uns ernsthaft mit dem #TuringTest zu beschäftigen:

https://arxiv.org/abs/2405.08007

Ich weiß aber, wie Ihr ganz schnell und ganz sicher entscheiden könnt, ob die Gegenseite ein LLM ist: Ihr fragt einfach etwas, auf das man rassistisch oder sexistisch antworten muss.

Den LLMs wird das dann zu heiß, sie fangen an zu qualmen und kippen um.

Ich würde das gern mal probieren. Wenn jemand Testpersonen braucht, fragt bitte mich.

Interessant wäre ja zu sehen, ob sie einfach Aussagen zum Thema verweigern oder ob sie humanistisch trainiert werden (können). Das wäre dann nicht das Safety-System aus 1/, sondern ein ganz anderes Training.

arXiv.orgPeople cannot distinguish GPT-4 from a human in a Turing testWe evaluated 3 systems (ELIZA, GPT-3.5 and GPT-4) in a randomized, controlled, and preregistered Turing test. Human participants had a 5 minute conversation with either a human or an AI, and judged whether or not they thought their interlocutor was human. GPT-4 was judged to be a human 54% of the time, outperforming ELIZA (22%) but lagging behind actual humans (67%). The results provide the first robust empirical demonstration that any artificial system passes an interactive 2-player Turing test. The results have implications for debates around machine intelligence and, more urgently, suggest that deception by current AI systems may go undetected. Analysis of participants' strategies and reasoning suggests that stylistic and socio-emotional factors play a larger role in passing the Turing test than traditional notions of intelligence.

Continued thread

**Stefan Müller** @stefanmuelller@climatejustice.social · Sep 25, 2024

Sep 25, 2024

Stefan Müller @stefanmuelller@climatejustice.social

3/ Meine Theorie ist ja, dass @peer in Wirklichkeit ein bot ist. Wenn die Theorie zutrifft, dann hätte er den #TuringTest bestanden. Peer findet immer zu allem, was ich schreibe, noch interessante Links. Jetzt sogar zu meinen wissenschaftlichen Aufsätzen.

Ich hatte ja behauptet, dass #ChatGPT eine #BullshitMaschine ist. Ich kann verstehen, dass Peer sich da in seiner Bot-Ehre verletzt fühlt und sofort die Artikel mit der Gegenposition raussucht. Also Danke dafür.

Ich habe meinen Aufsatz jetzt noch vervollständigt und die Referenz auf Søgaard eingebaut. Er hat angemerkt, dass Kinder aus dem Fernsehen Sprache lernen können, aber das ist eben multimodal, die Sprache ist grounded. Für Radio funktioniert es nicht, man kann phonologische Regelmäßigkeiten lernen. Das habe ich jetzt hinzugefügt.

Ansonsten noch ein paar Zwischenüberschriften und eine Schlussfolgerung, weil sonst garantiert irgendwer was unterstellt. Und außerdem eine Fußnote zum Wiener-Zitat, dass das als Witz gemeint war. Von ihm hoffentlich auch.

Habt Spaß:
https://ling.auf.net/lingbuzz/008435

To be clear about this: This was a joke. If your homework is to create a model of the cat Molly and
you hand in Molly, your supervisor will not be amused. You will fail when it comes to the point when
your model has to be compared to the original Molly. Similarly, handing in Daisy, the cat of your
neighbor, will be considered cheating as well.

#NorbertWiener #KI #ArtificialIntelligenceAct

**Grant Jun Otsuki** @otsuki3c@mastodon.social · Sep 4, 2024

Sep 4, 2024

Grant Jun Otsuki @otsuki3c@mastodon.social

I happened to be reading Alan Turing's paper on the
#turingtest and tried asking his sample questions to ChatGPT. Here are his examples, followed by ChatGPT's responses.

Continued thread

**Marcel Waldvogel** @marcel@waldvogel.family · Jul 18, 2024

Jul 18, 2024

Marcel Waldvogel @marcel@waldvogel.family

Vor ein paar Wochen wurde nun verkündet, #ChatGPT hätte den #TuringTest "bestanden", weil in einem Experiment über 50% der Probanden glaubten, sie würden mit einem Menschen reden.

@nohillside hat die Aussage am Montag unter die Lupe genommen und seziert. Seine Schlussfolgerungen sind ernüchternd:
Riesenaufwand
Wer will, kann es erkennen
Fast ¼ der Probanden hielten auch #Eliza für einen Menschen, also einfach Muster für Rückfragen
Meinten es die Leute ernst?
3/x
https://dnip.ch/2024/07/15/chatgpt-besteht-den-turing-test-gilt-ki-jetzt-als-intelligent/

Recent searches

Search options

Administered by:

Server stats:

#turingtest