CommunityNews

Playing games with AIs: The limits of GPT-3 and similar large language models

Playing Games with Ais: The Limits of GPT-3 and Similar Large Language Models - Minds and Machines.
This article contributes to the debate around the abilities of large language models such as GPT-3, dealing with: firstly, evaluating how well GPT does in the Turing Test, secondly the limits of such models, especially their tendency to generate falsehoods, and thirdly the social consequences of the problems these models have with truth-telling. We start by formalising the recently proposed notion of reversible questions, which Floridi & Chiriatti (2020) propose allow one to ‘identify the nature of the source of their answers’, as a probabilistic measure based on Item Response Theory from psychometrics. Following a critical assessment of the methodology which led previous scholars to dismiss GPT’s abilities, we argue against claims that GPT-3 completely lacks semantic ability. Using ideas of compression, priming, distributional semantics and semantic webs we offer our own theory of the limits of large language models like GPT-3, and argue that GPT can competently engage in various semantic tasks. The real reason GPT’s answers seem senseless being that truth-telling is not amongst them. We claim that these kinds of models cannot be forced into producing only true continuation, but rather to maximise their objective function they strategize to be plausible instead of truthful. This, we moreover claim, can hijack our intuitive capacity to evaluate the accuracy of its outputs. Finally, we show how this analysis predicts that a widespread adoption of language generators as tools for writing could result in permanent pollution of our informational ecosystem with massive amounts of very plausible but often untrue texts.

Read in full here:

This thread was posted by one of our members via one of our news source trackers.

View thread on forum

#games

0 609 0

2023-01-08 01:31:35 UTC

Where Next?

View thread on forum

games

Home General Dev>In The News

#games

0 609 0

Last post

Popular General Dev topics

General Dev>In The News

Russia wants to ban the use of secure protocols such as TLS 1.3, DoH, DoT, ESNI

Quite scary if you ask me. And it seems China is already blocking TLS 1.3 traffic with their Great Firewall. On the other hand it’s a co...

#internet #encryption #censorship

1 971 1

2020-09-23 19:12:33 UTC

New

General Dev>In The News

How to design a good API and why it matters (2006)

ABSTRACT In lieu of a traditional , I’ve tried to distill the essence of the talk into a collection of maxims: All programmers are API ...

dl.acm.org

#api #design

2 1407 1

2022-10-07 10:11:24 UTC

New

General Dev>In The News

Writing a Python SQL engine from scratch

sqlglot/python_sql_engine.md at main · tobymao/sqlglot. Python SQL Parser and Transpiler. Contribute to tobymao/sqlglot development by c...

github.com

/python #sql #writing

0 1647 0

2023-01-27 14:38:38 UTC

New

General Dev>In The News

ONNX Runtime merges WebGPU backend

[js/web] WebGPU backend via JSEP by fs-eire · Pull Request #14579 · microsoft/onnxruntime. Description This change introduced the follo...

github.com

#backend

0 1372 0

2023-04-25 15:04:03 UTC

New

General Dev>In The News

Apple Patents Suggest Future AirPods Could Monitor Biosignals and Brain Activity

Apple Patents Suggest Future AirPods Could Monitor Biosignals & Brain Activity - AppleMagazine. The US Patent & Trademark Office...

applemagazine.com

#apple #monitor

0 1477 0

2023-10-11 01:56:37 UTC

New

General Dev>In The News

Go Package for Building Progressive Web Apps

A Go package for building Progressive Web Apps. A package for building progressive web apps (PWA) with the Go programming language (Gola...

go-app.dev

#apps /go #web /wasm

2 1283 1

2023-10-22 13:16:10 UTC

New

General Dev>In The News

Review of Linux on Minisforum V3 AMD Ryzen Tablet

A Brief Review of the Minisforum V3 AMD Tablet. Update: I have created an awesome-minisforum-v3 GitHub repository to list information fo...

mudkip.me

#linux #review #amd

0 4635 0

2024-06-24 02:26:38 UTC

New

General Dev>In The News

Self-Hosting a Firefox Sync Server

After switching from Firefox to LibreWolf, I became interested in the idea of self-hosting my own Firefox Sync server. Although I had see...

blog.diego.dev

#hosting #firefox

0 1154 0

2025-03-09 03:43:04 UTC

New

General Dev>In The News

Phlex for Rails Emails: Action Mailer without ERB

Rendering Action Mailer emails with Phlex components and layouts: Clean, Composable, and Completely Ruby - Blog post by Camillo Visini

camillovisini.com

/rails #emails

0 805 0

2025-03-11 18:50:49 UTC

New

General Dev>In The News

GitSyncPad - Effortless Git Version Control

GitSyncPad is an innovative micro keypad designed for effortless Git version control. Execute commands like git add, git commit, and git ...

gitsyncpad.xyz

#git

0 616 0

2025-03-13 01:42:30 UTC

New

Other popular topics

Backend>Learning Resources

Seven Languages in Seven Weeks

Ruby, Io, Prolog, Scala, Erlang, Clojure, Haskell. With Seven Languages in Seven Weeks, by Bruce A. Tate, you’ll go beyond the syntax—and...

pragprog.com

#pragprog /clojure /erlang /haskell /prolog /ruby /scala #published-book /book-seven-languages-in-seven-weeks

5 5730 1

2022-01-20 13:48:55 UTC

New

Android>Learning Resources

Kotlin and Android Development featuring Jetpack: Build Better, Safer Android Apps

Start building native Android apps the modern way in Kotlin with Jetpack's expansive set of tools, libraries, and best practices. Learn h...

pragprog.com

#pragprog #android #game-dev /kotlin #published-book /book-kotlin-and-android-development-featuring-jetpack

7 5084 1

2020-11-03 20:38:30 UTC

New

Backend>Questions

Erlang's not installing on macOS Big Sur "You are natively building Erlang/OTP for a later version of MacOSX than current version"

Just done a fresh install of macOS Big Sur and on installing Erlang I am getting: asdf install erlang 23.1.2 Configure failed. checking ...

#macos /erlang #big-sur #asdf

10 6212 8

2021-01-16 12:33:23 UTC

New

General Dev>Hardware

BIIP MT3 Extended 2048 Custom Keycap Set (Drop)

This looks like a stunning keycap set :orange_heart: A LEGENDARY KEYBOARD LIVES ON When you bought an Apple Macintosh computer in the e...

/keyboards #apple #keycaps #mechanical-keyboards

14 6713 7

2020-12-12 19:58:26 UTC

New

General Dev>Dev Chat

Roc Language - a new purely functional programming language built for speed and ergonomics

Hi folks, I don’t know if I saw this here but, here’s a new programming language, called Roc Reminds me a bit of Elm and thus Haskell. ...

#programminguages #functional-programming

49 5164 14

2021-11-10 20:03:09 UTC

New

Backend>Chat

Data Structures and Algorithms with Elixir

This is going to be a long an frequently posted thread. While talking to a friend of mine who has taken data structure and algorithm cou...

/elixir #algorithms #data-structures

108 11869 31

2024-11-14 02:14:00 UTC

New

Frontend>Chat

Online Hand to eye coordination test

Was just curious to see if any were around, found this one: I got 51/100: Not sure if it was meant to buy I am sure at times the b...

#online-tools

4 4562 1

2022-03-27 10:53:45 UTC

New

Backend>Learning Resources

Programming Ruby 3.2 (5th Edition)

Programming Ruby is the most complete book on Ruby, covering both the language itself and the standard library as well as commonly used t...

twitter.com

#pragprog /ruby /rails #published-book /book-programming-ruby-3-2-5th-edition

28 8550 13

2024-11-17 04:34:14 UTC

New

General Dev>Questions

Do you prefer regular mechanical keyboards or low profile mechanical keyboards and why?

I have always used antique keyboards like Cherry MX 1800 or Cherry MX 8100 and almost always have modified the switches in some way, like...

/keyboards #mechanical-keyboards

27 3877 9

2023-02-06 21:10:15 UTC

New

Backend>Learning Resources

Advanced Functional Programming with Elixir

Use advanced functional programming principles, practical Domain-Driven Design techniques, and production-ready Elixir code to build scal...

joekoski.com

#pragprog /elixir #published-book #functional-programming /book-advanced-functional-programming-with-elixir

43 4989 22

2025-10-06 09:04:44 UTC

New

General Dev>In The News

Qalculate time hacks

General Dev>In The News

Space Datacenters - if you see a datacenter in orbit, it means something went wrong on Earth

General Dev>In The News

Distinguishing variables from parameters

General Dev>In The News

"One Hot Node"

General Dev>In The News

Salary information to be shown on job ads under new laws

General Dev>In The News

Digital Bandung

General Dev>In The News

If HEIC has no haters I’m dead

General Dev>In The News

I'm a USB-C Maximalist

General Dev>In The News

Introducing Precursor: detecting agentic behavior with continuous client-side signals

General Dev>In The News

Dell sued by Finnish company over $70m price increase for data centre servers

General Dev>In The News

General Dev In The News ❯

Latest on Devtalk

AI didn’t replace our Security Team, it multiplied it

AI>In The News

Visuali.io: AI Image Generator & Photo Editor

AI>In The News

Terence McKenna's Mega Bad Trip

Science/Tech>Science

Understanding the Rust hype for the busy developer

Backend>In The News

Qalculate time hacks

General Dev>In The News

Grails v8.0.0-M4 released!

Backend>Official News

Agents Are Invention Machines

AI>In The News

Fable 5.11.0 released!

Frontend>Official News

Claude Code: Anatomy of a Misfeature

AI>In The News

Linus Torvalds to critics of AI coding in Linux: "Fork it. Or just walk away."

Linux>In The News

It's official: EU will force Google to share search data and open up AI on Android

Android>In The News

Kimi K3 - Intelligence, Performance & Price Analysis

AI>In The News

Introducing LM Studio Bionic: the AI agent for open models

AI>In The News

Fable 5.10.0 released!

Frontend>Official News

Crystal 1.21.0 released!

Backend>Official News

Devtalk ❯

We ❤️ helpful members!

We reward our most helpful members via our MOTM scheme - by giving away a whopping 25 books per year!

Sub Categories:

We're in Beta

About us Mission Statement See our Roadmap

Playing games with AIs: The limits of GPT-3 and similar large language models

CommunityNews

Playing games with AIs: The limits of GPT-3 and similar large language models

Where Next?

Popular General Dev topics

Russia wants to ban the use of secure protocols such as TLS 1.3, DoH, DoT, ESNI

How to design a good API and why it matters (2006)

Writing a Python SQL engine from scratch

ONNX Runtime merges WebGPU backend

Apple Patents Suggest Future AirPods Could Monitor Biosignals and Brain Activity

Go Package for Building Progressive Web Apps

Review of Linux on Minisforum V3 AMD Ryzen Tablet

Self-Hosting a Firefox Sync Server

Phlex for Rails Emails: Action Mailer without ERB

GitSyncPad - Effortless Git Version Control

Other popular topics

Seven Languages in Seven Weeks

Kotlin and Android Development featuring Jetpack: Build Better, Safer Android Apps

Erlang's not installing on macOS Big Sur "You are natively building Erlang/OTP for a later version of MacOSX than current version"

BIIP MT3 Extended 2048 Custom Keycap Set (Drop)

Roc Language - a new purely functional programming language built for speed and ergonomics

Data Structures and Algorithms with Elixir

Online Hand to eye coordination test

Programming Ruby 3.2 (5th Edition)

Do you prefer regular mechanical keyboards or low profile mechanical keyboards and why?

Advanced Functional Programming with Elixir

Sponsor Spotlight

General Dev>In The News

Latest on Devtalk

We ❤️ helpful members!

Devtalk Sponsors

Categories:

Sub Categories:

Popular Portals

Devtalk Sponsors

We're in Beta