CommunityNews

CommunityNews

Developer survey shows trust in AI coding tools is falling as usage rises

“AI solutions that are almost right, but not quite” lead to more debugging work.

Read in full here:

Most Liked

brennan

brennan

Probably the best way to use them now is not to trust 100% the code that they generate.

jmagnani

jmagnani

Might take a couple of years more, but it will get better.

MyrddinE

MyrddinE

‘Survey shows that when Devs start to use the tools, they realize they are unreliable’.

Assuming the same architecture, it will probably get better… but not because LLMs stop hallucinating. It’ll get better because they’ll get larger context windows, and they’ll start putting the docs for the frameworks and languages into that context window before starting. It’ll get better when they can reliably test their own work before submitting. And it’ll get better when they have better processes around workflow incorporated into their programming system prompts.

These are all low hanging fruit, so I expect this to be rapidly improving in the coming months.

One thing that I have learned is that documenting what is not supported is often just as important as documenting what is supported, but it’s a commonly overlooked task in documentation leading to both human and AI hallucinations. For example, if a language structure only supports && conditions an LLM might hallucinate a solution that utilizes || conditions. Improving the documentation for LLMs will also help humans.

Where Next?

Popular Ai topics Top

AstonJ
Well done DeepMind… wonder what else they’re working on… One of biology’s biggest mysteries has been solved using artificial intelligen...
New
First poster: CommunityNews
The use of facial recognition for surveillance, or algorithms that manipulate human behaviour, will be banned under proposed EU regulatio...
New
First poster: CommunityNews
Many recent big advances in tech have one key thing at the heart of then: artificial intelligence.
New
CommunityNews
We present Imagen, a text-to-image diffusion model with an unprecedented degree of photorealism and a deep level of language understandin...
New
First poster: bot
Ghostwriter generates, completes, or transforms code in 16 languages, similar to GitHub Copilot.
New
New
First poster: gflashner
Google’s openly available Gemma collection of AI models has reached a milestone: over 150 million downloads. Omar Sanseviero, a developer...
New
First poster: mercyf
Google’s Veo 3 delivers AI videos of realistic people with sound and music. We put it to the test.
New
New
CommunityNews
Comparison and ranking the performance of over 100 AI models (LLMs) across key metrics including intelligence, price, performance and spe...
New

Other popular topics Top

Exadra37
I am thinking in building or buy a desktop computer for programing, both professionally and on my free time, and my choice of OS is Linux...
New
brentjanderson
Bought the Moonlander mechanical keyboard. Cherry Brown MX switches. Arms and wrists have been hurting enough that it’s time I did someth...
New
AstonJ
Thanks to @foxtrottwist’s and @Tomas’s posts in this thread: Poll: Which code editor do you use? I bought Onivim! :nerd_face: https://on...
New
New
AstonJ
Biggest jackpot ever apparently! :upside_down_face: I don’t (usually) gamble/play the lottery, but working on a program to predict the...
New
PragmaticBookshelf
Build efficient applications that exploit the unique benefits of a pure functional language, learning from an engineer who uses Haskell t...
New
AstonJ
Was just curious to see if any were around, found this one: I got 51/100: Not sure if it was meant to buy I am sure at times the b...
New
PragmaticBookshelf
Author Spotlight Mike Riley @mriley This month, we turn the spotlight on Mike Riley, author of Portable Python Projects. Mike’s book ...
New
PragmaticBookshelf
Fight complexity and reclaim the original spirit of agility by learning to simplify how you develop software. The result: a more humane a...
New
mindriot
Ok, well here are some thoughts and opinions on some of the ergonomic keyboards I have, I guess like mini review of each that I use enoug...
New