- cross-posted to:
- technology@lemmy.ml
and as always, the culprit is ChatGPT. Stack Overflow Inc. won’t let their mods take down AI-generated content
I imagine it’ll be possible in the near future to improve the accuracy of technical AI content fairly easily. It’d go something like this: have an LLM generate a candidate response, then have a second LLM validate that response. The validator would have access to real references it could use to check correctness, e.g. a Python answer could be run through a Python interpreter to confirm it does, to some extent, what it purports to do. The validator then either decides the output is most likely correct, or sends feedback asking the first LLM to revise until it passes validation. This wouldn’t catch 100% of errors, but a process like this could significantly reduce the frequency of hallucinations, for example.
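A minimal sketch of that generate-validate-revise loop, assuming Python throughout. The LLM calls (`generate_candidate`, `revise_candidate`) are hypothetical stubs for whatever API you’d wire up, and the validator here only checks that the code runs without crashing, which is a much weaker check than full correctness:

```python
import subprocess
import sys
import tempfile

def generate_candidate(prompt: str) -> str:
    """Hypothetical generator LLM call; returns Python source."""
    raise NotImplementedError("wire up your LLM API here")

def revise_candidate(prompt: str, code: str, error: str) -> str:
    """Hypothetical call asking the LLM to fix its own output."""
    raise NotImplementedError("wire up your LLM API here")

def validate(code: str, timeout: float = 5.0) -> tuple[bool, str]:
    """Run the candidate in a fresh interpreter; a nonzero exit means 'revise'."""
    with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
        f.write(code)
        path = f.name
    result = subprocess.run(
        [sys.executable, path], capture_output=True, text=True, timeout=timeout
    )
    return result.returncode == 0, result.stderr

def answer(prompt: str, max_rounds: int = 3) -> str:
    """Generate, then loop: validate and feed errors back until it passes."""
    code = generate_candidate(prompt)
    for _ in range(max_rounds):
        ok, error = validate(code)
        if ok:
            return code
        code = revise_candidate(prompt, code, error)
    raise RuntimeError("no candidate passed validation")
```

In practice you’d want stronger validation (asserting against the question’s expected behavior, not just “it didn’t crash”) and a sandbox rather than running untrusted generated code directly.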
Best description I’ve heard is that an LLM is good at figuring out what the correct answer should look like, not necessarily what it is.