AI coders think they’re 20% faster — but they’re actually 19% slower (pivot-to-ai.com)

submitted 4 days ago by [email protected] to c/[email protected]


[–] [email protected] 6 points 2 days ago* (last edited 2 days ago) (1 children)

They don't check. You gotta think in statistical terms.

Based on the previously input words (tokens actually, but I'll use words for the sake of simplicity), which is the system prompt + user prompt, the LLM generates a list of the next possible words that make the most sense, then picks one from the top few. How far down the list it goes, toward less likely words, depends on the temperature setting. Then the next word, and the next, etc., each time looking back at everything so far.
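To make the "pick one from the top few" part concrete, here is a rough sketch in plain Python. Everything in it is made up for illustration: `sample_next_word`, `generate`, `toy_scores` and the scores themselves are hypothetical, and real models work on tokens with a huge vocabulary. It just shows temperature plus a top-k cutoff, which is one common way to do the picking, not necessarily what any particular chatbot does.

```python
import math
import random

def sample_next_word(scores, temperature=1.0, top_k=3, rng=random):
    """Pick one word from the top_k highest-scoring candidates.

    scores: dict of candidate word -> raw score (made-up numbers here).
    """
    # Keep only the few candidates that "make the most sense".
    top = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
    if temperature <= 0:
        # Assumption: treat zero temperature as "always take the top word".
        return top[0][0]
    # Softmax over the scaled scores; subtracting the max avoids overflow.
    scaled = [score / temperature for _, score in top]
    biggest = max(scaled)
    weights = [math.exp(s - biggest) for s in scaled]
    words = [word for word, _ in top]
    # Higher temperature flattens the weights, so lower-ranked words win more often.
    return rng.choices(words, weights=weights, k=1)[0]

def generate(next_scores, prompt_words, max_words=5, **kwargs):
    """Append one word at a time, each time looking back at all words so far."""
    words = list(prompt_words)
    for _ in range(max_words):
        scores = next_scores(words)
        if not scores:
            break
        words.append(sample_next_word(scores, **kwargs))
    return words

def toy_scores(words):
    """Hypothetical stand-in for the model: scores depend only on the last word."""
    table = {
        "the": {"cat": 2.0, "dog": 1.5, "rock": -1.0},
        "cat": {"sat": 2.0, "ran": 1.0},
        "dog": {"ran": 2.0, "sat": 1.0},
    }
    return table.get(words[-1], {})

if __name__ == "__main__":
    print(generate(toy_scores, ["the"], temperature=0))    # always the top word
    print(generate(toy_scores, ["the"], temperature=1.5))  # varies between runs
```

Note that nothing in this loop checks whether the output is true. It only ever asks "which word is likely next".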

I haven't checked what the reasoning step in reasoning models actually does, but I assume it just expands the user prompt to fill in stuff that the LLM thinks the user was too lazy to input, then works on the final answer.

So basically it's like tapping the next-word prediction on your phone keyboard.

[–] [email protected] 2 points 2 days ago (1 children)

The chatbots are not just LLMs though. They run scripts in which some steps are queries to an LLM.

[–] [email protected] 1 points 2 days ago (1 children)

ok.. what are you trying to point out?

[–] [email protected] 1 points 2 days ago* (last edited 2 days ago)

That the script could incorporate some checking mechanisms and return an "I don't know" when the LLM's answer fails some tests.

They already do some of that but for other purposes, like censoring, or, according to recent news, Grok looking up Musk's opinions before answering questions, or calling a normal calculator to make math calculations more accurate, and so on...

They could make the LLM produce an answer A, then look up the question on Google and ask that LLM to "compare" answer A with the top Google results, looking for inconsistencies, and then return "I don't know" if it's too inconsistent. It's not a rigorous test, but it's something, and I'm sure the actual devs of those chatbots could make something much better than my half-baked idea.
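Something like this, as a very rough Python sketch. `answer_with_check`, `ask_llm`, `search_web` and the `max_inconsistent` threshold are all hypothetical names I made up; the LLM and the search are passed in as plain functions, so no real chatbot or search API is assumed. Returning "I don't know" when there is nothing to compare against is also just my choice.

```python
from typing import Callable, List

def answer_with_check(
    question: str,
    ask_llm: Callable[[str], str],            # hypothetical: prompt in, text out
    search_web: Callable[[str], List[str]],   # hypothetical: query in, result snippets out
    max_inconsistent: int = 1,
) -> str:
    """Return the LLM's answer, or "I don't know" if it clashes with search results."""
    # Step 1: produce answer A.
    answer = ask_llm(question)

    # Step 2: look the question up.
    snippets = search_web(question)
    if not snippets:
        # Assumption: with nothing to compare against, refuse to answer.
        return "I don't know"

    # Step 3: ask the same LLM to compare answer A with each result.
    inconsistent = 0
    for snippet in snippets:
        verdict = ask_llm(
            "Does the ANSWER contradict the SOURCE? Reply YES or NO.\n"
            f"ANSWER: {answer}\nSOURCE: {snippet}"
        )
        if verdict.strip().upper().startswith("YES"):
            inconsistent += 1

    # Step 4: too many contradictions means "I don't know".
    if inconsistent > max_inconsistent:
        return "I don't know"
    return answer
```

The obvious weakness is that the comparison step is the same LLM again, so it can be wrong in the same ways. Like I said, not rigorous.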