this post was submitted on 28 Jan 2025
195 points (100.0% liked)

World News

46635 readers
1676 users here now

A community for discussing events around the World

Rules:

Similarly, if you see posts along these lines, do not engage. Report them, block them, and live a happier life than they do. We see too many slapfights that boil down to "Mom! He's bugging me!" and "I'm not touching you!" Going forward, slapfights will result in removed comments and temp bans to cool off.

We ask that the users report any comment or post that violate the rules, to use critical thinking when reading, posting or commenting. Users that post off-topic spam, advocate violence, have multiple comments or posts removed, weaponize reports or violate the code of conduct will be banned.

All posts and comments will be reviewed on a case-by-case basis. This means that some content that violates the rules may be allowed, while other content that does not violate the rules may be removed. The moderators retain the right to remove any content and ban users.


Lemmy World Partners

News [email protected]

Politics [email protected]

World Politics [email protected]


Recommendations

For Firefox users, there is media bias / propaganda / fact check plugin.

https://addons.mozilla.org/en-US/firefox/addon/media-bias-fact-check/

founded 2 years ago
MODERATORS
top 39 comments
sorted by: hot top controversial new old
[–] [email protected] 44 points 3 months ago (1 children)

Is this a salty Altman or are they just being hugged to death?

[–] [email protected] 35 points 3 months ago (1 children)

Considering how many billions are on the line this is very likely a salty someone.

[–] [email protected] 5 points 3 months ago (1 children)

Would be amusing if they release a new version and then make the old version completely free to self host, release it as a torrent. Just make Altman totally worthless.

[–] [email protected] 11 points 3 months ago* (last edited 3 months ago)

They already did. You just need some very powerful hardware to actually host the full thing (or at least, a lot of VRAM).

[–] [email protected] 14 points 3 months ago

Do they not know it works offline too?

I noticed chatgpt today being pretty slow compared to the local deepseek I have running which is pretty sad since my computer is about a bajillion times less powerful

[–] [email protected] 8 points 3 months ago (3 children)

It still can’t count the Rs in strawberry, I’m not worried.

[–] [email protected] 14 points 3 months ago (1 children)
[–] [email protected] 6 points 3 months ago

Clearly not the first try ;)

[–] [email protected] 5 points 3 months ago (1 children)
[–] [email protected] 1 points 3 months ago (2 children)
[–] [email protected] 3 points 3 months ago* (last edited 3 months ago)

https://ibb.co/wVNsn5H

https://ibb.co/HpK5G5Pp

https://ibb.co/sp1wGMFb

https://ibb.co/4wyKhkRH

https://ibb.co/WpBTZPRm

https://ibb.co/0yP73j6G

Note that my tests were via groq and the r1 70B distilled llama variant (the 2nd smartest version afaik)

Edit 1:

Incidentally... I propositioned a coworker to answer the same question. This is the summarized conversation I had:

Me: "Hey Billy, can you answer a question? in under 3 seconds answer my following question"

Billy: "sure"

Me: "How many As are in abracadabra 3.2.1"

Billy: "4" (answered in less than 3 seconds)

Me: "nope"

I'm gonna poll the office and see how many people get it right with the same opportunity the ai had.

Edit 2: The second coworker said "6" in about 5 seconds

Edit 3: Third coworker said 4, in 3 seconds

Edit 4: I asked two more people and one of them got it right... But I'm 60% sure she heard me asking the previous employee, but if she didnt we're at 1/5

In probably done with this game for the day.

I'm pretty flabbergasted with the results of my very unscientific experiment, but now I can say (with a mountain of anecdotal juice) that with letter counting, R1 70b is wildly faster and more accurate than humans .

[–] [email protected] 3 points 3 months ago (1 children)
[–] [email protected] 4 points 3 months ago (1 children)

“Again” so it failed the first time. Got it.

[–] [email protected] 5 points 3 months ago* (last edited 3 months ago) (1 children)

It didn't, I just wanted a short reply. Though it failed when I asked again in the same chat. But when asked to split the word to 2 parts it became sure that the correct answer is 3.

Here is the fun part

Certainly! Let's split "strawberry" into two equal parts and count the 'r's in each:

  1. First half: "strawb"
    Letters: S, T, R, A, W, B
    'r's: 1 (at position 3)

  2. Second half: "erry"
    Letters: E, R, R, Y
    'r's: 2 (at positions 2 and 3)

Now, summing them: 1 (first half) + 2 (second half) = 3 'r's.

Wait, this contradicts my earlier correction. It seems I made another mistake. Let's carefully recount:

  • First half: "strawb"
    'r's: 1 (at position 3)

  • Second half: "erry"
    'r's: 1 (only one 'r' at position 2; the third letter is 'r' but already counted in the first half).

Wait, no—this is getting confusing. Let's re-examine the entire word:

Full word: S, T, R, A, W, B, E, R, R, Y
'r's: Positions 3, 8, and 9 → 3 'r's.

I apologize for the back-and-forth. The correct total is 3 'r's in "strawberry." Thank you for your patience as I worked through this! 😊

[–] [email protected] 5 points 3 months ago (2 children)

That’s a lot of processing just to count letters. Hopefully it can add numbers without splitting the number ;)

[–] [email protected] 4 points 3 months ago

It’s because LLMs don’t work with letters. They work with tokens that are converted to vectors.

They literally don’t see the word “strawberry” in order to count the letters.

Splitting the letter probably separates them into individual tokens

[–] [email protected] 2 points 3 months ago (1 children)

That’s a lot of processing just to count letters

feel free to ask Google/Bing/Your favourite search engine to do the same :P

[–] [email protected] 1 points 3 months ago

Search engines are not designed to answer questions. Apples and oranges.

[–] [email protected] 2 points 3 months ago (2 children)
[–] [email protected] 9 points 3 months ago* (last edited 3 months ago) (2 children)

No. It literally cannot count the number of R letters in strawberry. It says 2, there are 3. ChatGPT had this problem, but it seems it is fixed. However if you say “are you sure?” It says 2 again.

Ask ChatGPT to make an image of a cat without a tail. Impossible. Odd, I know, but one of those weird AI issues

[–] [email protected] 7 points 3 months ago (3 children)

Because there aren't enough pictures of tail-less cats out there to train on.

It's literally impossible for it to give you a cat with no tail because it can't find enough to copy and ends up regurgitating cats with tails.

Same for a glass of water spilling over, it can't show you an overfilled glass of water because there aren't enough pictures available for it to copy.

This is why telling a chatbot to generate a picture for you will never be a real replacement for an artist who can draw what you ask them to.

[–] [email protected] 4 points 3 months ago (1 children)

Not really it's supposed to understand what a tail is, what a cat is, and which part of the cat is the tail. That's how the "brain" behind AI works

[–] [email protected] 2 points 3 months ago* (last edited 3 months ago) (1 children)

It searches the internet for cats without tails and then generates an image from a summary of what it finds, which contains more cats with tails than without.

That's how this Machine Learning progam works

[–] [email protected] 3 points 3 months ago (2 children)

It doesn't search the internet for cats, it is pre-trained on a large set of labelled images and learns how to predict images from labels. The fact that there are lots of cats (most of which have tails) and not many examples of things "with no tail" is pretty much why it doesn't work, though.

[–] [email protected] 2 points 3 months ago

Unrelated to the convo but for those who'd like a visual on how LLM's work: https://bbycroft.net/llm

[–] [email protected] 1 points 3 months ago (1 children)

And where did it happen to find all those pictures of cats?

[–] [email protected] 3 points 3 months ago (1 children)

It's not the "where" specifically I'm correcting, it's the "when." The model is trained, then the query is run against the trained model. The query doesn't involve any kind of internet search.

[–] [email protected] 1 points 3 months ago (2 children)

And I care about "how" it works and "what" data it uses because I don't have to walk on eggshells to preserve the sanctity of an autocomplete software

You need to curb your pathetic ego and really think hard about how feeding the open internet to an ML program with a LLM slapped onto it is actually any more useful than the sum of its parts.

[–] [email protected] 3 points 3 months ago

Dawg you're unhinged

[–] [email protected] 1 points 3 months ago* (last edited 3 months ago)

You need to curb your pathetic ego

No one murdered your puppy. Take a deep breath.

You evidently lack even a cursory understanding of the topic. Disliking AI is not a good enough reason to be a jackass to everyone.

[–] [email protected] 3 points 3 months ago

Oh, that’s another good test. It definitely failed.

There are lots of Manx photos though.

Manx images: https://duckduckgo.com/?q=manx&iax=images&ia=images

[–] [email protected] 2 points 3 months ago (2 children)

so.... with all the supposed reasoning stuff they can do, and supposed "extrapolation of knowledge" they cannot figure out that a tail is part of a cat, and which part it is.

[–] [email protected] 3 points 3 months ago

The "reasoning" models and the image generation models are not the same technology and shouldn't be compared against the same baseline.

[–] [email protected] 2 points 3 months ago (1 children)

The "reasoning" you are seeing is it finding human conversations online, and summerizing them

[–] [email protected] 1 points 3 months ago

I'm not seeing any reasoning, that was the point of my comment. That's why I said "supposed"

[–] [email protected] 3 points 3 months ago (1 children)

I mean I tested it out, even tbough I am sure your trolling me and DeepSeek correctly counts the R's

[–] [email protected] 5 points 3 months ago

Non thinking prediction models can't count the r's in strawberry due to the nature of tokenization.

However openai o1 and deep seek r1 can both reliably do it correctly

[–] [email protected] 5 points 3 months ago

Wonder if someone found out about XXXGPT