Funny

It's so over (lemmy.world)

submitted 10 months ago by [email protected] to c/[email protected]

[–] [email protected] 4 points 10 months ago (17 children)

That's literally how they work

[–] [email protected] 7 points 10 months ago (6 children)

Man, the models can't store their training data verbatim. The training data is turned into a model that is hundreds or thousands of times smaller than the original source data. If a model were capable of simply recovering everything it was trained on, that would be some magical compression algorithm, and that by itself would be extremely impressive.

[–] [email protected] 3 points 10 months ago (5 children)

Congratulations on discovering compression

[–] [email protected] 5 points 10 months ago* (last edited 10 months ago) (2 children)

Oh OK, so you want to claim this compresses the entirety of the internet into a model that isn't even 1 terabyte, and still be unimpressed? That is something.

But it isn't compression. It is a mathematical fact that neural networks are universal function approximators; this is undisputed. Analytic functions are continuous, so to approximate an analytic function a network must be able to fill in the gaps between discrete data points by itself, which necessarily means spitting out data that is not in its training set, data it has not seen.
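To make the "fill in the gaps" point concrete, here is a minimal sketch. It is not a trained model and says nothing about how an LLM works internally: it is a one-hidden-layer ReLU network whose weights are set by hand so that it passes through nine samples of `sin(x)`, and it is then queried at points that were never among those samples. The function name `build_relu_interpolator`, the choice of `sin`, and the sample grid are all assumptions made for illustration.

```python
import math


def relu(z):
    """Standard ReLU activation."""
    return max(0.0, z)


def build_relu_interpolator(xs, ys):
    """Build f(x) = bias + sum_i c_i * relu(x - x_i).

    This is a one-hidden-layer ReLU network with hand-set (not trained)
    weights, chosen so that f(x_i) == y_i for every sample point.
    Assumes xs is sorted and has at least two distinct points.
    """
    # Slope of each linear segment between neighbouring samples.
    slopes = [(ys[i + 1] - ys[i]) / (xs[i + 1] - xs[i]) for i in range(len(xs) - 1)]
    # Each hidden unit adds the change in slope at its knot.
    coeffs = [slopes[0]] + [slopes[i] - slopes[i - 1] for i in range(1, len(slopes))]
    knots = xs[:-1]
    bias = ys[0]

    def f(x):
        return bias + sum(c * relu(x - k) for c, k in zip(coeffs, knots))

    return f


# "Training set": nine discrete samples of sin on [0, pi] (illustrative choice).
train_xs = [i * math.pi / 8 for i in range(9)]
train_ys = [math.sin(x) for x in train_xs]
net = build_relu_interpolator(train_xs, train_ys)

# Query points halfway between samples: none of these inputs were seen.
unseen_xs = [(a + b) / 2 for a, b in zip(train_xs, train_xs[1:])]
predictions = [net(x) for x in unseen_xs]
max_error = max(abs(p - math.sin(x)) for p, x in zip(predictions, unseen_xs))

print("max error at unseen points:", max_error)
```

The outputs at the unseen points are not copies of any stored sample, yet they stay close to the true function. A lookup table of the nine samples could not produce them.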

[–] [email protected] 3 points 10 months ago (1 children)

TBF, compression is related to ML. Hence, the Hutter Prize. Thinking of LLMs as lossy compression algorithms is a decent analogy.
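A rough sketch of why prediction and compression are linked: under an ideal entropy coder, a symbol that a model predicts with probability p costs about -log2(p) bits, so a better predictor means a shorter encoding. The toy text and both "models" below are made up for illustration, and the unigram model is fitted on the same text it encodes, which a real compressor would have to pay for.

```python
import math
from collections import Counter


def ideal_code_length_bits(text, prob):
    """Total bits an ideal entropy coder would need if each symbol
    costs -log2(p), where p is the model's probability for it."""
    return sum(-math.log2(prob(ch)) for ch in text)


text = "abracadabra" * 20          # toy data, purely illustrative
alphabet = sorted(set(text))
counts = Counter(text)


def uniform_model(ch):
    """Knows nothing: every symbol in the alphabet is equally likely."""
    return 1 / len(alphabet)


def unigram_model(ch):
    """Knows symbol frequencies (fitted on the text itself)."""
    return counts[ch] / len(text)


bits_uniform = ideal_code_length_bits(text, uniform_model)
bits_unigram = ideal_code_length_bits(text, unigram_model)

print("uniform model bits:", bits_uniform)
print("unigram model bits:", bits_unigram)
```

The model that predicts the text better needs fewer bits to encode it.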

[–] [email protected] 1 points 10 months ago

It is a partial analogy. It takes into consideration the outputs that are related to some specific training data and disregards the outputs that cannot be directly related to any specific training data.

For example, make up a new meme template and a new joke on the spot. The AI couldn't have seen them before if you make sure your joke and template are new. If it can still explain the meme, then compression is a horrendous analogy.

Lossy compression explains outputs that are similar but not identical to the original data when you try to recover it. It doesn't explain brand new content that makes sense on its own. Imagine lossy audio compression producing a brand new song midway through playback, or lossy image compression overlaying a brand new coherent image onto some pixels of the original image. That is not what happens: lossy audio compression results in noise and lossy image compression results in noise, not in coherent unheard songs and unseen images.
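A minimal sketch of that behaviour, assuming a plain uniform quantizer as the lossy codec (real audio and image codecs are far more elaborate): the restored signal differs from the original, but only by a bounded amount of quantization noise. The step size and test signal are arbitrary choices for illustration.

```python
import math

STEP = 0.125  # assumed quantization step; larger means lossier


def compress(samples, step=STEP):
    """Lossy step: keep only the index of the nearest quantization level."""
    return [round(s / step) for s in samples]


def decompress(codes, step=STEP):
    """Map each index back to its quantization level."""
    return [c * step for c in codes]


# Toy "audio": one period of a sine wave, 50 samples.
original = [math.sin(2 * math.pi * k / 50) for k in range(50)]
restored = decompress(compress(original))

errors = [abs(a - b) for a, b in zip(original, restored)]
max_error = max(errors)

print("max reconstruction error:", max_error)
print("error bound (STEP / 2):  ", STEP / 2)
```

Every restored sample stays within half a step of the original. The codec degrades the signal it was given; it never emits a different signal.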

[–] [email protected] 2 points 10 months ago (1 children)

Not sure why you feel the need to put words in my mouth. It wasn't trained on "the entirety of the Internet," but rather less than a terabyte of it. So yeah, that would probably take up less than a terabyte.

[–] [email protected] 7 points 10 months ago

Then why can ChatGPT explain this meme that I just made up right now?

https://i.ibb.co/NYHRnTY/Screenshot-20240531-072008-Chat-GPT.jpg

Arguing over this is just dumb. You can take any picture you want at this very moment, or come up with a brand new meme template on the spot, and upload it to ChatGPT to see that you are wrong. It is free, btw.
