this post was submitted on 13 Feb 2024
515 points (100.0% liked)

Technology

[–] [email protected] 60 points 1 year ago* (last edited 1 year ago) (4 children)

[Edit: indeed, it's actually good that it's 2 GB]

2 GB plugin??!

Btw, does it work with Tenacity?

[–] [email protected] 64 points 1 year ago (2 children)

AI models are often multiple gigabytes, tbh it's a good sign that it's not "AI" marketing bullshit (less of a risk with open source projects anyway). I'm pretty wary of "AI" audio software that's only a few megabytes.

[–] [email protected] 10 points 1 year ago

TensorFlow Lite models are tiny, but they're potentially as much of an audio revolution as synthesizers were in the 70s. It's hard to tell if that's what we're looking at here.

[–] [email protected] 4 points 1 year ago (4 children)

Why are they that big? Is it more than code? How could you get to gigabytes of code?

[–] [email protected] 50 points 1 year ago

Currently, AI means Artificial Neural Network (ANN). That's only one specific approach. What ANN boils down to is one huge system of equations.

The file stores the parameters of these equations, arranged in what math calls matrices. A parameter is simply a number by which something is multiplied. Colloquially, such a file of parameters is called an AI model.
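To make the "system of equations" idea concrete, here is a minimal sketch of a tiny layer in plain Python. The weights and inputs are made-up numbers, and `apply_layer` is a hypothetical helper, not anything taken from the plugin. Real models add biases and non-linear steps on top of this.

```python
# Hypothetical toy "model": 2 inputs, 3 outputs. All numbers are made up.
WEIGHTS = [
    [0.5, -1.0],
    [2.0, 0.25],
    [-0.75, 1.5],
]


def apply_layer(weights, inputs):
    """Evaluate one equation per row: output_i = sum_j weights[i][j] * inputs[j]."""
    return [sum(w * x for w, x in zip(row, inputs)) for row in weights]


outputs = apply_layer(WEIGHTS, [2.0, 4.0])
print(outputs)  # [-3.0, 5.0, 4.5]
```

Each row of `WEIGHTS` is one equation, and each number in it is one parameter that would be stored in the model file.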

2 GB is probably an AI model with 1 billion parameters at 16-bit precision: 16 bits is 2 bytes per parameter, and 1 billion × 2 bytes = 2 GB. Precision is how many digits (bits) each number gets. The more digits you have, the more precisely you can state a value.
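The size estimate can be checked with a few lines of arithmetic. This is a rough sketch that assumes decimal gigabytes (1 GB = 10^9 bytes) and ignores any metadata in the file. `model_size_bytes` is a hypothetical helper name.

```python
def model_size_bytes(num_parameters, bits_per_parameter):
    """Approximate file size: every parameter takes bits_per_parameter / 8 bytes."""
    return num_parameters * bits_per_parameter // 8


size_16_bit = model_size_bytes(1_000_000_000, 16)
size_32_bit = model_size_bytes(1_000_000_000, 32)

print(size_16_bit / 1e9, "GB")  # 2.0 GB
print(size_32_bit / 1e9, "GB")  # 4.0 GB
```

The same billion parameters at 32-bit precision would take twice the space, which is one reason models are often shipped at 16 bits or less.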

When people talk about training an AI, they mean finding the right parameters, so that the equations compute the right thing. The bigger the model, the smarter it can be.

Does that answer the question? It's probably missing a lot.

[–] [email protected] 15 points 1 year ago* (last edited 1 year ago) (1 children)

It's basically a huge graph/flowchart.

[–] [email protected] 6 points 1 year ago (1 children)

It's really nothing of the sort.

[–] [email protected] 14 points 1 year ago (1 children)
  1. Specifying weights, biases and shape definitely makes a graph.
  2. IMO having a lot of more preferred and more deprecated routes is quite close to a flowchart, except there are a lot more routes. The principles of how these work are quite similar.
[–] [email protected] 3 points 1 year ago (1 children)
  1. There are graph neural networks (meaning NNs that work on graphs), but I don't think that's what is used here.

  2. I do not understand what you mean by "routes". I suspect that you have misunderstood something fundamental.

[–] [email protected] 5 points 1 year ago (1 children)
  1. I'm not talking about that. What are weights, biases and shape if not a graph?
  2. By routes, I mean that the path of the graph doesn't necessarily converge and that it is often more tree-like.
[–] [email protected] 4 points 1 year ago* (last edited 1 year ago) (1 children)

You can see a neural net as a graph in that the neurons are connected nodes. I don't believe that graph theory is very helpful, though. The weights are parameters in a system of linear equations; the numbers in a matrix/tensor. That's not how the term is used in graph theory, AFAIK.

ETA: What you say about "routes" (=paths?) is something that I can only make sense of if I assume that you misunderstood something. Otherwise, I simply don't know what that is referring to.
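For what it's worth, the two views describe the same numbers. The sketch below, with made-up weights and hypothetical helper names, stores one small layer both as a matrix and as a list of weighted edges between nodes, and computes the same output either way. It only covers a single linear layer and says nothing about which view is more useful.

```python
# One small layer, written as a matrix. All numbers are made up.
WEIGHTS = [
    [0.5, -1.0],
    [2.0, 0.25],
]


def as_edges(weights):
    """List each weight as a directed edge: (input_index, output_index, weight)."""
    return [(j, i, w) for i, row in enumerate(weights) for j, w in enumerate(row)]


def forward_matrix(weights, inputs):
    """Matrix view: each output is a row of weights times the inputs."""
    return [sum(w * x for w, x in zip(row, inputs)) for row in weights]


def forward_edges(edges, inputs, num_outputs):
    """Graph view: each edge carries its input, scaled by its weight, to an output node."""
    outputs = [0.0] * num_outputs
    for src, dst, w in edges:
        outputs[dst] += w * inputs[src]
    return outputs


inputs = [2.0, 4.0]
edges = as_edges(WEIGHTS)
print(forward_matrix(WEIGHTS, inputs))
print(forward_edges(edges, inputs, len(WEIGHTS)))
```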

[–] [email protected] 2 points 1 year ago (1 children)

If you look at the nodes which are most likely to trigger from given inputs, then you can draw paths.

[–] [email protected] 2 points 1 year ago

I still don't know what this is supposed to mean for neural nets. I think it reflects a misunderstanding.

[–] [email protected] 8 points 1 year ago* (last edited 1 year ago) (1 children)

The current wave of AI is around Large Language Models, or LLMs. These are basically the result of a metric fuckton of calculations, generated by running a load of input data through in different ways. Given that the inputs are often things like text, pictures or audio that have been distilled down into numbers, you can imagine we're talking about a lot of data.

(This is massively simplified, by someone who doesn't entirely understand it themselves)

[–] [email protected] 7 points 1 year ago

They're composed of many big matrices, which scale quadratically in size. A 32x32 matrix is 4x the size of a 16x16 matrix.
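A quick sketch of the quadratic scaling, counting entries in a square matrix (`matrix_entries` is just an illustrative helper):

```python
def matrix_entries(n):
    """Number of values stored in an n x n matrix."""
    return n * n


for n in (16, 32, 64):
    print(f"{n}x{n}: {matrix_entries(n)} entries")

# Doubling the side length quadruples the number of entries.
ratio = matrix_entries(32) / matrix_entries(16)
print(ratio)  # 4.0
```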

[–] [email protected] 33 points 1 year ago

It seems reasonable given it includes multiple AI models.

[–] [email protected] 7 points 1 year ago

2 GB is pretty normal for an AI model. I have some small LLMs on my PC and they're about 7-10 GB each. The big ones take up even more space.

[–] [email protected] 3 points 1 year ago (2 children)

Isn’t Tenacity a joke project made by 4channers?

[–] [email protected] 15 points 1 year ago (1 children)

That fork is Sneedacity, which is very dead.

[–] [email protected] 3 points 1 year ago

Gotcha, thank you for the info. Gotta admit their made-up words are pretty funny.

[–] [email protected] 10 points 1 year ago (1 children)

Tenacity is an Audacity fork without telemetry.

[–] [email protected] 16 points 1 year ago (2 children)

Isn't the telemetry in Audacity opt-in anyway?

[–] [email protected] 3 points 1 year ago

The fork was created when Audacity was bought and one of the first things the new developers were about to do was add opt-out telemetry. People didn't like that at all. From what I read in this thread, they ended up adding opt-in telemetry instead.