The Future of Large Language Model Pre-training is Federated

Hackworth@lemmy.world · edit-2 6 months ago

The Future of Large Language Model Pre-training is Federated

Martineski@lemmy.dbzer0.com · 6 months ago

Is this how you make a sentient planet?

Martineski@lemmy.dbzer0.com · 6 months ago

I wonder if this will become a big thing in FOSS ai space. It’s hard to compete with corpos when it comes to computing power.

General_Effort@lemmy.world · 6 months ago

As far as I know, federated learning is pretty much dead. The point would be that it allows organizations to create a joint model without sharing data. But it doesn’t look like anyone who doesn’t want to share data wants to share a model.

Hackworth@lemmy.world · 6 months ago

Until they can distribute the training load of large models to consumer graphics cards (and do something like SETI@Home) it does seem like the benefit of distributed training isn’t enough to overcome the friction.