Breakthrough: Run Massive Models On Any Device (ex: LLaMA 65b)
Update: Sorry for the audio sync issue 😔 In this video, we talk about Petals. A new project combines old-ish technology with large language models to allow you to run even the largest models in a distributed fashion on any device. This incredible new implementation truly decentralizes LLMs (LLaMA, Bloom, MPT, etc) and allows consumer-grade computers to run any…