Scaling Large ML Models to Small Devices with Atila Orhon

Software Engineering Daily - A podcast by Software Engineering Daily

The size of ML models is growing into the many billions of parameters. This poses a challenge for running inference on non-dedicated hardware like phones and laptops. Argmax is a startup focused on developing methods to run large models on commodity hardware. A key observation behind their strategy is that the largest models are getting