Scaling Large ML Models to Small Devices with Atila Orhon
Software Engineering Daily - A podcast by Software Engineering Daily
Categories:
The size of ML models is growing into the many billions of parameters. This poses a challenge for running inference on non-dedicated hardware like phones and laptops. Argmax is a startup focused on developing methods to run large models on commodity hardware. A key observation behind their strategy is that the largest models are getting