Passer à la navigation principale Passer à la recherche Passer au contenu principal

Towards inference delivery networks: Distributing machine learning with optimality guarantees

  • Tareq Si Salem
  • , Gabriele Castellano
  • , Giovanni Neglia
  • , Fabio Pianese
  • , Andrea Araldo

Résultats de recherche: Le chapitre dans un livre, un rapport, une anthologie ou une collectionContribution à une conférenceRevue par des pairs

Résumé

We present the novel idea of inference delivery networks (IDN), networks of computing nodes that coordinate to satisfy inference requests achieving the best trade-off between latency and accuracy. IDNs bridge the dichotomy between device and cloud execution by integrating inference delivery at the various tiers of the infrastructure continuum (access, edge, regional data center, cloud). We propose a distributed dynamic policy for ML model allocation in an IDN by which each node periodically updates its local set of inference models based on requests observed during the recent past plus limited information exchange with its neighbor nodes. Our policy offers strong performance guarantees in an adversarial setting and shows improvements over greedy heuristics with similar complexity in realistic scenarios.

langue originaleAnglais
titre2021 19th Mediterranean Communication and Computer Networking Conference, MedComNet 2021
EditeurInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronique)9781665435901
Les DOIs
étatPublié - 1 janv. 2021
Evénement19th Mediterranean Communication and Computer Networking Conference, MedComNet 2021 - Virtual, Online, Espagne
Durée: 15 juin 202117 juin 2021

Série de publications

Nom2021 19th Mediterranean Communication and Computer Networking Conference, MedComNet 2021

Une conférence

Une conférence19th Mediterranean Communication and Computer Networking Conference, MedComNet 2021
Pays/TerritoireEspagne
La villeVirtual, Online
période15/06/2117/06/21

Empreinte digitale

Examiner les sujets de recherche de « Towards inference delivery networks: Distributing machine learning with optimality guarantees ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation