Passer à la navigation principale Passer à la recherche Passer au contenu principal

A distributed algorithm for large-scale generalized matching

  • Faraz Makari Manshadi
  • , Baruch Awerbuch
  • , Rainer Gemulla
  • , Rohit Khandekar
  • , Julián Mestre
  • , Mauro Sozio
  • Max-Planck-Institut fur Informatik
  • Johns Hopkins University
  • Knight Capital Group
  • University of Sydney
  • Telecom Paris

Résultats de recherche: Contribution à un journalArticle de conférenceRevue par des pairs

Résumé

Generalized matching problems arise in a number of applications, including computational advertising, recommender systems, and trade markets. Consider, for example, the problem of recommending multimedia items (e.g., DVDs) to users such that (1) users are recommended items that they are likely to be interested in, (2) every user gets neither too few nor too many recommendations, and (3) only items available in stock are recommended to users. State-of-the-art matching algorithms fail at coping with large real-world instances, which may involve millions of users and items. We propose the first distributed algorithm for computing near-optimal solutions to large-scale generalized matching problems like the one above. Our algorithm is designed to run on a small cluster of commodity nodes (or in a MapReduce environment), has strong approximation guarantees, and requires only a poly-logarithmic number of passes over the input. In particular, we propose a novel distributed algorithm to approximately solve mixed packing-covering linear programs, which include but are not limited to generalized matching problems. Experiments on real-world and synthetic data suggest that a practical variant of our algorithm scales to very large problem sizes and can be orders of magnitude faster than alternative approaches.

langue originaleAnglais
Pages (de - à)613-624
Nombre de pages12
journalProceedings of the VLDB Endowment
Volume6
Numéro de publication9
Les DOIs
étatPublié - 1 janv. 2013
Modification externeOui
Evénement39th International Conference on Very Large Data Bases, VLDB 2012 - Trento, Italie
Durée: 26 août 201330 août 2013

Empreinte digitale

Examiner les sujets de recherche de « A distributed algorithm for large-scale generalized matching ». Ensemble, ils forment une empreinte digitale unique.

Contient cette citation