Abstract
This paper investigates the influence of different page features on the ranking of search engine results. We use Google (via its API) as our testbed and analyze the result rankings for several queries of different categories using statistical methods. We reformulate the problem of learning the underlying, hidden scores as a binary classification problem, to which we then apply both linear and non-linear methods. In all cases, we split the data into a training set and a test set to obtain a meaningful, unbiased estimate of the quality of our predictor. Although our results clearly show that the scoring function cannot be approximated well using only the observed features, we obtain many interesting insights along the way, and we discuss ways of obtaining a better estimate as well as the main limitations in trying to do so.
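The abstract does not spell out how the hidden-score problem is recast as binary classification. One common reading, assumed here, is a pairwise reduction: for two results of the same query, classify whether the first is ranked above the second from the difference of their feature vectors. The sketch below illustrates that reduction on synthetic data only. The function names, the three-feature hidden weight vector, the perceptron as a stand-in linear method, and the split by query are all illustrative assumptions, not the authors' setup.

```python
import random

def make_pairs(ranked_pages):
    """Turn one ranked list of feature vectors (best first) into labelled pairs.

    Each pair (i, j) with i ranked above j yields the difference x_i - x_j
    with label +1, plus the mirrored difference with label -1.
    """
    pairs = []
    for i in range(len(ranked_pages)):
        for j in range(i + 1, len(ranked_pages)):
            diff = [a - b for a, b in zip(ranked_pages[i], ranked_pages[j])]
            pairs.append((diff, 1))
            pairs.append(([-d for d in diff], -1))
    return pairs

def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))

def train_perceptron(pairs, n_features, epochs=20):
    """Plain perceptron without bias term (stand-in for 'a linear method')."""
    w = [0.0] * n_features
    for _ in range(epochs):
        for x, y in pairs:
            if y * dot(w, x) <= 0:  # misclassified: move w towards y * x
                w = [wi + y * xi for wi, xi in zip(w, x)]
    return w

def accuracy(w, pairs):
    """Fraction of pairs whose order is predicted correctly."""
    return sum(1 for x, y in pairs if y * dot(w, x) > 0) / len(pairs)

def synthetic_query(rng, n_pages=10, hidden_w=(2.0, -1.0, 0.5)):
    """Hypothetical data: random features, ranked by a hidden linear score."""
    pages = [[rng.random() for _ in hidden_w] for _ in range(n_pages)]
    pages.sort(key=lambda x: dot(hidden_w, x), reverse=True)
    return pages

rng = random.Random(0)
queries = [synthetic_query(rng) for _ in range(20)]

# Split by query so that no test pair shares a result list with a training pair.
train_queries, test_queries = queries[:15], queries[15:]
train_pairs = [p for q in train_queries for p in make_pairs(q)]
test_pairs = [p for q in test_queries for p in make_pairs(q)]

w = train_perceptron(train_pairs, n_features=3)
test_accuracy = accuracy(w, test_pairs)
print(f"held-out pairwise accuracy: {test_accuracy:.2f}")
```

Because the synthetic ranking really is generated by a linear score over fully observed features, a linear classifier can do well here. With real search results, where many ranking factors are unobserved, the held-out accuracy would be expected to be much lower, which is consistent with the negative result the abstract reports.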
| Original language | English |
|---|---|
| Pages | 48-57 |
| Number of pages | 10 |
| Status | Published - 1 Dec 2005 |
| Externally published | Yes |
| Event | 1st International Workshop on Adversarial Information Retrieval on the Web, AIRWeb 2005 - Held in Conjunction with the 14th International World Wide Web Conference - Chiba, Japan. Duration: 10 May 2005 → 10 May 2005 |
Conference
| Conference | 1st International Workshop on Adversarial Information Retrieval on the Web, AIRWeb 2005 - Held in Conjunction with the 14th International World Wide Web Conference |
|---|---|
| Country/Territory | Japan |
| City | Chiba |
| Period | 10 May 2005 → 10 May 2005 |
Fingerprint
Explore the research topics of "An analysis of factors used in search engine ranking". Together they form a unique fingerprint.