TY - GEN
T1 - Set of t-uples expansion by example
AU - Er, Ngurah Agus Sanjaya
AU - Abdessalem, Talel
AU - Bressan, Stéphane
N1 - Publisher Copyright:
© 2016 ACM.
PY - 2016/11/28
Y1 - 2016/11/28
N2 - Set expansion is the task of finding elements of a set given example members. We are interested in the design of al-gorithms and techniques for a set expansion tool that ex-pands a set by searching, finding and extracting candidates from the World Wide Web. Existing approaches mostly consider sets of atomic data. We extend this idea to the expansion of sets of t-uples, that is relation instances or tables. We propose an approach for extracting relation in-stances from the World Wide Web given a handful set of t-uple seeds. For instance, when the user proposes the set of seeds , , the system returns a relation con-taining currency codes with their corresponding country and capital city. We show how a random walk in a heterogeneous graph of Web pages, wrappers, seeds and candidates is able to rank the candidates according to their relevance to the seeds. We evaluate the performance of the approach and show that it is efficient, effective and practical.
AB - Set expansion is the task of finding elements of a set given example members. We are interested in the design of al-gorithms and techniques for a set expansion tool that ex-pands a set by searching, finding and extracting candidates from the World Wide Web. Existing approaches mostly consider sets of atomic data. We extend this idea to the expansion of sets of t-uples, that is relation instances or tables. We propose an approach for extracting relation in-stances from the World Wide Web given a handful set of t-uple seeds. For instance, when the user proposes the set of seeds , , the system returns a relation con-taining currency codes with their corresponding country and capital city. We show how a random walk in a heterogeneous graph of Web pages, wrappers, seeds and candidates is able to rank the candidates according to their relevance to the seeds. We evaluate the performance of the approach and show that it is efficient, effective and practical.
KW - Set Expansion
KW - T-uples expansion
U2 - 10.1145/3011141.3011144
DO - 10.1145/3011141.3011144
M3 - Conference contribution
AN - SCOPUS:85014907576
T3 - ACM International Conference Proceeding Series
SP - 221
EP - 230
BT - 18th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2016 - Proceedings
A2 - Indrawan-Santiago, Maria
A2 - Anderst-Kotsis, Gabriele
A2 - Steinbauer, Matthias
A2 - Khalil, Ismail
PB - Association for Computing Machinery
T2 - 18th International Conference on Information Integration and Web-Based Applications and Services, iiWAS 2016
Y2 - 28 November 2016 through 30 November 2016
ER -