TY - GEN
T1 - Interactive data exploration based on user relevance feedback
AU - Dimitriadou, Kyriaki
AU - Papaemmanouil, Olga
AU - Diao, Yanlei
PY - 2014/1/1
Y1 - 2014/1/1
N2 - Interactive Data Exploration (IDE) applications typically involve users that aim to discover interesting objects by it-eratively executing numerous ad-hoc exploration queries. Therefore, IDE can easily become an extremely labor and resource intensive process. To support these applications, we introduce a framework that assists users by automatically navigating them through the data set and allows them to identify relevant objects without formulating data retrieval queries. Our approach relies on user relevance feedback on data samples to model user interests and strategically collects more samples to refine the model while minimizing the user effort. The system leverages decision tree classifiers to generate an effective user model that balances the trade-off between identifying all relevant objects and reducing the size of final returned (relevant and irrelevant) objects. Our preliminary experimental results demonstrate that we can predict linear patterns of user interests (i.e., range queries) with high accuracy while achieving interactive performance.
AB - Interactive Data Exploration (IDE) applications typically involve users that aim to discover interesting objects by it-eratively executing numerous ad-hoc exploration queries. Therefore, IDE can easily become an extremely labor and resource intensive process. To support these applications, we introduce a framework that assists users by automatically navigating them through the data set and allows them to identify relevant objects without formulating data retrieval queries. Our approach relies on user relevance feedback on data samples to model user interests and strategically collects more samples to refine the model while minimizing the user effort. The system leverages decision tree classifiers to generate an effective user model that balances the trade-off between identifying all relevant objects and reducing the size of final returned (relevant and irrelevant) objects. Our preliminary experimental results demonstrate that we can predict linear patterns of user interests (i.e., range queries) with high accuracy while achieving interactive performance.
UR - https://www.scopus.com/pages/publications/84901752180
U2 - 10.1109/ICDEW.2014.6818343
DO - 10.1109/ICDEW.2014.6818343
M3 - Conference contribution
AN - SCOPUS:84901752180
SN - 9781479934805
T3 - Proceedings - International Conference on Data Engineering
SP - 292
EP - 295
BT - 2014 IEEE 30th International Conference on Data Engineering Workshops, ICDEW 2014
PB - IEEE Computer Society
T2 - 2014 IEEE 30th International Conference on Data Engineering Workshops, ICDEW 2014
Y2 - 31 March 2014 through 4 April 2014
ER -