簡易檢索 / 詳目顯示

研究生: 凌鈺城
Yu-Cheng Ling
論文名稱: 度量空間中可調整距離函數之反向最近臨點查詢
Reverse Nearest Neighbor Search in Metric Spaces with Adjustable Distance Functions
指導教授: 陳良弼
Arbee L.P. Chen
口試委員:
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊工程學系
Computer Science
論文出版年: 2007
畢業學年度: 95
語文別: 中文
論文頁數: 37
中文關鍵詞: 度量空間反向最近臨點
外文關鍵詞: Metric Space, Reverse Nearest Neighbor
相關次數: 點閱:2下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 近年來,反向K最近臨點(RkNN)的問題受到了合理的觀注,因為它可以被用在商業地點的選擇、以profile為基礎的行銷、或者資料探勘的用途上。然而,先前在度量空間中的RkNN演算法都不能允許使用者在每次查詢時使用不同的距離函式。在這篇論文中,我們定義了可調整距離函式之RkNN問題,並且提供一個完整的解決方案。


    In recent years, the reverse k-nearest neighbor (RkNN) problem in metric spaces
    has attracted reasonable attention because it can be applied to business location
    planning, profile-based marketing, clustering and outlier detection. However, previous
    works on the RkNN problem in metric spaces cannot allow users to assign different
    distance functions for distinct queries. In this thesis, we define the problem on RkNN
    query with adjustable functions. That is, for each distinct query, it can have its own
    distance function. To the best of our knowledge, this thesis is the first work solving
    the problem of RkNN with adjustable distances in metric spaces. We propose a
    generic framework to handle the RkNN queries with different distance functions. In
    our framework, a new index structure is built by using multiple distance functions.
    Therefore, some pruning rules determined by calculating the correlation between the
    distance function assigned by the user and those used to build the index structure can
    be provided to prune the irrelevant data points and then, the remainder candidates are
    really checked to see whether they are real results. The correctness of our pruning
    strategies is proved and the experiment results demonstrate the efficiency and
    effectiveness of our approach.

    Acknowledgement.........................................................................................................i Abstract........................................................................................................................ii Table of Contents ....................................................................................................... iii List of Figures..............................................................................................................iv 1 Introduction..........................................................................................................1 2 Related Work and Preliminaries ........................................................................5 2.1 Metric space index and NN search .....................................................5 2.2 RNN Algorithm in Euclidean space....................................................9 2.3 RNN Algorithm in Metric Space ......................................................11 2.4 Problem definition .............................................................................12 2.5 Overview of our framework..............................................................12 3 Methodology .......................................................................................................15 3.1 Intermediate node pruning ...............................................................15 3.2 Leaf object pruning............................................................................21 3.3 Refinement step..................................................................................24 3.4 MDM-Tree ..........................................................................................25 4 Experiment .........................................................................................................28 4.1 Experimental Settings and Discussions of Distance Functions......28 4.2 Experiment result...............................................................................30 5 Conclusion ..........................................................................................................35 6 Reference ............................................................................................................36

    [1] Elke Achtert, Christian Böhm, Peer Kröger, Peter Kunath, Alexey Pryakhin,
    Matthias Renz: “Efficient reverse k-nearest neighbor search in arbitrary metric
    spaces,” Proceedings of the ACM SIGMOD International Conference on
    Management of Data, pp. 515-526, 2006.
    [2] Elke Achtert, Christian Böhm, Peer Kröger, Peter Kunath, Alexey Pryakhin,
    Matthias Renz, “Approximate reverse k-nearest neighbor queries in general
    metric spaces,” Proceedings of the ACM CIKM International Conference on
    Information and Knowledge Management, pp. 788-789, 2006
    [3] Xia Cao, Shuai Cheng Li, Beng Chin Ooi, Anthony K. H. Tung: Piers, “An
    Efficient Model for Similarity Search in DNA Sequence Databases, ” SIGMOD
    Record, Volume 33, pp. 39-44, 2004.
    [4] Paolo Ciaccia, Marco Patella, “Searching in metric spaces with user-defined and
    approximate distances,” ACM Transactions on Database Systems, Volume 27, pp.
    398-437, 2002.
    [5] Paolo Ciaccia, Marco Patella, Pavel Zezula, “M-tree: An Efficient Access
    Method for Similarity Search in Metric Spaces,” Proceedings of International
    Conference on Very Large Data Bases, pp. 426-435, 1997.
    [6] Gísli R. Hjaltason, Hanan Samet, “Index-Driven Similarity Search in Metric
    Spaces,” ACM Trans. Database Systems, vol. 28, no. 4, pp. 517-580, 2003.
    [7] Flip Korn, S. Muthukrishnan, “Influence Sets Based on Reverse Nearest
    Neighbor Queries,” Proceedings of the ACM SIGMOD International Conference
    on Management of Data, pp. 201-212, 2000.
    [8] Alexandros Nanopoulos, Yannis Theodoridis, Yannis Manolopoulos, “C2P:
    Clustering Based on Closest Pairs,” Proceedings of the International Conference
    on Very Large Data Bases, pp. 331-340, 2001.
    [9] Amit Singh, Hakan Ferhatosmanoglu, Ali Saman Tosun, “High dimensional
    reverse nearest neighbor queries,” Proceedings of the ACM CIKM International
    Conference on Information and Knowledge Management, pp. 91-98, 2003.
    [10] Ioana Stanoi, Divyakant Agrawal, Amr El Abbadi, “Reverse Nearest Neighbor
    Queries for Dynamic Databases,“ ACM SIGMOD Workshop on Research Issues
    in Data Mining and Knowledge Discovery, pp. 44-53, 2000.
    [11] Yufei Tao, Man Lung Yiu, Nikos Mamoulis, “Reverse Nearest Neighbor Search
    37
    in Metric Spaces,” IEEE Transactions on Knowledge and Data Engineering,
    Volume 18, pp. 1239-1252, 2006.
    [12] Yufei Tao, Dimitris Papadias, Xiang Lian, “Reverse kNN Search in Arbitrary
    Dimensionality,” Proceedings of the International Conference on Very Large
    Data Bases, 744-755, 2004
    [13] Chenyi Xia, Wynne Hsu, Mong-Li Lee, “ERkNN: efficient reverse k-nearest
    neighbors retrieval with local kNN-distance estimation,” Proceedings of the
    ACM CIKM International Conference on Information and Knowledge
    Management, pp. 533-540, 2005.
    [14] Congjun Yang, King-Ip Lin, “An Index Structure for Efficient Reverse Nearest
    Neighbor Queries,” Proceedings of the International Conference on Data
    Engineering, pp. 485-492, 2001.

    無法下載圖示 全文公開日期 本全文未授權公開 (校內網路)
    全文公開日期 本全文未授權公開 (校外網路)

    QR CODE