研究生: |
黃聖捷 Sheng-Chieh Huang |
---|---|
論文名稱: |
P2P系統上之連續性聚合查詢 Continuous Aggregate Queries over P2P Systems |
指導教授: |
陳宜欣
Yi-Shin Chen |
口試委員: | |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 資訊工程學系 Computer Science |
論文出版年: | 2007 |
畢業學年度: | 95 |
語文別: | 英文 |
論文頁數: | 29 |
中文關鍵詞: | P2P 、連續性聚合查詢 |
外文關鍵詞: | P2P, continuous aggregation query |
相關次數: | 點閱:1 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
在P2P網路中,整個系統的統計資訊可以藉由聚合查詢(aggregation queries)得到。而其中對於串流資料(data stream)要能夠提供不受時間影響以及精確的答案是尤其重要的。我們提出了一個在階層式P2P(hierarchical P2P)基於給定限定條件(constraint-based)而能夠進行連續性聚合性查詢(continuous aggregation queries)的方法。我們的方法(簡稱CARE)利用一個由使用者決定的精確容忍度(precision tolerance)去達到精確度與通訊成本(communication cost)的平衡。經由實驗的結果可知,我們的方法大幅降低了通訊成本,並且能夠適應性的(adaptively)、漸進的(progressively)得到在容許範圍之內的答案。
Providing time-dependent and accurate answers to queries over continuous data stream
is crucially required for peer-to-peer (P2P) networks, in which the global statistical information
can be obtained through aggregation queries. We proposed a constraint-based
approach on continuous aggregation queries over hierarchical P2P system. Our technique
(termed CARE) acquires a user-specified precision tolerance that manipulates the
balance between accuracy and communication cost by users’ requests. As shown in
the experimental results, it dramatically reduces the communication cost and obtains the
specified tolerable answers adaptively and progressively.
[1] Can-o-sleep dataset. http://kdl.cs.umass.edu/data/canosleep/
canosleep-info.html.
[2] Gnutella v0.4. http://rfc-gnutella.sourceforge.net/
developer/stable/index.html.
[3] Gnutella v0.6. http://rfc-gnutella.sourceforge.net/
developer/testing/index.html.
[4] Jxta. https://jxta.dev.java.net/.
[5] Kazaa. http://www.kazaa.com.
[6] Peersim. http://peersim.sourceforge.net.
[7] UCI KDD Archive. http://kdd.ics.uci.edu/.
[8] Freenet: a distributed anonymous information storage and retrieval system. Springer-Verlag New York, Inc., 2001.
[9] Robert Alcock. Synthetic control chart time series, 1999. http://kdd.ics.
uci.edu/databases/synthetic_control/synthetic_control.
html.
[10] Benjamin Arai, Gautam Das, Dimitrios Gunopulos, and Vana Kalogeraki. Approximating
aggregation queries in peer-to-peer networks. In ICDE, page 42. IEEE
Computer Society, 2006.
[11] Arvind Arasu, Shivnath Babu, and Jennifer Widom. The cql continuous query language:
semantic foundations and query execution. The VLDB Journal, 15(2):121–
142, 2006.
[12] Brian Babcock, Shivnath Babu, Mayur Datar, Rajeev Motwani, and Jennifer
Widom. Models and issues in data stream systems. In PODS, pages 1–16, 2002.
[13] Brian Babcock and Chris Olston. Distributed top-k monitoring. In SIGMOD Conference,
pages 28–39. ACM, 2003.
[14] Mayank Bawa, Hector Garcia-Molina, Aristides Gionis, and Rajeev Motwani. Estimating
aggregates on a peer-to-peer network. Technical report, Computer Science
Department, Stanford University, 2003.
[15] Alan Brown and M. Kolberg. Tools for peer-topeer
network simulation. http://tools.ietf.org/
html/draft-irtf-p2prg-core-simulators-00
[16] Donald Carney, Ugur C¸ etintemel, Mitch Cherniack, Christian Convey, Sangdon
Lee, Greg Seidman, Michael Stonebraker, Nesime Tatbul, and Stanley B. Zdonik.
Monitoring streams - a new class of data management applications. In VLDB, pages
215–226, 2002.
[17] Jen-Yeu Chen, Gopal Pandurangan, and Dongyan Xu. Robust computation of aggregates
in wireless sensor networks: distributed randomized algorithms and analysis.
In IPSN, pages 348–355. IEEE, 2005.
[18] A. Deligiannakis, Y. Kotidis, and N. Roussopoulos. Hierarchical in-network data
aggregation with quality guarantees, 2004.
[19] Alan J. Demers, Daniel H. Greene, Carl Hauser, Wes Irish, John Larson, Scott
Shenker, Howard E. Sturgis, Daniel C. Swinehart, and Douglas B. Terry. Epidemic
algorithms for replicated database maintenance. In PODC, pages 1–12, 1987.
[20] Ma´rk Jelasity, Alberto Montresor, and O¨ zalp Babaoglu. Gossip-based aggregation
in large dynamic networks. ACM Trans. Comput. Syst., 23(3):219–252, 2005.
[21] Alberto Montresor. A robust protocol for building superpeer overlay topologies. In
Peer-to-Peer Computing, pages 202–209, 2004.
[22] C. Olston, J. Jiang, and J. Widom. Adaptive filters for continuous queries over
distributed data streams. In Proceedings of the 2003 ACM SIGMOD international
conference on Management of data, pages 563–574, San Diego, California, 2003.
[23] Alberto Gonzalez Prieto and Rolf Stadler. Adaptive distributed monitoring with
accuracy objectives. In INM ’06: Proceedings of the 2006 SIGCOMM workshop
on Internet network management, pages 65–70, New York, NY, USA, 2006. ACM
Press.
[24] Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, and Scott Schenker.
A scalable content-addressable network. In Proceedings of the ACM Conference
on Applications, Technologies, Architectures, and Protocols for Computer Communications,
pages 161–172, August 2001.
[25] Robbert Van Renesse, Kenneth P. Birman, andWerner Vogels. Astrolabe: A robust
and scalable technology for distributed system monitoring, management, and data
mining. ACM Trans. Comput. Syst., 21(2):164–206, 2003.
[26] Stefan Saroiu, P. Krishna Gummadi, and Steven D. Gribble. A measurement study
of peer-to-peer file sharing systems. In Proceedings of Multimedia Computing and
Networking 2002 (MMCN ’02), San Jose, CA, USA, January 2002.
[27] Ion Stoica, Robert Morris, David Karger, Frans Kaashoek, and Hari Balakrishnan.
Chord: A scalable Peer-To-Peer lookup service for internet applications. In Proceedings
of the 2001 ACM SIGCOMM Conference, pages 149–160, 2001.
[28] Beverly Yang and Hector Garcia-Molina. Designing a super-peer network. In IEEE
International Conference on Data Engineering, 2003, 2003.