원문정보
초록
영어
Peer-to-peer (P2P) databases are becoming prevalent on the Internet for distribution and sharing of documents, applications, and other digital media. The problem of answering large-scale ad hoc analysis queries, for example, aggregation queries, on these databases poses unique challenges. Exact solutions can be time consuming and difficult to implement, for the distributed and dynamic nature of P2P databases. In this paper, we have presented novel sampling-based techniques for approximate answering of ad hoc aggregation queries in such databases. Computing a high-quality random sample of the database efficiently in the P2P environment is complicated due to several factors: the data is distributed (usually in uneven quantities) across many peers, within each peer the data is often highly correlated and moreover, even collecting a random sample of the peers is difficult to accomplish. To formulate these problems, developed software uses approach, based on random walks of the P2P graph, as well as block-level sampling techniques. We have presented here extensive experimental evaluations to demonstrate the feasibility of our solution. The modules that are included in this project are: 1. Peer-to-Peer Node Construction 2. Random Selection of Node 3. Selection of Records 4. Performance Evaluation
목차
1. Introduction
2. Problem Definition
3. Literature Survey
4. Our Approach
4.1 AQP Phases
4.2 Basic Terms
4.3. Challenges
5. Research Methodology
5.1 Unstructured P2P Network
5.2. Query Processing
6. Random-walk Algorithm
7. Proposed Modules
8. Experimental Evaluation
8.1. Evaluation Metrics
8.2 Input Parameters
9. Experiments
9.1. P2P Databases
9.2. A Typical model for AQP in P2P Network
9.3. Graph Plots
10. Conclusion
References