Source Information
Abstract
English
Heterogeneous structured data give rise to various information quality issues in real-world structured data. Duplicate records are one major issue. Strategies for eliminating duplicate records introduce uncertainty about which of the truly identical records to select. Available methods based on expert observation and destructive decisions have not proved to be effective solutions to such problems. This project addresses duplicate record elimination by treating the de-duplication procedure as a data access task with uncertain outcomes. It implements a method to overcome the uncertainty of duplicate records that tightly conceal the proper instances of the input, and it gives effective results for duplicate records.
Table of Contents
1. Introduction
2. Related Works
2.1. Query Evolution based on Entity Awareness
2.2. Query Outcomes Ranking
2.3. Query Evaluation for Unsure Predicates
2.4. Contingence Ranking Model of Partial Orders
2.5. Threshold Based Ranking Approach
2.6. Top-k based Query Processing
3. Conclusion
4. Future Work
References