Please use this identifier to cite or link to this item: http://hdl.handle.net/11455/19581
標題: 一個適合XML串流查詢的編碼機制
An XML Encoding Scheme for Streaming Query Processing
作者: 丁旭賢
Ting, Hsu-Hsien
關鍵字: XML;XML;XML stream;query processing;encoding;XPath;串流;XML串流查詢;編碼;XPath
出版社: 資訊科學與工程學系所
引用: [1] A. Gupta and S. S. Chawathe, “Skipping Streams with XHints,” Technical report, http://citeseer.ist.psu.edu/667942.html, 2004. [2] F. Peng and S. S. Chawathe, “XPath Queries on Streaming Data,” Proceedings of ACM SIGMOD Conference, 2003, pages 431-442. [3] I. Stanoi, C. A. Lang and S. Padmanabhan, “Hint and Run: Accelerating XPath Queries,” Proceedings of the 9th International Database Engineering & Application Symposium, 2005, pages 253-262. [4] J. Clark and S. DeRose, “XML Path Language (XPath),” http://www.w3.org/TR/xpath/, 1999. [5] J. K. Min, M. J. Park and C. W. Chung, “XTREAM: An Efficient Multi-query Evaluation on Streaming XML Data,” Information Sciences, Volume 177, Issue 17, 2007, pages 3300-3331. [6] J. Kim and S. Park, “SemFilter: A Simple and Efficient Semantic Filtering of XML Messages,” Proceedings of the 7th International Conference on Computer and Information Technology, 2007, pages 35-40. [7] L. V. S. Lakshmanan and S. Parthasarathy, “On Efficient Matching of Streaming XML Documents and Queries,” Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology, 2002, pages 142-160. [8] M. Altinel and M. J. Franklin, “Efficient Filtering of XML Documents for Selective Dissemination of Information,” Proceedings of the 26th International Conference on Very Large Data Bases, 2000, pages 53-64. [9] M. P. Consens and T. Milo, “Optimizing Queries on Files,” Proceedings of ACM SIGMOD Conference, 1994, pages 301 - 312. [10] R. H. Choi and R. K. Wong, “Efficient Processing of Branch Queries for High-performance XML Filtering,” Proceedings of the 2nd International Conference on Scalable Information Systems, Article No.50, 2007. [11] S. Park and H. J. Kim, “A New Query Processing Technique for XML Based on Signature,” Proceedings of the 7th International Conference on Database Systems for Advanced Applications, 2001, pages 22-29. [12] S. Park and H. J. Kim, “SigDAQ: an Enhanced XML Query Optimization Technique,” Journal of Systems and Software, Volume 61, Number 2, 2002, pages 91-103. [13] T. Bray, J. Paoli, C. M. Sperberg-McQueen and E. Maler, “Extensible Markup Language (XML) 1.0 (Fourth Edition),” W3C Recommendation, Available at http://www.w3.org/TR/REC-xml/, 2006. [14] T. J. Green, G. Miklau, M. Onizuka and D. Suciu, “Processing XML Streams with Deterministic Automata,” Proceedings of the 9th International Conference on Database Theory, 2003, pages 173-189. [15] V. Apparao et al., “Document Object Model (DOM) Level 1 Specification,” W3C Recommendation, Available at http://www.w3.org/TR/REC-DOM-Level-1/, 1998. [16] Y. Chen, S. B. Davidson and Y. Zheng,“ViteX: A Streaming XPath Processing System,” Proceedings of the 21st International Conference on Data Engineering, 2005, pages 1118-1119. [17] Y. Chen, G. A. Mihaila, S. B. Davidson and S. Padmanabhan, “EXPedite: A System for Encoded XML Processing,” Proceedings of the 13th ACM International Conference on Information and Knowledge Management, 2004, pages 108-117. [18] Y. Chen, G. A. Mihaila, S. B. Davidson and S. Padmanabhan, “Efficient Path Query Processing on Encoded XML,” Proceedings of International Workshop on High Performance XML Processing, 2004. [19] Y. Chen, Y. Shi, and Y. Chen, “Tree Inclusion Algorithm, Signatures and Evaluation of Path-oriented Queries,” Proceedings of the 2006 ACM Symposium on Applied Computing, 2006, pages 1020-1025. [20] Y. Diao and M. J. Franklin, “Query Processing for High-Volume XML Message Brokering,” Proceedings of the 29th International Conference on Very Large Data Bases, 2003, pages 261-272. [21] Protein Information Resource, http://pir.georgetown.edu/. [22] GSFC/The Nasa XML Project, http://xml.nasa.gov/xmlwg/index.htm. [23] The XML Data Repository, University of Washington Computer Science & Engineering, http://www.cs.washington.edu/research/xmldatasets/. [24] The Penn Treebank Project, http://www.cis.upenn.edu/~treebank/. [25] XMark — An XML Benchmark Project, http://monetdb.cwi.nl/xml/.
摘要: 
近年來,XML已經是網際網路上通用的標準資料交換格式,而串流XML資料更是適合在網路環境下傳遞的一種方式,尤其是需要傳輸大量的XML資料時,因此衍生了XML串流查詢技術的研究議題。XML串流查詢的好處是可以在查詢過程中就把已經確定的查詢結果輸出,主要可應用於講求效率的電子商務和線上資訊服務等領域。然而,在串流查詢過程中,由於用戶端無法預知接下來的文件內容,所以必須對接收到的資料逐一還原,判斷是否與查詢有關系,進而決定要加以處理或是直接丟棄。當用戶端為資源有限的手持式行動裝置,並透過無線網路查詢XML串流資料時,查詢過程中會因為多餘的資料辨識動作而虛耗無謂的電力。
針對上述問題,我們提出一個XML串流查詢技巧:UBC,它可以減少串流查詢XML文件的過程中必須處理的節點數量。用戶端在串流查詢時可以將不需要的資料直接略過,節省行動裝置的電力消耗,進而提昇查詢效能。UBC的作法是,在XML文件上每一個節點附加少量的額外資訊,記錄著子孫節點的標籤資訊摘要,查詢時能夠透過這些資訊判斷節點的子樹範圍是否可以略過不予處理。
最後,我們進行實驗以評估我們的方法。實驗結果顯示,在大部分的情況下,UBC能夠略過的節點數量都比基於Signature的方法還多,UBC的查詢時間也能夠因為減少了節點的處理而有效的降低。
URI: http://hdl.handle.net/11455/19581
其他識別: U0005-3107200820351500
Appears in Collections:資訊科學與工程學系所

Show full item record
 

Google ScholarTM

Check


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.