簡易檢索 / 詳目顯示

研究生: 姚政岐
Yao, Cheng-Chi
論文名稱: 轉換舊有數據至RDF格式之工具
X2R: A tool for transform legacy data to RDF format
指導教授: 張韻詩
Jane W.S Liu
口試委員: 朱宗賢
金仲達
邵家健
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊工程學系
Computer Science
論文出版年: 2016
畢業學年度: 104
語文別: 英文
論文頁數: 82
中文關鍵詞: 鏈結資料舊有資料
外文關鍵詞: linked data, legacy data
相關次數: 點閱:1下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • X2R是一個能夠幫助使用者將舊有資料輕鬆的轉換至RDF格式的工具,以現有的技術要將舊有資料轉換至RDF是十分耗費使用者對於鏈結資料相關技術知識並且相當耗時。現有轉換技術大多為在轉換前先處理舊有資料,使其能符合轉換工具之要求,若對於此轉換流程不熟悉的初學者直接將尚未經過處理的舊有資料進行轉換,則有可能會產生bad node (壞點),而X2R的訴求就是要幫助使用者處理這些包含bad nodes的檔案。X2R能夠幫助使用者至網路上搜尋能夠取代這些bad nodes的URI(統一資源標誌符)並且將其替換,以產出能夠被重複利用、完整的RDF檔案。如此一來轉換的門檻降低,相信鏈結資料也能更加的普及。


    X2R is a tool for translating data in legacy formats to linked data in RDF format. By observing, the conversion processes are challenging and time consuming for many developers in general. Pre-processing of the input data is typically necessary in order to capture the semantics of the input data and make the data conform to the conventions of the selected translator. If a novice developer decided to skip this step and to use the default settings. As a result, this RDF file produced automatically often contain temporary URIs generated by the translator. The file may also have blank nodes. Clearly, one cannot search for these types of nodes. Their presence in a file hinders the discovery of the content of the file. For this reason, we refer to them as bad nodes. To improve the quality of the RDF file produced by post-processing. We present X2R to help the developer who is not familiar with RDF format and related techniques. X2R performs the replacement of bad nodes minted by translators with reusable URIs found in popular endpoints and internal ontology by supporting URI searching and URI replacing functionalities.

    中文摘要 i Abstract ii Acknowledgement iii Contents iv List of Tables v List of Figures vi Chapter 1 Introduction 1 1.1 Motivation and Goal 1 1.2 Major Components 3 1.3 Contributions 4 1.4 Organization 5 Chapter 2 Related Works 7 2.1 Semantic Web 7 2.2 Linked Data 8 2.2.1 Uniform Resource Identifiers 8 2.2.2 Resource Description Framework 10 2.2.3 Ontology 12 2.2.4 SPARQL 13 2.3 Pre-processing Tools 15 2.3.1 OpenRefine 17 Chapter 3 Use Scenario 21 3.1 Case study 21 3.2 Major Steps 24 3.2.1 Extract Bad Nodes and Semantics 24 3.2.2 Search for and Mint Reusable URI 27 3.2.3 Node-URI mapping 29 Chapter 4 Design and Implement 31 4.1 Extractor 31 4.1.1 RDF Graph 32 4.1.2 Tokenizer 34 4.1.3 Web API Specification 35 4.2 URI Search Service 38 4.2.1 Components 39 4.2.2 Web API Specification 43 4.3 URI Management Service 47 4.4 Mapper 49 4.4.1 Input / Output 49 4.4.2 Web API Specification 50 4.5 X2R Graphical User Interface 52 Chapter 5 Summary and Future Work 56 Appendix 57 A.1 Sample code of Extractor 57 A.2 Sample code of USS 66 A.3 Sample code of Mapper 73 A.4 Add new USS components 78 A.4.1 Add a new Endpoint 78 A.4.2 Add a new ranker 79 A.4.3 Add a new filter 79

    [1] Extensible Markup Language (XML), http://www.w3.org/XML/
    [2] Comma-Separated Values (CSV), https://tools.ietf.org/html/rfc4180/
    [3] JavaScript Object Notation (JSON), https://tools.ietf.org/html/rfc4627
    [4] Resource Description Framework (RDF), http://www.w3.org/RDF/
    [5] Uniform Resource Identifiers (URI), https://www.ietf.org/rfc/rfc2396.txt
    [6] Bizer, C., Heath, T. and Berners-Lee, T., Linked Data - The Story So Far. International Journal on Semantic Web and Information Systems (IJSWIS), 5(3), pp. 1–22, 2009.
    [7] Berners-Lee, T., Linked Data. http://www.w3.org/DesignIssues/LinkedData.html.
    [8] Bauer, F. and Kaltenbock, M., Linked Open Data: The Essentials. REEEP, 2011.
    [9] LOD2 Project. http://lod2.eu/WikiArticle/Project.html.
    [10] SIMILE Porject. http://simile.mit.edu/.
    [11] Converter to RDF. http://www.w3.org/wiki/ConverterToRdf
    [12] Stolz, A., Rodriguez-Castro, B., and Hepp, M., RDF Translator: A RESTful Multi-Format Data Converter for the Semantic Web. Technical Report TR-2013-1, E-Business and Web Science Research Group, 2013.
    [13] Generic XML to RDF converter, http://sourceforge.net/projects/xmltordf/
    [14] RDFizer Concept,
    http://wiki.opensemanticframework.org/index.php/RDFizer_Concept
    [15] SPARQL 1.1 Overview,
    http://www.w3.org/TR/2013/REC-sparql11-overview-20130321/
    [16] World Wide Web Consortium (W3C), W3C Semantic Web Activity, 2011.
    http://www.w3.org/2001/sw/
    [17] The Standards of Semantic Web, http://www.w3.org/2001/sw/wiki/Main_Page
    [18] Uniform Resource Locators (URL), RFC 1738, 1994.
    [19] Functional Requirements for Uniform Resource Names (URN), RFC 1737, 1994.
    [20] RDF Primer
    http://www.w3.org/TR/2004/REC-rdf-primer-20040210/
    [21] NoSQL https://en.wikipedia.org/wiki/NoSQL
    [22] MongoDB https://www.mongodb.org/
    [23] Fadi Maali, Getting to the Five-Star: From Raw Data to Linked Government Data., National University of Ireland Galway. 2011
    [24] OpenRefine, http://openrefine.org/
    [25] List of Hawaii Emergency Shelters
    http://www.arcgis.com/sharing/rest/content/items/3f0f20ae77c0447cb5fa2a15038d0520/data
    [26] Ghislain Atemezing, Oscar Corcho, Daniel Garijo , José Mora, María Poveda-Villalón, Pablo Rozas, Daniel Vila-Suero and Boris Villazón-Terrazas “Transforming Meteorological Data into Linked Data” IOS Press (2011) 1-5
    [27] DBpedia SPARQL endpoint, http://dbpedia.org/sparql
    [28] RDF Translator, http://rdf-translator.appspot.com/
    [29] BnodeSkolemization, http://www.w3.org/wiki/BnodeSkolemization
    [30] Heath, T. and Bizer, C., Linked Data Evolving the Web into a Global Data Space. Morgan & Claypool, 2011.
    [31] Neologism - Easy Vocabulary Publishing. http://neologism.deri.ie/
    [32] URI Generic Syntax, RFC 3986, https://www.ietf.org/rfc/rfc3986.txt
    [33] Converting XML to RDF, Bob DuCharme, 2004 http://www.xml.com/pub/a/2004/09/01/tr.html
    [34] Christian Weiss, Transferring Open Government Data into the global Linked Open Data Cloud, 2013
    [35] Fadi Maali, Richard Cyganiak, Vassilios Peristeras, Re-using Cool URIs: Entity Reconciliation Against LOD Hubs., In Proceedings of the Linked Data on the Web Workshop 2011 (LDOW2011) Workshop at WWW2011, 2011
    [36] Grigoris Antoniou, Frankvan Harmelen, Web Ontology Language:OWL, International Handbooks on Information Systems pp 67-92

    無法下載圖示 全文公開日期 本全文未授權公開 (校內網路)
    全文公開日期 本全文未授權公開 (校外網路)

    QR CODE