
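Author: 孫玫如 (Sun, Mei-ju)
Thesis title: 學術資料典藏系統技術發展 (Technical Development of the Academic Data Preservation System)
Advisor: 石維寬 (Shih, Wei-Kuan)
Committee members: 徐讚昇, 衛信文, 許見章
Degree: Master's
Department: College of Electrical Engineering and Computer Science, Department of Computer Science
Year of publication: 2013
Academic year of graduation: 101 (ROC calendar)
Language: Chinese
Number of pages: 52
Keywords (Chinese): iRODS, 長期保存, 異地備份, 監控系統, 資料網格
Keywords (English): iRODS, long-term preservation, remote backup, monitoring system, data grid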
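  • As the volume of digital data keeps growing, academia began developing the concept of the institutional repository to address the storage of large numbers of digital files. Academic papers and other research output are held centrally, both to keep these important resources from being lost or damaged by accident and to increase the visibility of the research. Digitization allows valuable assets to be well preserved, but maintaining a data preservation system consumes considerable manpower, money, and resources. Many countries are therefore investing in research on how to operate such systems over the long term. Keeping the system stable and ensuring data security and fault tolerance are especially important, because they determine how effective a preservation system is in practice.
    In Taiwan, academic institutions each preserve their important assets separately, an approach that is both ineffective and wasteful. We therefore used data grid technology to build a distributed storage system that provides a unified interface and a stable remote backup mechanism. After repeated updates and architectural adjustments, the system has been running for more than five years. Its operational record lets us analyze the system's contents, user operation patterns, and data characteristics. Identifying the system's characteristics and shortcomings makes it possible to target problems with appropriate solutions and provides a basis for future improvements. In addition, the discovery of latent shortcomings in the system underscored the importance of monitoring, so we improved the existing monitoring system to give operators a better and more efficient monitoring environment, strengthening the stability of the whole system and sustaining the long-term operation of this data preservation system.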


    To preserve the large and growing volume of digital data, many academic institutions are researching Institutional Repositories (IRs). An IR not only protects important research results from damage but also increases their visibility. Maintaining a long-term data preservation system can cost a great deal of money and resources, so many countries are investing heavily in how to store digital data permanently, especially in fault tolerance and data security.
    In Taiwan, academic institutions preserve their digital assets separately, which makes preservation inefficient and wasteful. We set up a distributed data preservation system using data grid technology to gather digital files from each institution, providing a consistent user interface and a stable backup mechanism. The system has run for over five years, through system upgrades and architecture modifications. By analyzing one year of system logs from different aspects, we identify some characteristics and shortcomings of the preservation system. Because the analysis shows that the system has potential weak points, monitoring becomes increasingly important. We therefore optimize the original monitoring system to provide an efficient monitoring environment, improve stability, and ultimately sustain the data preservation system over the long term.

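    1. Introduction
    2. Literature Review
    2.1. Digital Data Preservation Systems
    2.2. Data Grid Middleware
    2.2.1. SRB
    2.2.2. iRODS
    2.3. Nagios Monitoring Software
    3. Academia Sinica Digital Archive System
    3.1. System Architecture
    3.2. UrSpace GUI Tool
    3.2.1. UrSpace Implementation
    3.2.2. UrSpace Components
    3.3. Sync Package File Suite
    3.3.1. Sync Package Workflow
    3.3.2. Multilingual Support
    3.3.3. Parallel Processing of File Encryption
    4. System Analysis
    4.1. Analysis Method
    4.2. Accumulated Data Volume of the Preservation System
    4.3. User Read/Write Patterns
    4.4. File Size
    4.5. Locality
    4.6. System Operation Analysis Results
    5. Monitoring System
    5.1. Monitoring System Architecture
    5.2. Monitoring System Component Architecture
    5.3. Monitoring Host Failover Mechanism
    5.4. Tree-Structured Monitoring Interface
    5.4.1. Tree-Structured Monitoring Interface Design
    5.4.2. Tree-Structured Monitoring Interface Characteristics
    6. Conclusion and Future Work
    7. References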


    Full text (on-campus network): not authorized for public release
    Full text (off-campus network): not authorized for public release
