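| Graduate student: | 孫玫如 Sun, Mei-ju |
|---|---|
| Thesis title: | 學術資料典藏系統技術發展 Technical Development of the Academic Data Preservation System |
| Advisor: | 石維寬 Shih, Wei-Kuan |
| Oral defense committee: | 徐讚昇, 衛信文, 許見章 |
| Degree: | Master |
| Department: | College of Electrical Engineering and Computer Science - Department of Computer Science |
| Year of publication: | 2013 |
| Academic year of graduation: | 101 (ROC academic year) |
| Language: | Chinese |
| Number of pages: | 52 |
| Keywords (Chinese): | iRODS、長期保存、異地備份、監控系統、資料網格 |
| Keywords (English): | iRODS, long-term preservation, remote backup, monitoring system, data grid |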
As the volume of digital data keeps growing, academia has developed the concept of the Institutional Repository (IR) to address the storage of large numbers of digital files. An IR keeps scholarly output such as research papers in one place, which protects these important resources from accidental loss or damage and also increases the visibility of the research. Digitization allows valuable assets to be well preserved, but maintaining a data preservation system consumes considerable manpower, money, and resources. Many countries are therefore investing in research on how to operate preservation systems over the long term. Keeping the system stable and ensuring data security and fault tolerance are especially important, because they determine how effective a preservation system actually is.
In Taiwan, academic institutions preserve their important digital assets separately, which is both ineffective and wasteful. We therefore used data grid technology to build a distributed storage system that gathers digital files from each institution and provides a unified user interface and a stable remote backup mechanism. Through successive upgrades and architecture changes, the system has been running for more than five years. By analyzing one year of system logs from different aspects, we characterize the system's contents, user operation patterns, and data properties. Identifying these characteristics and shortcomings lets us target problems with appropriate solutions and provides a basis for future improvements. The potential weaknesses we found also underline the importance of monitoring, so we improved the original monitoring system to give operators a more effective and efficient monitoring environment, which strengthens overall stability and supports the long-term operation of the preservation system.