研究生: |
江顯揚 Xian-Yang Jiang |
---|---|
論文名稱: |
在個人電腦叢集系統上針對 Fortran 90 之平行稀疏矩陣函式庫的效能改進 Performance of Parallel Sparse Supports for Fortran 90 on PC-based Networks of Clusters |
指導教授: |
李政崑
Jenq Kuan Lee |
口試委員: | |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 資訊工程學系 Computer Science |
論文出版年: | 2000 |
畢業學年度: | 88 |
語文別: | 中文 |
論文頁數: | 48 |
中文關鍵詞: | 電腦效能 、分散式處理器 、多處理器 、數值程式設計 、數值方法 |
外文關鍵詞: | Computer performance, Distributed processors, Multiprocessors, Mathematical programming, Numerical methods |
相關次數: | 點閱:2 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
個人電腦叢集系統(PC Clusters)為近來被廣泛使用的實驗平臺之一,主要是因為個人電腦叢集系統的價格比一般大型工作站便宜,但卻又擁有很強大的計算能力,並且架設容易。為了評估在個人電腦叢集系統的計算能力以及將它的使用效能最大化,觀察在個人電腦叢集系統上執行處理稀疏資料集(sparse data set)的數值應用程式(numerical applications)的效能是十分有趣的。
稀疏矩陣資料的運算可以應用在許多科學運算的領域中,像是流體力學(CFD)、數值分析(numerical analysis)和電路模擬(circuit simulations)等等。而在目前的一些先進的架構中,處理一般資料的效能和稀疏矩陣的效能似乎有了差距。為了解決這個問題,[Chang 98] 曾經在 IBM SP2 上完成了一套支援 Fortran 90 矩陣內建函式(array intrinsic functions)的平行函式庫。
在這份論文中,我們使用了這套函式庫,完成了兩件事。第一,為了評估電腦叢集系統上處理稀疏矩陣的效能,我們將 [Chang 98] 的函式庫從原先的 IBM SP2 移植到個人電腦叢集系統上,並且做了效能的調整,我們有實驗數據來說明調整後的效能。第二,我們為了讓此函式庫能夠支援更多需求,我們提出了一個有彈性的架構,讓原先的函式庫能夠很容易的支援像是 SuperLU 的數值函式庫。我們的實驗平台是在國家高速電腦中心所提供的八個節點的個人電腦叢集系統上。
PC-based networks of clusters, abbreviated to PC Clusters, have become an important research testbed recently, as their competitive computing powers, inexpensive costs, scalability, and fast interconnections among CPUs are considered. In order to evaluate the computing power of PC Clusters and maximize their performance, it is very interesting to observe the performance results of numerical applications on sparse data sets on PC Clusters. [Chang 98] has designed and implemented a parallel sparse library supporting array intrinsic operations of Fortran 90 on IBM SP2. In this thesis, we have accomplished two results. First, we extend the sparse library developed by [Chang 98] from IBM SP2 to PC Clusters and tune the performance of library on PC Clusters. We will report the performance results in this case. Second, we present a flexible framework to extend the support of the sparse library towards direct numerical solvers such as SuperLU and Cholesky factorization. Our experiments were done on the 8-node PC Clusters at National Center for High-Performance Computing of Taiwan.
[Adams 92] Jeanne C. Adams, Walter S. Brainerd, Jeanne T. Martin, Brian T. Smith, and Jerrold L. Wagener, Fortran 90 handbook complete ansi/iso reference, Intertext Publications McGraw-Hill Book Company, 1992.
[Ashcraft 87] C. Ashcraft, R. Grimes, J. Lewis, B. Peyton, and H. Simon, "Progress in sparse matrix methods for large sparse systems on vector supercomputers," International Journal of Supercomputer Applications, no. 1, pp. 10-30, 1987.
[Becker 95] Donald J. Becker, Thomas Sterling, Daniel Savarese, John E. Dorband, Uday A. Ranawak, Charles V. Packer, "BEOWULF: A Parallel Workstation for Scientic Computation," International Conference on Parallel Processing, 1995.
[Becker 95a] Donald J. Becker, Thomas Sterling, Daniel Savarese, Bruce Fryxell, Kevin Olsen, "Communication Overhead for Space Applications on the Beowulf Parallel Workstation," High Performance Distributed Computing Conference, August 1-4, 1995.
[Beowulf URL] http://www.beowulf.org/
[Bodin 93] F. Bodin, P. Beckman, D. Gannon, S. Narayana, and S. Yang, "Distributed pC++: Basic ideas for an object parallel language, "Scientific Programming, Vol. 2, No. 3, 1993.
[Chang 98] Rong Guey Chang, Tyng Ruey Chuang, and Jenq Kuen Lee, "Efficient Support of Parallel Sparse Computation for Array Intrinsic Functions of Fortran 90," ACM International Conference on Supercomputing, Melbourne, July 13-17, 1998.
[Chang 99] Rong Guey Chang, Tyng Ruey Chuang, and Jenq Kuen Lee, "Compiler Optimizations for Parallel Sparse Programs with Array Intrinsics of Fortran 90," International Conference on Parallel Processing, September 1999.
[Choudhary 98] Sanjay Goil and Alok Choudhary, "High Performance OLAP and Data Mining on Parallel Computers", Proceedings of IPPS/SPDP '98, Orlando, April 1998.
[Demmel 99] James W. Demmel, Stanley C. Eisenstat, John R. Gilbert, Xiaoye S. Li, and Joseph W. H. Liu, "A supernodal approach to sparse partial pivoting," SIAM Journal on Matrix Analysis and Applications, vol20(3), pp. 720-755, 1999.
[Demmel 99a] James W. Demmel, John R. Gilbert, and Xiaoye S. Li. SuperLU users' guide, University of California, Berkeley.
[Duff 83] I. Duff and J. Reid, "The multifrontal solution of indefinite sparse symmetric linear equations," ACM Transaction Mathematical Software, no. 9, pp. 302-325, 1983.
[Gilbert 92] John R. Gilbert, C. Moler, and R. Schreiber, "Sparse matrices in MATLAB: Design and implementation," SIAM Journal on Matrix Analysis and Applications, no. 13(1), pp. 333-356, January 1992.
[Hwang 95] Gwan-Hwan Hwang, Jenq Kuen Lee, and Dz ching Ju, "An array operation synthesis scheme to optimize Fortran 90 programs," Proceedings of the Fifth ACM SIGPLAN Symposium on Principles & Practice of Parallel Programming, July 1995.
[Koelbel 94] Charles H. Koelbel, David B. Loveman, Robert S. Schreiber, Guy L. Steele Jr, and Mary E. Zosel, The high performance Fortran handbook, Scientific and Engineering Computation Series, The MIT Press, 1994.
[Kumar 98] Mahesh V. Joshi, George Karypis, and Vipin Kumar, "ScalParC: A New Scalable and Efficient Parallel Classification Algorithm for Mining Large Datasets", Proceedings of IPPS/SPDP '98, Orlando, April 1998.
[Min 2000] Min Chen, Kuo-Chan Huang, and Hsi-Ya Chang, "An experimental PC Cluster management software environment, " High Performance Computing 2000 (HPC2000), pp. 289-294, April 2000.
[NCHC URL] http://www.nchc.gov.tw/RESEARCH/pccluster/
[Press 96] William H. Press, Saul A. Teukolsky, William T. Vetterling, and Brian P. Flannery, Numerical Recipes in Fortran 90: The Art of Scientific Computing, Cambridge University Press, 1996.
[Ridge 97] Daniel Ridge, Donald Becker, Phillip Merkey, and Phillip Merkey, "Beowulf: Harnessing the Power of Parallelism in a Pile-of-PCs," Proceedings of IEEE Aerospace, 1997.
[Sterling 99] Thomas L. Sterling, John Salmon, Donald J. Becker, and Daniel F. Savarese, How to Build a Beowulf - A Guide to the Implementation and Application of PC Clusters, The MIT Press, 1999.