研究生: |
鄭百成 Jeng, Bai-Cheng |
---|---|
論文名稱: |
異質計算模擬器框架 HSAemu framework |
指導教授: |
鍾葉青
Chung, Yeh-Ching |
口試委員: |
游逸平
Yi-Ping You 徐慰中 Wei-Chung Hsu 吳真貞 Jan-Jan Wu 朱子青 Roy Ju |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 資訊工程學系 Computer Science |
論文出版年: | 2014 |
畢業學年度: | 102 |
語文別: | 英文 |
論文頁數: | 29 |
中文關鍵詞: | 異質計算架構 、模擬器 、OpenCL |
相關次數: | 點閱:1 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
異質系統架構是一個為了支持廣泛資料平行和任務平行之程式模型的工業標準。現在,大多數支援異質系統架構的硬體和軟體仍在開發中,此時提供一個模擬環境給異質系統架構的開發者研發異質系統架構的軟體層將會很有幫助。本文提出一個異質計算模擬器框架,基於PQEMU和Multi2Sim的系統模擬器。該框架提供一個通用的介面,用以整合不同的模擬器,整個框架包含異質系統架構執行時期程式庫、異質系統架構圖形處理器之模擬器和中央處理器之模擬器。在我們初步的實驗中,異質計算模擬器框架已通過許多基準測試,可以幫助開發人員測試完整的軟體層和系統效能。
Heterogeneous System Architecture (HSA) is an industry standard designed to assist a wide variety of data-parallel and task-parallel programming models. Currently, most of HSA hardware and software components are either still in development. It is useful to provide an emulation environment for HSA developers in developing HSA software stacks. In this paper, we introduce the design of HSAemu framework, a full system emulator for the HSA platform, which is based on PQEMU and Multi2Sim. HSAemu framework provides a generic interface, which can integrate different simu-lator and emulator to HSAemu, the component of the framework including HSA runtime, HSA GPU emulator and CPU emulators. In our preliminary experiments, HSAemu framework already passed several benchmarks. It can help developers test-ing a complete HSA software stack and profile system performance.
[1] Advanced Micro Devices. (2013) AMD OpenCL™ Accelerated Parallel Processing SDK. [Online]. http://developer.amd.com/tools-and-sdks/opencl-zone/opencl-tools-sdks/amd-accelerated-parallel-processing-app-sdk/
[2] A. Bakhoda, G.L. Yuan, W.W.L. Fung, H. Wong, and T.M. Aamodt, "Analyzing CUDA workloads using a detailed GPU simulator," Performance Analysis of Systems and Software, pp. 163-174, 2009.
[3] Fabrice Bellard, "QEMU, a fast and portable dynamic translator," USENIX Annual Technical Conference, pp. 41-46, 2005.
[4] Nathan Binkert et al., "The gem5 Simulator," ACM SIGARCH Computer Architecture News, pp. 1-7, May 2011.
[5] David Brooks, Vivek Tiwari, and Margaret Martonosi, "Wattch: A Framework for Architectural-Level Power Analysis and Optimization," Computer Architecture, pp. 83-94, June 2000.
[6] Shuai Che et al., "Rodinia: A benchmark suite for heterogeneous computing," IEEE International Symposium on Workload Characterization, pp. 44-54, 2009.
[7] S. Collange, M. Daumas, D. Defour, and D. Parello, "Barra: A Parallel Functional Simulator for GPGPU," Modeling, Analysis & Simulation of Computer and Telecommunication Systems, pp. 351-360, 2010.
[8] Jiun-Hung Ding, Po-Chun Chang, Wei-Chung Hsu, and Yeh-Ching Chung, "PQEMU: A Parallel System Emulator Based on QEMU," IEEE International Conference on Parallel and Distributed Systems, pp. 276-283, 2011.
[9] HSA foundation, "HSA Platform System Architecture Specification," pp. 1-62, Mar. 2014.
[10] HSA foundation, "HSA Programmer’s Reference Manual: HSAIL Virtual ISA and Programming Model, Compiler Writer’s Guide, and Object Format (BRIG)," pp. 1-418, June 2013.
[11] HSA foundation, "HSA Runtime Programmer’s Reference Manual," pp. 1-117, June 2014.
[12] Shih-Hao Hung, Tei-Wei Kuo, Chi-Sheng Shih, and Chia-Heng Tu, "System-wide profiling and optimization with virtual machines," Asia and South Pacific Design Automation Conference, pp. 395-400, 2012.
[13] Khronos OpenCL Working Group, "The OpenCL Specification Version: 2.0," pp. 1-284, Mar. 2014.
[14] Jingwen Leng et al., "GPUWattch: Enabling Energy Optimizations in GPGPUs," International Symposium on Computer Architecture, pp. 487-498, June 2013.
[15] NVIDIA Corporation. (2014) Parallel Programming and Computing Platform | CUDA | NVIDIA. [Online]. http://www.nvidia.com/object/cuda_home_new.html
[16] J. Power, J. Hestness, M. Orr, M. Hill, and D. Wood, "gem5-gpu: A Heterogeneous CPU-GPU Simulator," Computer Architecture Letters , p. 1, 2014.
[17] John A. Stratton, Sam S. Stone, and Wen-mei W. Hwu, "MCUDA: An Efficient Implementation of CUDA Kernels for Multi-core CPUs," Languages and Compilers for Parallel Computing, pp. 16-30, 2008.
[18] R. Ubal, J. Sahuquillo, S. Petit, and P. Lopez, "Multi2Sim: A Simulation Framework to Evaluate Multicore-Multithreaded Processors," Computer Architecture and High Performance Computing, pp. 62-68, 2007.
[19] Z. Wang et al., "COREMU: a scalable and portable parallel full-system emulator," Principles and Practice of Parallel Programming, pp. 213–222, 2011.