研究生: |
唐偉翔 Tang, WeiXIang |
---|---|
論文名稱: |
設計具延遲釋放的管線化Clos網路架構 Design a Pipelined Clos Network with Late Release Scheme for NoC |
指導教授: |
許雅三
Hsu, Yarsun |
口試委員: |
闕河鳴
李政崑 |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
論文出版年: | 2011 |
畢業學年度: | 100 |
語文別: | 英文 |
論文頁數: | 48 |
中文關鍵詞: | 克勞斯網路 、延遲釋放機制 、晶片網路 、管線化路由器 |
外文關鍵詞: | Clos Network, Late Release Scheme, NoC, Pipelined Router |
相關次數: | 點閱:2 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
As the number of processor on a single chip grows, communication efficiency may dominate the performance of parallel programming. Seeking for high throughput communication is a clear goal. In this paper, a novel 4-stage pipelined router is proposed for 3-stage Clos network and its corresponding network interface (NI). The proposed structure is built in DE3 and the performance is estimated using an in-house C++ simulator. To further improve the throughput, we propose a late release scheme (LRS) which reserves the allocated paths. The simulation result shows the throughput improvements are 9.42% and 42.91% under random and mixed traffic, respectively. The latency improvements are 5.1 and 2.53 under Jacobi linear equation simulation with 1k and 512 data sizes, respectively.
隨著單晶片中運算核心數目的成長,之間通訊效率漸為平行程式能的關鍵。找尋低延遲通訊方法儼然成為大家目標本篇論文提供了有效率multicast的Clos交換器管線化電路與其對應的傳收介面,此外我們利用時間的介面,此外我們利用時間的介面,此外我們利用時間的介面,此外我們利用時間的介面,此外我們利用時間的局部性進一步縮短傳輸的延遲,並提出延遲清除路徑的概念與其實現方法。
本論文以Altera DE3Altera DE3驗證功能性,並以驗證功能性,並以驗證功能性,並以C++作效能的評估。改進架構在作效能的評估。改進架構在作效能的評估。改進架構在Jacobi Jacobi Linear Equation Solver與一些合成的通訊都顯現出較低傳遞延遲與最大傳輸量的提升。
[1] C. Clos, “A study of non-blocking switching networks.” Bell System Technical Journal, pp. 406–424, Mar. 1953.
[2] Yuanyuan Yang, Jianchao Wang. “A more accurate analytical model on blocking probability of multicast networks.” IEEE Transaction on Communications, pp.1930-1936, Nov. 2000
[3] Yu-Hsiang Kao, Alfaraj N., Ming Yang, Chao, H.J. “Design of High-Radix Clos Network-on-Chip.” Fourth ACM/IEEE International Symposium on Networks-on-Chip (NOCS), pp.181-188, May 2010
[4] Oki, E., Zhigang Jing, Rojas-Cessa, R. Chao, H.J. "Concurrent Round-Robin-Based Dispatching schemes for Clos-network switches." ACM/IEEE Transactions on Networking, pp.830-844, Jan. 2003
[5] Kleban, J., Santos, H., "Packet Dispatching Algorithms with the Static Connection Patterns Scheme for Three-Stage Buffered Clos-Network Switches." IEEE International Conference on Communications (ICC’07), pp. 6319- 6324, Jun. 2007
[6] Harold S. Stone. “Parallel processing with the perfect shuffle.” IEEE Transactions on Computers, pp153-161, Feb. 1971
[7] Altera Corporation. "Straitix III Device Handbook." http://www.altera.com/literature/hb/stx3/stratix3_handbook.pdf
[8] Balaji Prabhakar, Nick McKeown, and Ritesh Ahuja. “Multicast Scheduling for Input-Queued Switches.” IEEE Journal on Selected Areas in Communications, pp.855-866, Jun. 1997
[9] Bin Tang. "On Multicast Scheduling and Routing in Multistage Clos Networks." The 3rd ACS/IEEE International Conference on Computer Systems and Applications, pp.73-80, Jun. 2005
[10] Altera Corporation. "Quartus II Handbook Version 10.1" http://www.altera.com/literature/hb/qts/quartusii_handbook.pdf
[11] Luca Benini and Giovanni De Micheli. "Networks on Chips: Technology and Tools" Elsevier Inc. pp.106-111, 2006
[12] Neil H.E. Weste, David Harris. "CMOS VLSI Design: A Circuits and Systems Perspective. 3rd Edition." P earson Education, Inc., p.460-461, 2005
[13] William James Dally and Brian Towles. "Principles and Practices of Interconnection Networks." Elsevier, Inc., 2004
[14] David A. Patterson; John L. Hennessy; “ Computer Organization and Design: the Hardware/Software Interface 4th edition.” Elsevier, Inc, 2009