簡易檢索 / 詳目顯示

研究生: 王泰元
Wang, Tai-Yuan
論文名稱: 利用常規式軌跡模擬器進行設計空間探索之精準度增進
Making Ordinary Trace-driven Simulators Accurate for Design Space Exploitation
指導教授: 金仲達
King, Chung-Ta
口試委員: 黃稚存
Chih-Tsun Huang
洪士灝
Shih-Hao Hung
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊工程學系
Computer Science
論文出版年: 2014
畢業學年度: 102
語文別: 英文
論文頁數: 22
中文關鍵詞: 軌跡式模擬設計效能晶片網路
外文關鍵詞: Trace based simulation, Design, Performance, on-chip networks
相關次數: 點閱:2下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 隨著晶片網路複雜度的增加,設計空間探索變得更加的困難。因此,軌跡式模擬器所能提供的模擬速度在設計空間探索當中是不可或缺的。然而,軌跡式模擬所產生的時序卻常常與實際情形相去甚遠,主要原因是在傳統的軌跡式模擬當中,時序是固定的,也無法基於模擬結果做動態的調節。具有處理相依性軌跡的軌跡式模擬器可以解決這個問題,但是這樣的模擬器需要緊密的連結輸入的軌跡,而許多傳統的模擬器以批次的方式模擬軌跡,若要達到上述的目的則必須對模擬器做許多更改及了解。
    這篇論文以一個迴路的方式解決這個問題,除了輸入軌跡格式外,不需要對模擬器有多餘的修正或了解,我們使用此方法模擬晶片系統網路,最高可以達到28.75%的精準度增進,相較於全系統模擬器也只有平均3.73%的誤差。


    The raising complexity of system architectures makes it more difficult to exploit the design space. For design space exploitation, trace-driven simulation is becoming indispensable due to its simplicity and speed. Unfortunately, the timing produced by trace-driven simulation is often inaccurate by a large margin. This is because the event timing in traditional trace-driven simulators is fixed and cannot be adjusted dynamically according to results out of the simulators. Dependency-aware simulators solve the accuracy problem, but they require close interactions between the input traces and the simulators. Many traditional simulators process trace events in batch, which require non-negligible modification efforts to make them interactive. This thesis solves the problem by adopting an iterative strategy. There is no need to modify the simulators, except perhaps in reading and reformatting the input traces. Our evaluations using Gem5 and BookSim for Network-on-Chip (NoC) simulation show that this technique can achieve up to 28.75% improvement in timing accuracy compared to the traditional trace-driven simulators, and result in only 3.73% error in average in terms of network latency compared to full-system simulators.

    Introduction 1-4 Related Work 5 Methodology 6-10 Evaluation 11-17 Conclusion 18

    [1] N. Agarwal, T. Krishna, Li-Shiuan Peh, and N.K. Jha. Garnet: A detailed on-chip network model inside a full-system simulator. In Performance Analysis of Systems and Software, 2009. ISPASS 2009. IEEE International Symposium on, pages 33–42, April 2009.
    [2] Jung Ho Ahn, Sheng Li, O. Seongil, and N.P. Jouppi. Mcsima+: A manycore simulator with application-level+ simulation and detailed microarchitecture modeling. In Performance Analysis of Systems and Software (ISPASS), 2013 IEEE International Symposium on, pages 74–85, April
    2013.
    [3] Nathan Binkert, Bradford Beckmann, Gabriel Black, Steven K. Reinhardt, Ali Saidi, Arkaprava Basu, Joel Hestness, Derek R. Hower, Tushar Krishna, Somayeh Sardashti, Rathijit Sen, Korey Sewell, Muhammad Shoaib, Nilay Vaish, Mark D. Hill, and David A. Wood. The gem5 simulator. SIGARCH Comput. Archit. News, 39(2):1–7, August 2011.
    [4] D. R. Butenhof. Programming with POSIX threads. 1997.
    [5] S. Fytraki and D. Pnevmatikatos. Resim, a trace-driven, reconfigurable ilp processor simulator.
    In Design, Automation Test in Europe Conference Exhibition, 2009. DATE ’09., pages 536–541, April 2009.
    [6] C. Grecu, A Ivanov, R. Pande, A Jantsch, E. Salminen, U. Ogras, and R. Marculescu. Towards open network-on-chip benchmarks. In Networks-on-Chip, 2007. NOCS 2007. First International Symposium on, pages 205–205, May 2007.
    [7] B. Grot, J. Hestness, S.W. Keckler, and O. Mutlu. Kilo-noc: A heterogeneous network-on-chip architecture for scalability and service guarantees. In Computer Architecture (ISCA), 2011 38th Annual International Symposium on, pages 401–412, June 2011
    [8] John L. Henning. Spec cpu2006 benchmark descriptions. SIGARCH Comput. Archit. News, 34(4):1–17, September 2006.
    [9] Joel Hestness, Boris Grot, and Stephen W. Keckler. Netrace: Dependency-driven trace-based network-on-chip simulation. In Proceedings of the Third International Workshop on Network on Chip Architectures, NoCArc ’10, pages 31–36, New York, NY, USA, 2010. ACM.
    [10] Nan Jiang, D.U. Becker, G. Michelogiannakis, J. Balfour, B. Towles, D.E. Shaw, J. Kim, and W.J. Dally. A detailed and flexible cycle-accurate network-on-chip simulator. In Performance
    Analysis of Systems and Software (ISPASS), 2013 IEEE International Symposium on, pages 86–96, April 2013.
    [11] Kiyeon Lee and Sangyeun Cho. In-n-out: Reproducing out-of-order superscalar processor behavior from reduced in-order traces. In Modeling, Analysis Simulation of Computer and Telecommunication Systems (MASCOTS), 2011 IEEE 19th International Symposium on, pages 126–135, July 2011.
    [12] Kiyeon Lee and Sangyeun Cho. Accurately modeling superscalar processor performance with reduced trace. J. Parallel Distrib. Comput., 73(4):509–521, April 2013.
    [13] Kiyeon Lee, S. Evans, and Sangyeun Cho. Accurately approximating superscalar processor performance from traces. In Performance Analysis of Systems and Software, 2009. ISPASS 2009. IEEE International Symposium on, pages 238–248, April 2009.
    [14] Chi-Keung Luk, Robert Cohn, Robert Muth, Harish Patil, Artur Klauser, Geoff Lowney, Steven Wallace, Vijay Janapa Reddi, and Kim Hazelwood. Pin: Building customized program analysis tools with dynamic instrumentation. In Proceedings of the 2005 ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI ’05, pages 190–200, New York, NY, USA, 2005. ACM.
    [15] Kevin Macdonald, Christopher Nitta, Matthew Farrens, and Venkatesh Akella. PDG_GEN: A methodology for fast and accurate simulation of on-chip networks. Computers, IEEE Transactions on, 63(3):650–663, March 2014.
    [16] MacSim. http://code.google.com/p/macsim/
    [17] S. Mahadevan, F. Angiolini, M. Storoaard, R.G. Olsen, J. Sparsoe, and J. Madsen. Network traffic generator model for fast network-on-chip simulation. In Design, Automation and Test in Europe, 2005. Proceedings, pages 780–785 Vol. 2, March 2005.
    [18] C. Nitta, K. Macdonald, M. Farrens, and V. Akella. Inferring packet dependencies to improve trace based simulation of on-chip networks. In Networks on Chip (NoCS), 2011 Fifth IEEE/ACM International Symposium on, pages 153–160, May 2011.
    [19] A Patel, F. Afram, Shunfei Chen, and K. Ghose. Marss: A full system simulator for multicore x86 cpus. In Design Automation Conference (DAC), 2011 48th ACM/EDAC/IEEE, pages 1050–1055, June 2011

    無法下載圖示 全文公開日期 本全文未授權公開 (校內網路)
    全文公開日期 本全文未授權公開 (校外網路)

    QR CODE