研究生: |
陳姵伃 Chen, Pei-Yu |
---|---|
論文名稱: |
針對以MLC STT-MRAM作為主記憶體有著非對稱性寫入能源的系統所提出的重思考末級快取寫回策略 Rethinking Last-level-cache Write-back Strategy for MLC STT-MRAM Main Memory with Asymmetric Write Energy |
指導教授: |
石維寬
Shih, Wei-Kuan |
口試委員: |
張原豪
陳朝欽 衛信文 |
學位類別: |
碩士 Master |
系所名稱: |
|
論文出版年: | 2017 |
畢業學年度: | 105 |
語文別: | 英文 |
論文頁數: | 21 |
中文關鍵詞: | MLC STT-MRAM 、主記憶體 、省能源 、快取替換政策 |
外文關鍵詞: | MLC STT-MRAM, Main memory, Power saving, Cache replacement poicy |
相關次數: | 點閱:2 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
為了達成低能源消耗的需求,又因為MLC STT-MRAM有高記憶體單元密度、可與DRAM相匹敵的讀寫效能以及沒有更新耗能,所以它被廣泛地認為是在下一世代電腦架構中可以取代以DRAM為基底建構主記憶體的一個夠格的候選者。然而,MLC STT-MRAM在進行寫入操作時會消耗比DRAM更多的能源,這是因為有時候MLC STT-MRAM會需要兩個轉換步驟才能將原本儲存的資料改變成想要寫入的目標資料。也正因如此,當不同的bit patterns被寫入到記憶體單元時,MLC STT-MRAM所消耗的能源也不一樣。據我們所知,之前的研究為了達到減少主記憶體耗能,而在電腦系統裡利用STT-RAM作為主記憶體時幾乎都沒有考慮到非對稱性能源消耗的特性。因此,我們提出了一個energy-aware cache replacement policy,稱為energy-aware write-back strategy (EWS),它會將非對稱性能源消耗納入考慮,因此每當快取失誤發生時,它都能從中選出一個最適合的快取區塊從快取中踢除以作替換,也因此達到最小化系統能源消耗的目的。實驗結果顯示,跟LRU相比,我們所提出得解決方案平均來說可以減少23.9%能源消耗量。
To meet the requirement of low-power consumption, multi-level-cell STT-RAM (MLC STT-RAM) has been widely regarded as a potential candidate for replacing DRAM-based main memory in the next generation computer architectures because of its high cell density, comparable read/write performance and zero refresh power consumption. However, MLC STT-MRAM has higher power consumption than DRAM while a write operation is performed because MLC STT-MRAM sometimes needs to perform a two-step transition to change the original stored bits to another specific written bit patterns. As a result, MLC STT-MRAM has asymmetric power consumption while different bit patterns are written to a memory cell. To the best of our knowledge, a few or none of previous studies consider the feature of asymmetric power consumption to reduce main memory power consumption while STT-RAM is utilizes as a main memory in a computer system. Thus, this study proposes an energy-aware cache replacement policy, namely energy-aware write-back strategy (EWS), which consider asymmetric energy consumption to evict a proper cache block, so as to minimize system energy consumption. The experimental results show that the proposed solution reduces the energy consumption by 23.9% on average, compared with LRU.
[1] H. David et al. Memory Power Management via Dynamic Voltage/Frequency Scaling. In ICAC,2011.
[2] Q. Deng et al. MemScale: Active Low-power Modes for Main Memory. In ASPLOS, 2011.
[3] C. J. Xue, Y. Zhang, Y. Chen, G. Sun, J. J. Yang, and H. Li, "Emerging non-volatile memories: Opportunities and challenges," in Proc. 7th IEEE/ACM/IFIP CODES+ISSS, 2011, pp. 325–334.
[4] X. Wu, J. Li, L. Zhang, E. Speight, R. Rajamony, and Y. Xie, "Hybrid cache architecture with disparate memory technologies," in Proc. ISCA, 2009, pp. 34–45.
[5] J. Hu, C. J. Xue, Q. Zhuge, W.-C. Tseng, and E. H.-M. Sha, "Write activity reduction on non-volatile main memories for embedded chip multiprocessors," ACM Trans. Embed. Comput. Syst., vol. 12, no. 3, pp. 77:1–77:27, 2013.
[6] J. Hu, C. J. Xue, W.-C. Tseng, Y. He, M. Qiu, and E. H.-M. Sha, "Reducing write activities on non-volatile memories in embedded CMPs via data migration and recomputation," in Proc. 47th Design Autom. Conf., 2010, pp. 350–355.
[7] L. Jiang, B. Zhao, Y. Zhang, and J. Yang, "Constructing large and fast multi-level cell STT-MRAM-based cache for embedded processors," in Proc. 49th Annu. Design Autom. Conf., 2012, pp. 907–912.
[8] B. Zhao, J. Yang, Y. Zhang, Y. Chen, and H. Li, "Architecting a common-source-line array for bipolar non-volatile memory devices," in Proc. Conf. Design, Autom. Test Eur., 2012, pp. 1451–1454.
[9] Z. Sun, H. Li, Y. Chen, and X. Wang, "Voltage driven nondestructive self-reference sensing scheme of spin-transfer torque memory," IEEE Trans. Very Large Scale Integr. Syst., vol. 20, no. 11, pp. 2020–2030, Nov. 2012.
[10] Y. Chen, X. Wang, H. Li, H. Xi, Y. Yan, and W. Zhu, "Design margin exploration of spin-transfer torque ram (STT-RAM) in scaled technologies," IEEE Trans. Very Large Scale Integr. Syst., vol. 18, no. 12, pp. 1724–1734, Dec. 2010.
[11] S. Mittal, J. Vetter, and D. Li, "A survey of architectural approaches for managing embedded DRAM and non-volatile on-chip caches," Parallel and Distributed Systems, IEEE Transactions on, vol. 26, no. 6, pp. 1524–1537, June 2015.
[12] Q. Li and et. al. Compiler-Assisted STT-RAM-Based Hybrid Cache for Energy Efficient Embedded Systems. In IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, pages 1829 - 1840, 2014.
[13] S. M. Khan et al., "Decoupled dynamic cache segmentation," in Proc. HPCA, 2012, pp. 1–12.
[14] S. Bansal and Dharmendra S. Modha. CAR: Clock with Adaptive Replacement. In proceedings of the 3rd USENIX Conference on File and Storage Technologies, pages 187-200, 2004.
[15] S. Jiang and et. al. CLOCK-Pro: An Effective Improvement of the CLOCK Replacement. In Proceedings of the Annual Conference on USENIX Annual Technical Conference (ATEC), pages 35-35, 2005.
[16] T. Ishigaki, T. Kawahara, R. Takemura, K. Ono, K. Ito, H. Matsuoka, and H. Ohno. "A Multi-level-cell Spin-transfer Torque Memory with Series-stacked Magnetotunnel Junctions". In Symposium on VLSI Technology, pages 47 –48, Jun. 2010.
[17] X. Lou, Z. Gao, D. V. Dimitrov, and M. X. Tang. Demonstration of multilevel cell spin transfer switching in mgo magnetic tunnel junctions. Applied Physics Letters, 93(24):242502, 2008.
[18] N. Binkert, B. Beckmann, G. Black, S. K. Reinhardt et al., "The gem5 simulator," Computer Architecture News, vol. 39, no. 2, 2011.
[19] M. Poremba, T. Zhang, and Y. Xie, "NVMain 2.0: A User-friendly Memory Simulator to Model (Non-)Volatile Memory Systems," IEEE Computer Architecture Letters, pages 140-143, 2015.
[20] J. L. Henning, "Spec cpu2006 benchmark descriptions," Computer Architecture News, vol. 34, no. 4, 2006.