研究生: |
陳聿廣 Chen, Yu-Guang |
---|---|
論文名稱: |
耐用晶片設計-對於先進製程設計成本與可靠度之方法研究 Live Free or Die Hard - Leveraging Design Cost and Reliability for Modern Design Techniques |
指導教授: |
張世杰
Chang, Shih-Chieh |
口試委員: |
陳宏明
Chen, Hung-Ming 黃婷婷 Hwang, Ting-Ting 史弋宇 Shi, Yiyu 黃世旭 Huang, Shih-Hsu |
學位類別: |
博士 Doctor |
系所名稱: |
電機資訊學院 - 資訊工程學系 Computer Science |
論文出版年: | 2016 |
畢業學年度: | 104 |
語文別: | 英文 |
論文頁數: | 112 |
中文關鍵詞: | 低功耗設計 、多位元回復暫存器 、可靠度 、三維晶片 、矽穿孔 、動態電壓條變 |
外文關鍵詞: | Low Power Design, Multi Bit Retention Register, Relibility, 3D-IC, Through Silicon Via, Dynamic Voltage Scaling |
相關次數: | 點閱:4 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
隨著製程的進步,現今的晶片設計往往能達到體積小、功能複雜且效能優越等特性,且被廣泛地運用於可攜帶式裝置如遠端感測器、智慧可攜式裝置等。然而,考量其有限的電源供應,低功耗設計成為實現此類應用不可或缺的技術。另一方面,為達到低功耗設計,晶片的可靠度往往成為被犧牲的對象,造成的潛在的可靠度問題。因此,適當的槓桿低功耗及可靠度成為一個重要的探討課題。在此篇論文中,我們針對三種常見的低功耗晶片設計方法—電源閘控、三維晶片、以及動態電壓/頻率調變加以探討,指出這些設計方法可能造成的潛在性可靠度問題,並提出創新且有效率的方法來槓桿設計成本與晶片可靠度,以期在不增加太多成本的前提下,大幅度提升晶片的可靠度。
第一部分,我們針對電源閘控設計中暫存器資料流失的問題提出探討。在過去的做法中,為確保暫存器資料不會因為電源閘控作用而消失,掃描鏈或回覆暫存器被廣泛的使用來保存暫存器資料。然而,前者將大大增長資料傳輸的時間,而後者則會增加約20%的設計面積。在此論文中,我們提出了多位元回復暫存器的概念與設計,以及布局多位元回復暫存器的方法。多位元回復暫存器結合了傳統兩種方法的好處,能達到減少傳輸時間且降低額外設計面積的好處,並能更進一步節省功耗。實驗結果顯示與傳統的單位元回復暫存器相比,我們平均能節省84%漏電流。
第二部分,我們針對現今三維晶片堆疊方法的可靠度問題提出深入的研究。三維晶片堆疊方法在提出後,因其能達到高速度與低功耗等優勢,近幾年被廣泛的研究與應用。其中,矽穿孔(TSV)為實現三維晶片不可或缺的元件,然而在現今的製程中,矽穿孔良率過低對三維晶片的可靠度造成非常大的威脅。因此,在此論文中,我們針對冗餘矽穿孔的布局進行研究,在考量良率與時間需求下,優化冗餘矽穿孔的數量與布局方式,來降低額外增加的面積。實驗結果顯示我們可以達到61%的額外面積節省率。
第三部分,在現今的低功耗設計中,動態電壓/頻率調變已經廣泛的被應用於晶片上,藉由不同的工作效能需求來動態調整工作電壓/頻率來節省功耗。然而,過度的降低電壓將會導致可靠度的問題,進而造成功能的錯誤。因此,如何動態決定最適當的工作電壓成為實現動態電壓/頻率調變很重要的一環。在此研究中,我們提出了兩種不同的方式,聯合機率分布(JPDF)以及Q-學習方法,來動態決定最佳工作電壓。實驗結果顯示,與傳統stepping-based 的方法比較,我們可以達到89.3%的功耗減少。
As CMOS technology continuous to scale down, Integrated circuits (ICs) have been wieldy applied to various applications such as remote sensors and smart portable devices due to the benefits of smaller area, higher complexity, and high performance. On the other hand, the limited power budget makes low power design as an indispensable part. However, the existing low power design schemes may cause reliability issues and threaten the integrity of the ICs, which comes up with an open space for further investigation. In this dissertation, we address three major reliability challenges for modern low power IC designs, i.e., power gating design, 3D-ICs, and dynamic voltage/frequency scaling, and propose corresponding efficient and intelligent methods to address the problems to gracefully trade-off between design cost and reliability.
First, retention registers have been widely used in power gated designs to store data during sleep mode. However, their excessive area and leakage power render it imperative to minimize the total retention storage size. The current industry practice replaces all registers with single-bit retention ones, which significantly limits the design freedom and yields sub-optimal designs. Towards this, for the first time in literature, we propose the concept and the design of multi-bit retention registers, with which only selected registers need to be replaced. The technique can significantly reduce the number of bits that need to be stored and thus the leakage power, but needs several clock cycles for mode transition. In addition, an efficient assignment algorithm is developed to minimize the total retention storage size subject to mode transition latency constraint. Experimental results show that our framework on average can reduce the leakage power in sleep mode by 84% along with additional mode transition latency of 6 to 11 clock cycles, compared with the single-bit retention register based design.
Second, in three-dimensional integrated circuits (3D ICs), Through Silicon Via (TSV) is a critical enabling technique to provide vertical connections. However, it may suffer from many reliability issues such as undercut, misalignment or random open defects. Various fault-tolerance mechanisms have been proposed in literature to improve yield, at the cost of significant area overhead. In this part, we focus on the structure that uses one spare TSV for a group of original TSVs, and study the optimal assignment of spare TSVs under yield and timing constraints to minimize the total area overhead. We show that such problem can be modeled as a constrained graph decomposition problem. Two efficient heuristics are further developed to address this problem. Experimental results show that under the same yield and timing constraints, our heuristic can reduce the area overhead induced by the fault-tolerance mechanisms by up to 61%, compared with a seemingly more intuitive nearest-neighbor based heuristic.
Third, Dynamic voltage scaling (DVS) has been widely used to suppress power consumption in modern designs. The decision of optimal operating voltage at runtime should consider the variations in workload, process as well as environment. As these variations are hard to predict accurately at design time, various deterministic and reinforcement learning based DVS schemes have been proposed in the literature. However, none of them can be readily applied to designs with graceful degradation, where timing errors are allowed with bounded probability to trade for further power reduction. In this part, we propose JPDF based and Q-learning based DVS scheme dedicated to the designs with graceful degradation. We compare it with deterministic DVS schemes, i.e., a stepping based scheme. Experimental results on three 45nm industrial designs show that the proposed Q-learning based scheme can achieve up to 83.9% power reduction with 0.01 timing error probability bound. To the best of the authors’ knowledge, this is the first in-depth work to explore reinforcement learning based DVS schemes for designs with graceful degradation.
[1] N. AbouGhazaleh, A. Ferreira, C. Rusu, R. Xu, F. Liberato, B. Childers, D. Mosse, R. Melhem, “Integrated CPU and l2 cache voltage scaling using machine learning,” in Proc. of conference on Languages, compilers, and tools for embedded systems, pp.41-50, 2007.
[2] T. Austin, V. Bertacco, D. Blaauw and T. Mudge, “Opportunities and Challenges for Better than Worst-Case Design,” in Proc. of Asia and South Pacific Design Automation Conf. (ASP-DAC), pp. 2–7, 2005.
[3] P. Ashar, and S. Malik, “Implicit Computation of Minimum-Cost Feedback-Vertex Sets for Partial Scan and Other Applications”, in Proc. Design Automation Conference (DAC), pp77-80, 1994.
[4] J. Burns, L. Mcllrath, C. Keast, C. Lewis, A. Loomis, K. Warner and P. Wyatt, “Three-Dimensional Integrated Circuit for Low Power, High-Bandwidth Systems on a Chip,” in Proc. of IEEE International Solid-State Circuits Conference (ISSCC), pp. 268-269, 2001.
[5] F.X. Che, H.Y. Li, Xiaowu Zhang, S. Gao, and K.T. Teo, “Wafer level warpage modeling methodology and characterization of TSV wafers,” in Proc. of IEEE Electronic Components and Technology Conference (ECTC), pp.1196-1203, June 2011.
[6] Y. -G Chen, Y. Shi, K. -Y Lai, H. Geng and S. -C Chang, "Efficient multiple-bit retention register assignment for power gated design: Concept and algorithms", in Proc. of International Conference on Computer-Aided Design (ICCAD), pp.309-316, Nov.2012.
[7] Y. -G Chen, H. Geng, K. -Yu Lai, Y. Shi, S. -C Chang, “Multibit Retention Registers for Power Gated Designs: Concept, Design, and Deployment”, in IEEE Trans. on CAD of Integrated Circuits and Systems (TCAD), Volume 33, Issue 4, pp.507-518, 2014.
[8] Y.-G. Chen, K.-Y Lai, M.-C Lee, Y. Shi, W.-K. Hon, S.-C Chang, “Yield and timing constrained spare TSV assignment for three-dimensional integrated circuits,” in Proc. of Automation & Test in Europe Conference & Exhibition (DATE), pp.1-4, 2014.
[9] Y.-G. Chen, W.-Y Wen, Y. Shi, W.-K. Hon, S.-C. Chang, “Novel Spare TSV Deployment for 3-D ICs Considering Yield and Timing Constraints,” in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD) 34(4): 577-588, 2015
[10] Y. -G. Chen, T. Wang, K.Y. Lai, W.Y. Wen, Y. Shi, and S.C. Chang, “Critical path monitor enabled dynamic voltage scaling for graceful degradation in sub-threshold designs,” in Proc. of ACM/EDAC/IEEE Design Automation Conference (DAC), pp.1-6, June 2014.
[11] Y.-G. Chen, W.-Y. Wen, T. Wang, Y. Shi, S.-C. Chang, “Q-Learning Based Dynamic Voltage Scaling for Designs with Graceful Degradation,” in Proc. of International Symposium on Physical Design (ISPD), pp.41-48, 2015.
[12] M. Cho, H. Ren, H. Xiang, R. Puri, “History-based VLSI legalization using network flow”, in Proc. of ACM/IEEE Design Automation Conference (DAC), pp. 286-291, June 2010.
[13] E. Choi, C. Shin, T. Kim, and Y. Shin, “Power-Gating-Aware High-Level Synthesis”, in Proc. Low Power Electronics and Design (ISLPED), pp.39-44, 2008.
[14] J. Cong, G. Luo, Y. Shi, “Thermal-aware cell and through-silicon-via co-placement for 3D ICs”, in Proc. of ACM/EDAC/IEEE Design Automation Conference (DAC), pp.670-675, June 2011.
[15] A. Darbari, B. M. Al-Hashimi, D. Flynn, and J. Biggs, “Selective State Retention Design using Symbolic Simulation“ in Proc. Design, Automation & Test in Europe Conference & Exhibition (DATE), pp. 1644-1649, 2009.
[16] W. R. Davis, J. Wilson, S. Mick, J. Xu, “Demystifying 3D ICs: The Pros and Cons of Going Vertical,” in IEEE Design Test of Computer, vol. 22, no. 6, pp. 498-510, Nov./Dec., 2005.
[17] A. Drake, R. Senger, H. Deogun, G. Carpenter, S. Ghiasi, T. Nguyen, N. James, M. Floyd, and V. Pokala, “A Distributed Critical-Path Timing Monitor for a 65nm High-Performance Microprocessor,” in IEEE International Solid-State Circuits Conference (ISSCC), pp.398–399, 2007.
[18] J. Edmonds, “Paths, trees, and flowers”, Canadian Journal of mathematics, 17, pp.449-467, 1965.
[19] R. Glang, “Deffect size distrubution in VLSI chips”, in Proc. of International Conference on Microelectronic Test Structures (ICMTS), pp.57-60, March 1990.
[20] M.E. Gomez, V. Santonja, "Self-similarity in I/O workload: analysis and modeling", in Proc. of Workload Characterization: Methodology and Case Studies, pp.97 - 104, 1999.
[21] M.S. Gupta, K.K. Rangan, M.D. Smith, G.Y. Wei, and D. Brooks, “DeCoR: A Delayed Commit and Rollback mechanism for handling inductive noise in processors,” in IEEE International Symposium on High Performance Computer Architecture (HPCA), pp.381–392, Feb 2008.
[22] A.-C. Hsieh, T.-T. Huang, M.-T. Chang, M.-H. Tsai, C.-M. Tseng and H.-C. Li, “TSV redundancy: architecture and design issues in 3D IC”, in Proc. of Design, Automation & Test in Europe Conference & Exhibition (DATE), pp.166-171, March 2010.
[23] H.-Y. Huang, Y.-S. Huang and C.-L. Hsu, “Built-in self-test/repair scheme for TSV-based three-dimensional integrated circuits”, in Proc. of IEEE Asia Pacific Conference on Circuits and Systems (APCCAS), pp.56-59, Dec. 2010.
[24] Iris H. R. Jiang, C. L. Chang, and Y. M. Yang, “INTEGRA: Fast Multibit Flip-Flop Clustering for Clock Power Saving”, in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), Volumn31, Issue 2, pp.192-204, Feb. 2012.
[25] H. Jiang, M. Marek-Sadowska, S. R. Nassif, “Benefits and costs of power-gating technique”, in Proc. of IEEE International Conference on Computer Design: VLSI in Computers and Processors (ICCD), pp.559-566, 2005.
[26] L. Jiang, Q. Xu, and B. Eklow, “On effective TSV repair for 3D-stacked ICs”, in Proc. of Design, Automation & Test in Europe Conference & Exhibition (DATE), pp.793-798, 2012.
[27] H. Jiao and V. Kursun, “High-Speed and Low-Leakage MTCMOS Memory Registers“ in Proc. Quality Electronic Design (ASQED), 2010, pp. 17-22.
[28] M. Jung, J. Mitra, David Z. Pan, and S. K. Lim, “TSV stress-aware full-chip mechanical reliability analysis and optimization for 3D IC”, in Proc. of Design Automation Conference (DAC), pp. 188-193, June 2011.
[29] U. Kang and et al., “8Gb 3D DDR3 DRAM using through-silicon-via technology”, in Proc. of IEEE International Solid-State Circuits Conference (ISSCC), pp130-131, 131a, Feb. 2009.
[30] M. Keating, D. Flynn, R. Aitken, A. Gibbons, and K. Shi, Low Power Methodology Manual, For System-on-Chip Design, Springer, 2007.
[31] D. H. Kim, S. K. Lim and et al., “3D-MAPS: 3D Massively parallel processor with stacked memory”, in Proc of IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC), pp. 188-190, Feb. 2012
[32] D. H. Kim, R. O. Topaloglu, and S. K. Lim, “TSV density-driven global placement for 3D stacked ICs,” in Proc. of International SoC Design Conference (ISOCC), pp.135-138, Nov. 2011
[33] H. Kim, K. Athikulwongse, and S. K. Lim, “A study of Through-Silicon-Via impact on the 3D stacked IC layout”, in Proc. of Design Automation Conference (DAC), pp. 674-680, Nov. 2009.
[34] H. Kim, S. Mukhopadhyay, and S. K. Lim, “TSV-aware interconnect length and power prediction for 3D stacked ICs”, in Proc. of International Interconnect Technology Conference (IITC), pp.26-28, June 2009.
[35] H.-O. Kim and Y. Shin, “Semicustom design methodology of power gated circuits for low leakage applications,” in IEEE Transactions on Circuits and Systems II: Express Briefs, Vol. 54, No. 6, pp. 512–516, June 2007.
[36] M. Laisne, K. Arabi and T. Petrov, “System and methods utilizing redundancy in semiconductor chip interconnects”, United State Patent Application Publication. US 2010/0060310 A1 (Mar. 11, 2010).
[37] J. H. Lau, “TSV manufacturing yield and hidden costs for 3D IC integration”, in Proc. of Electronic Components and Technology Conference (ECTC), pp. 1031-1042, June 2010.
[38] C.R. Lefurgy, A.J. Drake, M.S. Floyd, M.S. Allen-Ware, B. Brock, J.A. Tierno, and J.B. Carter, “Active management of timing guardband to save energy in POWER7,” in Proc. of IEEE/ACM International Symposium on Microarchitecture, pp.1–11, 2011.
[39] J. M. Levine, E. Stott and P. Y. K. Cheung, “Dynamic Voltage & Frequency Scaling with Online Slack Measurement,” in Proc. of ACM/SIGDA international symposuium on Fild-Programmable Gate Arrays, pp.65-74, 2014.
[40] H. -M. Lin and J. -Y. Jou, “On computing the minimum feedback vertex set of a directed graph by contraction operations”, in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), Volume 19, Issue 3, pp.295-307, Mar 2000.
[41] M. P. H. Lin, C. C. Hsu and Y. T. Chang, “Post-Placement Power Optimization With Multi-Bit Flip-Flops”, in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD) , Volume 30, Issue 12,pp.1870-1882, Dec. 2011.
[42] I. Loi, S. Mitra, T. H. Lee, S. Fujita and L. Benini, “A low-overhead fault tolerance scheme for TSV-based 3D network on chip links”, in Proc. of International Conference on Computer-Aided Design (ICCAD), pp.598-602, Nov. 2008.
[43] C.-L Lung, Y.-S. Su, S.-H Huang, Yiyu Shi, and S.-C. Chang, “Fault-Tolerant 3D Clock Network”, in Proc. of ACM/EDAC/IEEE Design Automation Conference (DAC), pp.645-651, June 2011.
[44] C.-L. Lung, Y.-S. Su, H.-H. Huang, Y. Shi, and S.-C. Chang, "Through-Silicon Via Fault-Tolerant Clock Networks for 3D ICs," in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), vol. 32, no. 7, pp. 1100-1109, 2013.
[45] G. Luo, Y. Shi, and J. Cong, "An Analytical Placement Framework for 3D ICs and Its Extension on Thermal-Awareness," in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), vol. 32, no. 4, pp.510-523, 2013.
[46] H. Mahmoodi-Meimand and K. Roy, “Data-Retention Flip-Flops for Power-Down Applications”, in Proc. of IEEE Symposium on Circuit and System (ISCAS), pp. II - 677-80 Vol.2, 2004.
[47] M. Jung, J. Mitra, David Z. Pan, and S. K. Lim, “TSV stress-aware full-chip mechanical reliability analysis and optimization for 3D IC”, in Proc. of Design Automation Conference (DAC), pp. 188-193, June 2011.
[48] M. M. Ozdal, C. Amin, A. Ayupov, S. Burns, G. Wilke, C. Zhuo, "The ISPD -2012 Discrete Cell Sizing Contest and Benchmark Suite", in Proc. ACM International Symposium on Physical Design (ISPD), pp. 161-164, 2012.
[49] R.P. Pokala, R.A. Feretich and R.W. McGuffin, "Physical synthesis for performance optimization," in Proc of IEEE Internationa ASIC Conference and Exhibit, pp.34-37, 1992.
[50] J. Seomun and Y. Shin, “Self-Retention of Data in Power-Gated Design”, in Proc. of SoC Design Conference (ISOCC), pp.212-215, 2009.
[51] K. Shi, “Area and power-delay efficient state retention pulse-triggered flip-flops with scan and reset capabilities” in Proc. of International Conference on Computer Design (ICCD), pp.170-175, Oct. 2008.
[52] Y. T. Shyu, J. M. Lin, and C. W. Lin, “Effective and Efficient Approach for Power Reduction by Using Multi-Bit Flip-Flops” in IEEE Transactions on Very Large Scale Integration Systems (TVLSI),Volume 1, Issue 4,pp.624-635, April 2013.
[53] R.S. Sutton and A.G. Barto, “Reinforcement Learning. An Introduction”, MIT Press, Cambridge, MA, 2000.
[54] B. Swinnen, W. Ruythooren, P. D. M. L. Bogaerts, L. Carbonell, K. D. Munck, B. Eyckens, S. Stoukatch, Tezcan, D. Sabuncuoglu, Z. Tokei, J. Vaes, J. V. Aelst, and E. Beyne, “3D integration by cu-cu thermocompression bonding of extremely thinned bulk-si die containing 10 μm pitch through-si vias”, in Proc. of International Electron Devices Meeting (IEDM), pp. 1-4, Dec. 2006.
[55] J. N. Tsitsiklis. Asynchronous stochastic approximation and Q-learning. Machine Learning, 16(3):185–202, 1994.
[56] F. Vandeputte and L. Eeckhout, “Finding Stress Patterns in Microprocessor Workloads,” in Proc. of International Conference on High Performance Embedded Architectures and Compilers, pp.153-167, 2009.
[57] D. Velenis, E. J. Marinissen and E. Beyne, “Cost effectiveness of 3D integration options”, in Proc. of IEEE International 3D Systems Integration Conference (3DIC), pp.1-6, Nov. 2010.
[58] S.H. Wang, Y.Y. Liang, T.Y. Kuo and W.K. Mak, “Power-Driven Flip-Flop Merging and Relocation”, in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), Volume 31, Issue 2,pp.180-191, Feb. 2012.
[59] C. J. C. H. Watkins and P. Dayan, “Q-Learning”, in Proc. Machine Learning, pp.279-292, 1992.
[60] S. Yang, B. M. Al-Hashimi, D. Flynn, and S. Khursheed, “Scan Based Methodology for Reliable State Retention Power Gating Design”, in Proc. Design, Automation & Test in Europe Conference & Exhibition (DATE), pp.69-74, 2010.
[61] F. Ye and K. Chakrabarty, “TSV open defects in 3D integrated circuits: Characterization, test, and optimal spare allocation”, in Proc. of ACM/IEEE Design Automation Conference (DAC), pp. 1024-1030, June 2012.
[62] L. Yuan and G. Qu, “Analysis of energy reduction on dynamic voltage scaling-enabled systems,” in Proc. of IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), Vol. 24, pp.1827–1837, 2005.
[63] Y. Zhao, S. Khursheed, and B. M. Al-Hashimi, “Cost-Effective TSV Grouping for Yield Improvement of 3D-ICs”, in Proc. of Asian Test Symposium (ATS), pp.201-206, Nov. 2011.
[64] Cadance. [Online]. Available: http://www.cadence.com/products/cic/spectre_circuit/pages/default.aspx