簡易檢索 / 詳目顯示

研究生: 蘇祐世
Su, Yu-Shih
論文名稱: Variability and Reliability Aware Performance & Timing Optimization of VLSI Designs
考慮變動及可靠度之大型積體電路效能及時序最佳化
指導教授: 張世杰
Chang, Shih-Chieh
口試委員:
學位類別: 博士
Doctor
系所名稱: 電機資訊學院 - 資訊工程學系
Computer Science
論文出版年: 2009
畢業學年度: 97
語文別: 英文
論文頁數: 88
中文關鍵詞: VariabilityVLSIPerformance
相關次數: 點閱:2下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 在先進製程 (90nm)技術下,變動性(variability)—如製程、電壓、和溫度變化(PVT variation)以及設計中多種不同運作模式和限制,已經是造成晶片設計發生錯誤的重要原因。由於製程、電壓、和溫度變化所造成的時序(timing)不確定,會造成晶片設計不符合所要求的時序規格(timing specification)。並且,要設計出一個唯一的晶片佈局(Layout)來同時符合先進設計中所存在的多種運作模式下的眾多設計限制,非常的困難。
    在本論文中,我們主要提出三個方法,在考慮變動性下,對晶片設計的可靠性和時序做最佳化。首先,我們提出一個可偵測時序錯誤的架構(TED)來偵測並修復因為製程、電壓、和溫度變化所造成的時序錯誤(timing error)。我們的架構可以藉由容忍時序變化(timing variation)的特性來增加設計上的可靠度,並且不會有之前相關架構所存在的短路徑(short path)問題。
    接著,傳統上設計時序的最佳化,通常考慮設計中可能發生的最糟情況(worst-case)。但是這些最糟情況在運作中非常少發生,這導致傳統最佳化的方法在資源運用上並不有效。在本論文中,我們跟據可變動時脈設計(variable latency design)的方式,提出一個再次和成(re-synthesis)的方法來增加設計的效能。
    最後,考慮複雜設計架構下的多個運作模式,建立一個唯一的時鐘架構(clock network)滿足所有運作模式下的限制非常困難。我們提出一個在原本時鐘架構中,插入可調變延遲裝置(adjustable delay buffer)的方式,藉由在不同運作模式下調整各可調變延遲裝置的延遲時間,來使得此時鐘架構可以同時滿足多個運作模式下的時鐘偏移限制(clock skew constraint)。我們提出一個決定各可調變延遲裝置延遲時間(delay assignment)的線性時間最佳化演算法,使得此時鐘架構在各運作模式下的時鐘偏移為最小。


    Under 90nm technologies, variability, such as PVT variation and multiple design modes/corners, have become one of leading causes for chip failures. The delay uncertainty raised by PVT variation may cause a design to fail its’ timing specification. In addition, it has becomes tremendously difficult to create a single layout that satisfies numerous constraints imposed by the multiple modes of operations. In this thesis, we propose three mechanisms for reliability and timing optimization considering variability. We first propose an error-detecting architecture (TED) to detect and correct timing errors caused by PVT variation. This TED architecture can tolerate delay variation and enhance timing reliability without the short path problem in previous works. Second, the traditionally strategy of optimizing a circuit for the rarely activated worst-cases conditions could lead to inefficient resource use. We propose a re-synthesis method based on variable-latency design style to improve design performance. Finally, considering multiple design modes, building a single clock network to satisfy all constraints in each mode is difficult. We insert adjustable delay buffers (ADB) whose delay can be tuned in each mode into clock tree to simultaneously satisfy multiple clock skew constraints. We propose a linear-time optimal algorithm which assigns the values of ADBs so that the skew is optimal among all assignments. In addition, to obtain the accurate probability information of rare activation, we also propose an efficient approach to find the timing distribution of a circuit.

    Abstract ...2 Content ...3 List of Figures...5 List of Tables ...7 Chapter 1 Introduction ...8 1.1 Variability & Reliability ...8 1.2 Organization ...9 1.2.1 Synthesis of a Novel Timing-Error Detection Architecture ...10 1.2.2 An Efficient Mechanism for Performance Optimization of Variable-Latency Designs ...13 1.2.3 Value Assignment of Adjustable Delay Buffers for Clock Skew Minimization in Multi-Voltage Mode Designs ...17 1.2.4 Efficient Calculation of Timed Cumulative Probability Density Function ...19 Chapter 2 Synthesis of a Novel Timing-Error Detection Architecture ...21 2.1 The Timing-Error Detection (TED) Architecture ...21 2.2 Critical Activation Logic ...25 2.3 Algorithm for Choosing Backup Gates ...27 2.4 Experimental Results ...28 2.5 Summary ...32 Chapter 3 An Efficient Mechanism for Performance Optimization of Variable-Latency Designs ...33 3.1 Review of the Hold Logic ...33 3.1.1 Definition ...33 3.1.2 The non-exact hold logic in [7] ...34 3.2 A motivating example ...35 3.3 Our Approach ...36 3.3.1 Short path activation function (SPAF) ...36 3.3.2 Calculating SPAF ...40 3.4 Discussion of optimality ...43 3.5 Heuristic method for large circuits ...44 3.6 Selecting the timing constraint T for constructing the hold logic ...46 3.7 Experimental Results ...48 3.8 Summary ...51 Chapter 4 Value Assignment of Adjustable Delay Buffers for Clock Skew Minimization in Multi-Voltage Mode Designs ...52 4.1 Illustrative Example ...52 4.2 A Linear Time Optimal Assignment for Adjustable Delay Buffer ...53 4.2.1 Leaf Node Partitioning ...54 4.2.2 Skew-Optimal Linear Time Algorithm ...57 4.2.3 Calculation of Clock Skew Under Optimal Assignments ...63 4.2.4 Reduction of The Maximum Clock Latency ...64 4.3 Efficient Heuristic For ADB Position Selection ...66 4.3.1 Dealing with multiple modes of a clock tree ...68 4.4 Experimental Results ...68 4.5 Summary ...71 Chapter 5 Efficient Calculation of Timed Cumulative Probability Density Function ...72 5.1 Timed Characteristic Function ...72 5.2 Estimating Timed-CDF by TCF ...73 5.2.1 Avoiding unnecessary functional simulation of TCFs ...75 5.2.2 Sampling Selection of Time Instances ...76 5.3 Experimental Results ...77 5.4 Summary ...80 Chapter 6 Conclusion ...81 References ...82

    [1]A. Agarwal, V. Zolotov, and D. T. Blaauw, “Statistical timing analysis using bounds and selective enumeration.” IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Volume: 22, Issue: 9, 1243-1260. 2003.
    [2]A. Agarwal, D. Blaauw, V.Zolotov, S.Sundareswaran, M. Zhao, K. Gala, and R. Panda, “Path-based statistical timing analysis considering inter- and intra-die correlations,” In Proceedings of TAU (ACMIEEE workshop on timing issues in the specification and synthesis of digital systems), 16-21. 2002.
    [3]T. Austin, “Designing robust microarchitectures,” In Proceedings of Design Automation Conference. 78.2002
    [4]T. Austin, V. Bertacco, D. Blaauw, and T. Mudge, “Opportunities and Challenges for Better than Worst-Case Design,” in Proceedings of the ASP-DAC 2005. Asia and South Pacific, vol. 1, pp. I/2 - I/7, 18-21 Jan. 2005.
    [5]C. J. Alpert, A. Devgan, and S. T. Quay, “Buffer insertion with accurate gate and interconnect delay computation,” in Proc. of IEEE/ACM Design Automation Conference., pp. 479–484. 1999.
    [6]K. D. Boese and A. B. Kahng, “Zero-skew clock routing trees with minimum wire length,” in Proc. of 5th IEEE Int. Conf. ASIC, pp. 17–21. 1992.
    [7]L. Benini, G. De Micheli, A. Lioy, E. Macii, G. Odasso, and M. Poncino, “Automatic Synthesis of large telescopic units based on near-minimum timed supersetting,” IEEE Transaction on Computers, vol. 48, no. 8, pp. 769-779, Aug. 1999.
    [8]L. Benini, E. Macii, M. Poncino and G. De Micheli, “Telescopic Units: A New Paradigm for Performance Optimization of VLSI Designs,” IEEE Transaction on Computer-Aided Design of Integrated Circuits and Systems, vol. 17, no. 3, pp. 220-232, Mar. 1998.
    [9]L. Benini, E.Macii and M. Poncino, “Efficient Controller Design for Telescopic Units,” in Proceedings of the IEEE International Conference Innovative Systems in Silicon, pp.290-299, Oct. 1997.
    [10]F. Brglez, “On testability of combinational networks,” in Proc. Int Symp. Circuits and Systems, pp. 221–225, 1984.
    [11]H.-C. Chen, and D. Du, “Path Sensitization in Critical Path Problem,” IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 12, no. 2, pp. 196-207, Feb. 1993.
    [12]T. H. Chao, Y. C. Hsu, J. M. Ho, K. D. Boese, and A. B. Kahng, “Zero skew clock net routing,” IEEE Trans. Circuits Syst. II, vol. 39, no. 11, pp. 799–814, Nov. 1992.
    [13]J. Cong, Z. Pan, L. He, C.-K. Koh, and K.-Y. Khoo, “Interconnect design for deep submicron ICs,” in Proc. of IEEE/ACM Int. Conf. on Computer-Aided Design, pp. 478–485. 1997.
    [14]J. Cong, C. Koh, and K. Leung, “Simultaneous buffer and wire sizing for performance and power optimization,” in Proc. of Int. Symposium on Low Power Electron. Design, pp. 271–276, 1996.
    [15]C.-P Chen, C. C. N. Chu, and D. F. Wong, “Fast and exact simultaneous gate and wire sizing by Lagrangian relaxation,” in Proc. of IEEE/ACM Int. Conf. on Computer-Aided Design, pp. 617–624, 1998.
    [16]David Chinnery and Kurt Keutzer, Closing the Gap Between ASIC & Custom, 2008
    [17]S. Dhar, D. Maksimovic, and B. Kranzen, “Closed-Loop Adaptive Voltage Scaling Controller For Standard-Cell ASICs.” Int’l Symposium on Low Power Electronics and Design (ISLPED), 2002.
    [18]D. Ernst , N. S. Kim, S. Das, S. Pant, R. Rao, T. Pham, C. Ziesler, D. Blaauw, T. Austin, K. Flautner, and T. Mudge, “Razor: A Low-Power Pipeline Based on Circuit-Level Timing Speculation.” In 36rd Annual International Symposium on Microarchitecture( MICRO-36), 7-18. 2003.
    [19]R. Ellis and D. Gulick, Calculus with Analytic Geometry Fort Worth, TX: Saunders, 1994.
    [20]Rajamohana Hegde and Naresh R. Shanbhag,“Energy-efficient signal processing via algorithmic noise-tolerance.” In Proceedings of the International Symposium on Low Power Electronics and Design. 1999
    [21]C. Hare. 586/686 Processors Chart. Feb 2002.
    [22]C. Hare. 786 Processors Chart. Feb 2002.
    [23]J. A. G. Jess, K. Kalafala, S. R. Naidu, R. H. J. M. Otten, and C. Visweswariah, “Statistical timing for parametric yield prediction of digital integrated circuits.” In Proceedings of Design Automation Conference, 932-937. 2003.
    [24]T. Kehl, “Hardware Self-Tuning and Circuit Performance Monitoring,” Int’l Conference on Computer Design (ICCD-93), 1993.
    [25]Euiseok Kim, Hiroshi Saito, Jeong-Gun Lee, Dong-Ik Lee, Hiroshi Nakamura, Takashi Nanya, “Performance Optimization of Synchronous Control Units for Datapaths with Variable Delay Arithmetic Units,” In Proceedings of the ASP-DAC 2003. Asia and South Pacific, pp. 816-819, 21-24 Jan. 2003.
    [26]Yu-Min Kuo, Yue-Lung Chang, Shih-Chieh Chang, “Efficient Boolean Characteristic Function for Fast Timed ATPG,” ICCAD ’06, November 5-9, 2006.
    [27]N. A. Kurd, J. S. Barkatullah, R. O. Dizon, T. D. Fletcher, and P. D. Madland, “A multigigahertz clocking scheme for Pentium 4 Microprocessor,” IEEE J. Solid-State Circuits, vol. 36, pp. 1647–1653, Nov. 2001.
    [28]Vishal Khandelwal and Ankur Srivastava, “Variability-Driven Formulation for Simultaneous Gate Sizing and Post-Silicon Tunability Allocation,” in Proc. of Int. Symposium on Physical design, pp. 11-18, 2007.
    [29]R. Kay, G. Bucheuv, and L. T. Pileggi, “EWA: Exact wiring-sizing algorithm,” in Proc. of Int. Symposium on Physical Design, pp. 178–185, 1997.
    [30]I.-M Liu, T.-L Chou, A. Aziz, and D. F. Wong, “Zero-skew clock tree construction by simultaneous routing, wire sizing, and buffer insertion,” in Proc. of Int. Symposium on Physical Design, pp. 33–38, 2000.
    [31]J. J. Liou, A. Krstic, Y. M. Jiang, and K. T. Cheng, “Modeling, testing, and analysis for delay defects and noise effects in deep submicron devices.” IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Volume: 22 , Issue:6, 756 – 769. 2003.
    [32]G. Lakshminarayana, A. Raghunathan, K.S. Khouri, N.K. Jha, S. Dey, “Common-case computation: a high-level energy and performance optimization technique,“ IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 23 pp. 33-49, 2004.
    [33]P.C. McGeer and R.K. Brayton, Integration Functional and Temporal Domains in Logic Synthesis. Boston, Mass.: Kluwer Academic 1991.
    [34]P. C. McGeer, A. Saldanha, P. R. Stephan, R. K. Brayton, and A. L. Sangiovanni-Vicentelli, “Timing Analysis and Delay-Fault Test Generation using Path-Recursive Functions,” in Proceedings of the International Conference on Computer Aided-Design, 1991.
    [35]Patrick Mahoney, Eric Fetzer, Bruce Doyle, and Sam Naffziger. ”Clock distribution on a dual-core multi-threaded Itanium-family processor,” in Digest of technical papers of the international solid-state circuits conference, 2005.
    [36]MTEK Computer Consulting, AMD CPU Roster, Jan. 2002.
    [37]MTEK Computer Consulting, Intel CPU Roster, Jan. 2002.
    [38]Sani Nassif, “Delay Variability: Sources, Impacts and Treands”, in IEEE International Solid-State Circuits Conference, pp.368-369, 2000.
    [39]S.F. Oberman and M.J. Flynn, “Design Issues in Division and Other Floating-Point Operations,” IEEE Trans. Computers, vol. 46, no. 2, pp. 154-161, Feb. 1997.
    [40]T. Okamoto and J. Cong, “Buffered Steiner tree construction with wire sizing for interconnect layout optimization,” in Proc. of IEEE/ACM Int. Conf. on Computer-Aided Design, pp. 44–49, 1996.
    [41]Trevor Pering, Tom Burd, and Robert Brodersen, “The simulation and evaluation of dynamic voltage scaling algorithms”, in Proceedings of the international symposium on Low power electronics and design, pp. 76-81, 1998.
    [42]A. Rajaram and David Z. Pan, “Robust chip-level clock tree synthesis for SOC designs”, in Proc. of IEEE/ACM Design Automation Conference, pp.720-723, June 2008
    [43]N. V. Shenoy, et al, “Minimum Padding to Satisfy Short Path Constraints.” In Proceedings of the IEEE/ACM International Conference on Computer-Aided Design. ACM, New York, 156-161. 1993.
    [44]L. G. Silva, J. P. M. Silva, L. M. Silveira, and K. A. Sakallah, ”Satisfiability Models and Algorithms for Circuit Delay Computation,” ACM Transactions on Design Automation of Electronic Systems, vol. 7, no. 1, pp. 137-158, Jan. 2002.
    [45]Yu-Shih Su, Yi-Hsin Weng, Shih-Chieh Chang, “Efficient Calculation of Timed Cumulative Probability Density Function,” International Symposium on VLSI Design, Automation and Test, vol. 1, no. 4, pp. 25-27 April 2007.
    [46]S. S. Sapatnekar, “RC Interconnect optimization under the Elmore delay model,” in Proc. of IEEE/ACM Design Automation Conference, pp. 387–391, 1994.
    [47]Simon Tam, Stefan Rusu, Utpal Nagarji Desai, Robert Kim, Ji Zhang, and Ian Young. ”Clock generation and distribution for the first IA-64 microprocessor,” IEEE Journal of Solid-State Circuits, pages 35(11):1545–1552, Nov 2000.
    [48]Jeng-Liang Tsai, Tsung-Hao Chen, Charlie Chung-Ping Chen, “Zero skew clock tree optimization with buffer insertion/sizing and wire sizing,” IEEE Transactions on CAD, Vol. 23, No. 4, Jun. 2004.
    [49]J.-L. Tsai, L. Zhang, C. Chen, “Statistical timing analysis driven post-silicon-tunable clock-tree synthesis,” in Proc. of Int. Conf. on Computer-Aided Design, pp. 575-581, Nov. 2005.
    [50]E. Takahashi, Y. Kasai, M. Murakawa, and T. Higuchi, “A post-silicon clock timing adjustment using genetic algorithms,” in Digest of technical papers of the symp. on VLSI circuits, pages 13–16, 2003.
    [51]Jeng-Liang Tsai, DongHyun Baik, Charlie Chung-Ping Chen, and Kewal K. Saluja. “A yield Improvement methodology using pre- and post-silicon statistical clock scheduling,” in Proc. of IEEE/ACM Int. Conf. on Computer-Aided Design, pages 611–618, 2004.
    [52]A. Uht, “Uniprocessor Performance Enhancement Through Adaptive Clock Frequency Control.” International Conference on Advances in Infrastructure for e-Business, e-Education, e-Science, e-Medicine, and Mobile Technologies on the Internet (SSGRR 2003w). 2003.
    [53]C. Visweswariah, “Death, taxes and failing chips.” In Proceedings of Design Automation Conference. 343-347. 2003.

    無法下載圖示 全文公開日期 本全文未授權公開 (校內網路)
    全文公開日期 本全文未授權公開 (校外網路)

    QR CODE