簡易檢索 / 詳目顯示

研究生: 柳孟芸
Liu, Meng-Yun
論文名稱: 生成用於功率優化的混合驅動多位元正反器
Generation of Mixed-Driving Multi-Bit Flip-Flops for Power Optimization
指導教授: 麥偉基
Mak, Wai-Kei
口試委員: 王廷基
Wang, Ting-Chi
陳勝雄
Chen, Sheng-Hsiung
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊工程學系
Computer Science
論文出版年: 2022
畢業學年度: 110
語文別: 英文
論文頁數: 36
中文關鍵詞: 多位元正反器混合驅動多位元正反器功率優化
外文關鍵詞: Mixed-Driving Multi-Bit Flip-Flops, Multi-Bit Flip-Flops, Power Optimization
相關次數: 點閱:2下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 多位元正反器通常用於減少時鐘接收器的數量,從而實現低功耗設計。傳統的 多位正反器由具有相同驅動強度的單位元正反器所組成。但是,如果多位元正反器中僅有一些位元違反時序約束,則必須調整多位元正反器 的大小或將其分解為更小的位寬組合以滿足時序,這將會降低省電的比率。
    在本文中,我們提出了一種新的多位元正反器生成方法,該方法考慮了混合驅動多位元正反器,其某些位元具有比其他位元更高的驅動強度。為了最大化正反器的合併率(最小化時鐘接收器的最終數量),我們的方法將首先在時序約束下執行激進的正反器合併。從某種意義上說,我們的合併是激進的,是因為我們可能會願意加大一些正反器的尺寸,以便盡可能地將正反器合併到具有相同驅動強度的多位元正反器中。
    過大的多位元正反器將在之後的階段在時序限制下而縮小,最終產生混合驅動的多位元正反器。
    此外,正反器們通過對齊的方法重新定位,以利用水平長時鐘引腳形狀來減少時鐘線長度,從而進一步降低時鐘功率。
    我們的生成方法已與商業佈局和佈線工具相結合,實驗結果表明,我們的方法在時鐘接收器的數量、正反器功率、時鐘緩衝器的數量以及時鐘樹線長優於僅考慮相同驅動多位元正反器的先前工作。


    Multi-bit flip-flops (MBFFs) are often used to reduce the number of clock sinks, resulting in a low-power design. A traditional MBFF is composed of individual FFs of uniform driving strength. However, if some but not all of the bits of an MBFF violate timing constraints, the MBFF has to be sized up or decomposed into smaller bit-width combinations to satisfy timing, which reduces the power saving. In this paper, we present a new MBFF generation approach considering mixed-driving MBFFs whose certain bits have a higher driving strength than the other bits. To maximize the FF merging rate (and hence to minimize the final amount of clock sinks), our approach will first perform aggressive FF merging subject to timing constraints. Our merging is aggressive in the sense that we are willing to possibly oversize some FFs in order to merge FFs into MBFFs of uniform driving strengths as much as possible. The oversized individual FFs of an MBFF will be later downsized subject to timing constraints by our approach, which results in a mixed-driving MBFF. Moreover, FFs and MBFFs are relocated through an alignment method to take advantage of the horizontal long clock pin shapes to reduce the clock wirelength for further clock power reduction. Our MBFF generation approach has been combined with a commercial place and route tool, and our experimental results show the superiority of our approach over a prior work that considers uniform-driving MBFFs only in terms of the clock sink count, the FF power, the clock buffer count, the routed clock wirelength, and the clock capacitance.

    誌謝 摘要 i Abstract ii 1 Introduction 1 1.1 Multi-bit Flip Flop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.2 Previous Works . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 1.3 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 1.4 Contribution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 1.5 Organization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 2 Preliminaries 7 2.1 MBFF Library . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 2.2 Problem Formulation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7 2.3 Flip Flop Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 3 Proposed Approach 9 3.1 Overall Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9 3.2 Compute Feasible Regions of Flip Flop . . . . . . . . . . . . . . . . . . . . . 11 3.3 Find all Maximal Cliques in Multiple Rectangle Intersection Graphs . . . . . . 12 3.4 Non-Conflicting MBFFs Generation . . . . . . . . . . . . . . . . . . . . . . . 13 3.5 Flip-Flop Alignment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17 3.6 Timing-Driven MBFF Sizing . . . . . . . . . . . . . . . . . . . . . . . . . . . 21 4 Experimental Results 23 4.1 Experimental Setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 4.2 Comparison with Mean Shift [1] . . . . . . . . . . . . . . . . . . . . . . . . . 24 4.2.1 The Results of Clock Sink Count, FF Area, and FF Power . . . . . . . 24 4.2.2 Clock Tree Synthesis Results . . . . . . . . . . . . . . . . . . . . . . . 25 4.3 Limited Selection of Mixed-Driving MBFFs . . . . . . . . . . . . . . . . . . . 27 4.4 Considering Signal Switching Power . . . . . . . . . . . . . . . . . . . . . . . 30 5 Conclusion 33 Bibliography 35

    [1] Y.-C. Chang, T.-W. Lin, I. H.-R. Jiang, and G.-J. Nam, “Graceful register clustering by
    effective mean shift algorithm for power and timing balancing,” Proceedings of the 2019
    International Symposium on Physical Design, p. 11–18, 2019.
    [2] S.-H. Wang, Y.-Y. Liang, T.-Y. Kuo, and W.-K. Mak, “Power-driven flip-flop merging
    and relocation,” IEEE Transactions on Computer-Aided Design of Integrated Circuits and
    Systems, pp. 180–191, 2012.
    [3] A. Kapoor, C. Groot, G. V. Piqué, H. Fatemi, J. Echeverri, L. Sevat, M. Vertregt, M. Meijer, V. Sharma, Y. Pu, and J. P. de Gyvez, “Digital systems power management for high
    performance mixed signal platforms,” IEEE Transactions on Circuits and Systems I: Regular Papers, pp. 961–975, 2014.
    [4] Y.-T. Chang, C.-C. Hsu, M. P.-H. Lin, Y.-W. Tsai, and S.-F. Chen, “Post-placement power
    optimization with multi-bit flip-flops,” 2010 IEEE/ACM International Conference on
    Computer-Aided Design (ICCAD), pp. 218–223, 2010.
    [5] W. Hou, D. Liu, and P.-H. Ho, “Automatic register banking for low-power clock trees,”
    2009 10th International Symposium on Quality Electronic Design, pp. 647–652, 2009.
    [6] D. Yi and T. Kim, “Allocation of multi-bit flip-flops in logic synthesis for power optimization,” 2016 IEEE/ACM International Conference on Computer-Aided Design (ICCAD),
    2016.
    [7] C.-C. Tsai, Y. Shi, G. Luo, and I. H.-R. Jiang, “Ff-bond: Multi-bit flip-flop bonding at
    placement,” Proceedings of the 2013 ACM International Symposium on Physical Design,
    p. 147–153, 2013.
    [8] M. P.-H. Lin, C.-C. Hsu, and Y.-C. Chen, “Clock-tree aware multibit flip-flop generation
    during placement for power optimization,” IEEE Transactions on Computer-Aided Design
    of Integrated Circuits and Systems, pp. 280–292, 2015.
    [9] I. H.-R. Jiang, C.-L. Chang, and Y.-M. Yang, “Integra: Fast multibit flip-flop clustering
    for clock power saving,” IEEE Transactions on Computer-Aided Design of Integrated
    Circuits and Systems, pp. 192–204, 2012.
    [10] Y.-T. Shyu, J.-M. Lin, C.-P. Huang, C.-W. Lin, Y.-Z. Lin, and S.-J. Chang, “Effective and
    efficient approach for power reduction by using multi-bit flip-flops,” IEEE Transactions
    on Very Large Scale Integration (VLSI) Systems, pp. 624–635, 2013.
    [11] S. S.-Y. Liu, W.-T. Lo, C.-J. Lee, and H.-M. Chen, “Agglomerative-based flip-flop merging and relocation for signal wirelength and clock tree optimization,” ACM Trans. Des.
    Autom. Electron. Syst., 2013.
    [12] I. Seitanidis, G. Dimitrakopoulos, P. M. Mattheakis, L. Masse-Navette, and D. Chinnery,
    “Timing-driven and placement-aware multibit register composition,” IEEE Transactions
    on Computer-Aided Design of Integrated Circuits and Systems, pp. 1501–1514, 2019.
    [13] A. B. Kahng, J. Li, and L. Wang, “Improved flop tray-based design implementation for
    power reduction,” 2016 IEEE/ACM International Conference on Computer-Aided Design
    (ICCAD), 2016.
    [14] G. Wu, Y. Xu, D. Wu, M. Ragupathy, Y.-y. Mo, and C. Chu, “Flip-flop clustering by
    weighted k-means algorithm,” 2016 53nd ACM/EDAC/IEEE Design Automation Conference (DAC), 2016.
    [15] S.-h. Chen, S.-h. Wang, W.-h. Chen, C.-y. KU, and H.-c. OU, “Integrated circuit and
    method of forming same and a system,” 2019. US Patent US10990745B2.
    [16] T. Kwon, M. Imran, D. Z. Pan, and J.-S. Yang, “Virtual-tile-based flip-flop alignment
    methodology for clock network power optimization,” IEEE Transactions on Very Large
    Scale Integration (VLSI) Systems, pp. 1256–1268, 2020.
    [17] M.-C. Kim, J. Hu, J. Li, and N. Viswanathan, “Iccad-2015 cad contest in incremental
    timing-driven placement and benchmark suite,” Proceedings of the IEEE/ACM International Conference on Computer-Aided Design, p. 921–926, 2015.
    [18] “Synopsys liberty,”
    [19] H. Imai and T. Asano, “Finding the connected components and a maximum clique of an
    intersection graph of rectangles in the plane,” Journal of Algorithms, pp. 310–323, 1983.
    [20] M. I. Shamos and D. Hoey, “Geometric intersection problems,” 17th Annual Symposium
    on Foundations of Computer Science (sfcs 1976), pp. 208–215, 1976.
    [21] “Cadence innovus,” 2020ver.
    [22] J. Chen, I. H.-R. Jiang, J. Jung, A. B. Kahng, V. N. Kravets, Y.-L. Li, S.-T. Lin, and
    M. Woo, “Datc rdf-2020: Strengthening the foundation for academic research in ic physical design,” Proceedings of the 39th International Conference on Computer-Aided Design,
    2020.
    [23] T. Ajayi, D. Blaauw, T. Chan, C. Cheng, V. Chhabria, D. Choo, M. Coltella, S. Dobre,
    R. Dreslinski, M. Fogaça, et al., “Openroad: Toward a self-driving, open-source digital
    layout implementation tool chain,” Proc. GOMACTECH, pp. 1105–1110, 2019.

    QR CODE