簡易檢索 / 詳目顯示

研究生: 王勁程
Wang, Ching-Cheng
論文名稱: 應用於單通量量子電路且考量時鐘樹的新型擺置演算法
A Novel Clock Tree Aware Placement Algorithm for Single Flux Quantum Logic Circuits
指導教授: 麥偉基
Mak, Wai-Kei
口試委員: 王廷基
Wang, Ting-Chi
陳宏明
Chen, Hung-Ming
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊工程學系
Computer Science
論文出版年: 2021
畢業學年度: 109
語文別: 英文
論文頁數: 34
中文關鍵詞: 標準元件擺置實體設計時鐘樹合成超導電路
外文關鍵詞: Placement, Physical Design, Clock Tree Synthesis, Superconducting Circuit
相關次數: 點閱:4下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 在單通量量子電路中,幾乎所有電路元件都需要接收時鐘信號,這會導致很的高時鐘網路繞線成本。此外,單通量量子電路的時鍾樹需要在每個樹分支節點插入時鐘分離器元件,這使得針對傳統的電路擺置設計流程和時鍾樹合成無效,導致無法獲得具有低時鐘偏斜的高質量時鍾樹。為了解決這些問題,我們提出了一種兩階段全局擺置方法和擺置合法化後的擺置優化演算法。我們的兩階段全局擺置方法首先應用傳統的全局擺置演算法將電路元件均勻地放置在給定的單通量量子電路設計中,接著進行時鍾樹合成和時鐘分配器插入,然後同時針對原始電路元件和時鐘分配器執行第二次全局擺置。在第二次全局擺置階段中,使用前瞻性合法化技術來擴散原始電路元件和時鐘分離器,並多次重新合成時鍾樹以獲得優化的時鍾樹拓撲,使得原始電路元件與時鐘分配器的重疊減少。此外,我們也同時優化了數據信號和時鐘信號的總線長。在所有單元的擺置合法化後,我們的擺置優化方法可以運行以進一步減少時鐘偏斜。與之前最先進的工作相比,我們平均可以將總半週線長度和時鐘偏斜分別減少 17% 和 30%。


    In a single-flux-quantum (SFQ) circuit, almost all cells need to receive the clock signal which incurs a high clock routing overhead. Besides, the clock tree of a SFQ circuit requires the insertion of a clock splitter cell at every tree branching point which renders the conventional design flow of placement followed by clock tree synthesis ineffective to obtain a high quality clock tree with low clock skew. To address these issues, we propose a two-stage global placement methodology and a placement refinement algorithm after placement legalization. Our two-stage global placement methodology first applies a conventional global placement algorithm to place the cells in the given SFQ circuit evenly, which is followed by clock tree synthesis and clock splitter insertion, and then performs a second stage of global placement to re-place both the original cells and clock splitters at the same time. In the second global placement stage, the look-ahead legalization technique is used to spread out the original cells and the clock splitters, and the clock tree is re-synthesized several times to obtain an optimized clock tree topology such that there are little overlaps of the clock splitters with the original circuit cells. In addition, the total wirelength of data signals and clock signal is optimized concurrently. After legalizing the placement of all cells, our placement refinement method can be run to further reduce the clock skew. Compared with the previous state-of-the-art work, on average we can reduce the total half-perimeter wirelength and clock skew by 17% and 30%, respectively.

    誌謝 ii 摘要 iii Abstract iv Contents v List of Figures vii List of Tables ix 1 Introduction 1 1.1 Advantages of SFQ Logic Circuits.............................. 1 1.2 Challenges of SFQ Logic Circuits Placement.................... 1 1.3 Motivation and Contribution .................................. 2 1.4 Organization ................................................. 4 2 Preliminaries 5 2.1 Placement .................................................... 5 2.2 Clock Tree Synthesis ......................................... 6 2.3 Prior Works .................................................. 6 3 Proposed Approach 9 3.1 Algorithm Outline............................................ 10 3.2 Modified Bound-To-Bound (B2B) [15] Net Model ................ 11 3.3 Trial Global Placement without Clock Splitters (1st GP) & Trial Clock Tree Synthesis............................................. 12 3.4 Update Netlist with Clock Splitters & Clock Nets............. 12 3.5 Global Placement of All Cells Including Clock Splitters & Clock Tree Re Synthesis (2nd GP........................................ 13 3.6 Placement Refinement ........................................ 18 4 Experimental Results 23 4.1 Experimental Setup .......................................... 23 4.2 Benchmark Generation......................................... 23 4.3 Differences between Proposed Approaches and Enhanced-MILP ... 25 4.4 The Algorithm Flow of Enhanced-MILP ......................... 27 4.5 Results on ISCAS’85 Benchmark Suite ......................... 27 5 Conclusion 32 References 33

    [1] Y. Kameda, S. Yorozu and Y. Hashimoto, “A New Design Methodology for Single-Flux-Quantum (SFQ) Logic Circuits Using Passive-Transmission-Line (PTL) Wiring,” in IEEE Transactions on Applied Superconductivity, vol. 17, no. 2, pp. 508-511, June 2007.

    [2] S. N. Shahsavani, T. Lin, A. Shafaei, C. J. Fourie and M. Pedram, “An Integrated Row-Based Cell Placement and Interconnect Synthesis Tool for Large SFQ Logic Circuits,” in IEEE Transactions on Applied Superconductivity, vol. 27, no. 4, pp. 1-8, June 2017, Art no. 1302008.

    [3] Gurobi Optimization, LLC, “Gurobi Optimizer Reference Manual,” 2021, [online] Available: http://www.gurobi.com.

    [4] S. N. Shahsavani, A. Shafaei and M. Pedram, “A placement algorithm for superconducting logic circuits based on cell grouping and super-cell placement,” 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE), 2018, pp. 1465-1468.

    [5] S. N. Shahsavani and M. Pedram, “A Minimum-Skew Clock Tree Synthesis Algorithm for Single Flux Quantum Logic Circuits,” in IEEE Transactions on Applied Superconductivity, vol. 29, no. 8, pp. 1-13, Dec. 2019.

    [6] T. Lin, C. Chu, J. R. Shinnerl, I. Bustany and I. Nedelchev, “POLAR: A High Performance Mixed-Size Wirelengh-Driven Placer With Density Constraints,” in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 34, no. 3, pp. 447-459, March 2015.

    [7] W. Chen, A. V. Rylyakov, V. Patel, J. E. Lukens and K. K. Likharev, “Rapid single flux quantum T-flip flop operating up to 770 GHz,” in IEEE Transactions on Applied Superconductivity, vol. 9, no. 2, pp. 3212-3215, June 1999.

    [8] N. Viswanathan and C. C. -. Chu, “FastPlace: efficient analytical placement using cell shifting, iterative local refinement,and a hybrid net model,” in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 24, no. 5, pp. 722-733, May 2005.

    [9] X. He, T. Huang, L. Xiao, H. Tian, G. Cui and E. F. Y. Young, “Ripple: An effective routability-driven placer by iterative cell movement,” 2011 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 2011, pp. 74-79.

    [10] M. Kim, D. Lee and I. L. Markov, “SimPL: An Effective Placement Algorithm,” in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 31, no.1, pp. 50-60, Jan. 2012.

    [11] P. Spindler, U. Schlichtmann, and F. M. Johannes, “Abacus: fast legalization of standard cell circuits with minimal movement,” Proceedings of the 2008 international symposium on Physical design, pp. 47–53, 2008.

    [12] O. A. Mukhanov, “Energy-Efficient Single Flux Quantum Technology,” in IEEE Transactions on Applied Superconductivity, vol. 21, no. 3, pp. 760-769, June 2011.
    [13] K. K. Likharev and V. K. Semenov, “RSFQ logic/memory family: a new Josephson-junction technology for sub-terahertz-clock-frequency digital systems,” in IEEE Transactions on Applied Superconductivity, vol. 1, no. 1, pp. 3-28, March 1991.

    [14] Y. Saad, “Iterative Methods for Sparse Linear Systems. Societyfor Industrial and Applied Mathematics,” Philadelphia, PA, USA, 2nd edition, 2003.

    [15] P. Spindler, U. Schlichtmann and F. M. Johannes, “Kraftwerk2—A Fast Force-Directed Quadratic Placement Approach Using an Accurate Net Model,” in IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 27, no. 8, pp. 1398-1411, Aug. 2008.

    [16] ABC: A system for Sequential Synthesis and Verification, [online] Available: http://www.eecs.berkeley.edu/~alanmi/abc/.

    [17] N. Katam, A. Shafaei and M. Pedram, “Design of Complex Rapid Single-Flux-Quantum Cells with Application to Logic Synthesis,” 2017 16th International Superconductive Electronics Conference (ISEC), 2017.

    [18] M. C. Hansen, H. Yalcin and J. P. Hayes, “Unveiling the ISCAS-85 benchmarks: a case study in reverse engineering,” in IEEE Design & Test of Computers, vol. 16, no. 3, pp.72-80, July-Sept. 1999.

    QR CODE