研究生: |
葉怡婷 Yi-Ting Yeh |
---|---|
論文名稱: |
整合錯誤更正碼與冗餘修復的方法以提高記憶體之可靠度 Enhancing Memory Reliability by Using an Integrated ECC and Redundancy Repair Scheme |
指導教授: |
吳誠文
Cheng-Wen Wu |
口試委員: | |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 電機工程學系 Department of Electrical Engineering |
論文出版年: | 2005 |
畢業學年度: | 93 |
語文別: | 英文 |
論文頁數: | 68 |
中文關鍵詞: | 錯誤更正碼 、冗餘修復 、可靠度 、記憶體 |
外文關鍵詞: | ECC, Redundancy Repair, Reliability, Memory |
相關次數: | 點閱:2 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
隨著半導體製程的快速發展,半導體記憶體的大小及密度迅速成長,而維持高水準的品質(Quality)、良率(Yield)與可靠度(Reliability)也越來越困難。冗餘修復(Redundancy Repair)技巧與錯誤更正碼(Error Correction Code, ECC)是常用來增進記憶體晶片良率與可靠度的兩種方法。傳統上,前者是用來修復硬錯誤(Hard Error),後者則是用來更正軟錯誤(Soft Error)。在本篇文章中,我們提出一個整合型的解決方案以增進記憶體之可靠度。過去相關的研究結合冗餘修復技巧與錯誤更正碼都是為了提高記憶體之良率,出廠前所測到的部份缺陷並沒有完全利用冗餘單元作永久修復,而是利用錯誤更正碼的更錯能力讓使用者在使用時能更正錯誤的資料。現代互補型金屬氧化物半導體(CMOS)記憶體容易受到放射線所引起的軟錯誤,因此,以可靠度的觀點,錯誤更正碼更正軟錯誤的
能力不應該被犧牲。我們所提出的方法則是在使正常操作模式(Normal Operation Mode)時能配合錯誤型態辨別方法分辨硬錯誤與軟錯誤,並在記憶體空載時間(Memory Idle Time)將硬錯誤用還沒有被使用的冗餘單元修補,使有錯誤更正碼電路的記憶體可以容忍更多的軟錯誤且不會損壞。我們並提出了ㄧ個估計記憶體可靠度的方法。實驗結果顯示藉由提出的整合冗餘修復與錯誤更正碼的方法,記憶體大小為8kx64 的平均故障間隔時間(Mean Time to Failure, MTTF)可提高1412 小時(7.1%)。
另一方面,由於進入深次微米技術,為了維持記憶體產品的可靠度以及品質,錯誤更正碼要同時能容忍更多數目的錯誤。然而,雙錯誤更正能力的錯誤更正碼(Double-Error Correctable ECC)卻很少被設計在記憶體電路中,因為其所需較多的校驗位數目(parity-bit)以及複雜的編碼解碼步驟使得成本過高。本篇文章提出一個低成本具有雙錯誤更正能力的方法,其所需要的校驗位數目與傳統具單錯誤更正-雙錯誤偵測能力的錯誤更正碼(Single-Error Correctable and Double-Error Detectable ECC)相同。
With the fast development pace of deep submicron technology comes a rapid growth in the size and density of semiconductor memory. However, keeping a high level of yield and reliability for
memory products becomes increasingly difficult. Both the redundancy repair and Error Correction
Code (ECC) techniques have been widely used for enhancing the yield and reliability of memory
chips. Specifically, the redundancy repair and ECC techniques are conventionally used to repair or
correct the hard faults and soft errors, respectively. In this paper, we propose an integrated ECC and
redundancy repair scheme for memory reliability enhancement. Previous works emphasized yield
improvement as they combined redundancy repair and ECC schemes, while we develop a method
to permanantly repair hard errors in the field so that more soft errors can be tolerated without chip
failure. Our approach identifies the hard faults and soft errors during the memory normal operation
mode, and repairs hard faults during the memory idle time as long as there are unused redundant
elements. A method for evaluating the memory reliability is also developed. Experimental results
show that the proposed approach is effective, e.g., the MTTF of an 8K 64 memory is improved
by 1,412 hours (7.1%) with our integrated ECC and repair scheme.
In addition, the development of deep submicron technology causes the semiconductor memory
to become larger and denser. In order to maintain high reliability and quality for memory products,
ECC buth serves as a soft error correction and plays an important role in reliability enhancement
to tolerate other forms of hard errors. However, double-error correctable (DEC) ECC has rarely
been used in memory designs because of its high cost in parity area overhead and the complex
calculation process. In this paper, we also propose a low-cost double-error correctable scheme
without expanding the number of parity bits required in traditional single-error correctable and
double-error detectable (SEC-DED) ECC. Experimental results show that the proposed method
has a low area overhead with no increase in the decoding cycles compared to conventional double
error correctable and triple-error detectable (DEC-TED) ECC (BCH code).
[1] Semiconductor Industry Association, “International technology roadmap for semiconductors
(ITRS), 2003 edition,” Dec. 2003.
[2] C.-T. Huang, J.-R. Huang, C.-F. Wu, C.-W. Wu, and T.-Y. Chang, “A programmable BIST
core for embedded DRAM,” IEEE Design & Test of Computers, vol. 16, pp. 59–70, Jan.-Mar.
1999.
[3] I. Kim, Y. Zorian, G. Komoriya, H. Pham, F. P. Higgins, and J. L. Lweandowski, “Built in
self repair for embedded high density SRAM,” in Proc. Int. Test Conf. (ITC), pp. 1112–1119,
Oct. 1998.
[4] J.-F. Li, J.-C. Yeh, R.-F. Huang, C.-W. Wu, P.-Y. Tsai, A. Hsu, and E. Chow, “A built-in selfrepair
scheme for semiconductor memories with 2-D redundancy,” in Proc. Int. Test Conf.
(ITC), (Charlotte), pp. 393–402, Sept. 2003.
[5] Y. Zorian, “Embedded memory test & repair: infrastructure IP for SOC yield,” in Proc. Int.
Test Conf. (ITC), (Baltmore), pp. 340–349, Oct. 2002.
[6] C. V. Srinivasan, “Codes for error correction in high-speed memory systems—Part I: Correction
of cell defects in integrated memories,” IEEE Trans. Computers, vol. 20, pp. 882–888,
1971.
[7] T. Granlund, B. Granbom, and N. Olsson, “Soft error rate increase for new generations of
SRAMs,” IEEE Trans. Nuclear Science, vol. 50, pp. 2065–2068, Dec. 2003.
[8] N. Derhacobian, V. A. Vardanian, and Y. Zorian, “Embedded memory reliability: The
SER challenge,” in Proc. IEEE Int. Workshop on Memory Technology, Design and Testing
(MTDT), (San Jose), pp. 104–110, Aug. 2004.
[9] S. Gregori, A. Cabrini, O. Khouri, and G. Torelli, “On-chip error correcting techniques for
new-generation flash memories,” Proc. of the IEEE, vol. 91, pp. 602–616, Apr. 2003.
[10] C. Wickman, D. G. Elliott, and B. F. Cockburn, “Cost model for large file memory DRAMs
with ECC and bad block marking,” in Proc. IEEE Int. Symp. Defect and Fault Tolerance in
VLSI Systems (DFT), (Albuquerque), pp. 319–327, Nov. 1999.
[11] M. Nicolaidis, N. Achouri, and L. Anghel, “A diversified memory built-in self-repair approach
for nanotechnologies,” in Proc. IEEE VLSI Test Symp. (VTS), (Napa Valley), pp. 313–
318, Apr. 2004.
[12] Y.-T. Hsing, C.-W.Wang, C.-W.Wu, C.-T. Huang, and C.-W.Wu, “Failure factor based yield
enhancement for SRAM designs,” in Proc. IEEE Int. Symp. Defect and Fault Tolerance in
VLSI Systems (DFT), (Cannes), pp. 20–28, Oct. 2000.
[13] R.-F. Huang, J.-F. Li, J.-C. Yeh, and C.-W. Wu, “A simulator for evaluating redundancy analysis
algorithms of repairable embedded memories,” in Proc. IEEE Int. Workshop on Memory
Technology, Design and Testing (MTDT), (Isle of Bendor, France), pp. 68–73, July 2002.
[14] M. Spica and T. M. Mak, “Do we need anything more than single bit error correction (ECC)?,”
in Proc. IEEE Int.Workshop on Memory Technology, Design and Testing (MTDT), (San Jose),
pp. 111–116, Aug. 2004.
[15] R. W. Hamming, “Error detecting and error correcting codes,” Bell System Tech. J.,
vol. XXVI, pp. 147–160, Apr. 1950.
[16] W. Gao and S. Simmons, “A study on the VLSI implementation of ecc for embedded dram,”
in Proc. Canadian Conf. Electrical and Computer Engineering, vol. 1, (Montreal), pp. 203–
206, May 2003.
[17] M. Y. Hsiao, “A class of optimal minimum odd-weight-column sec-ded codes,” IBM J. Research
and Development, vol. 14, pp. 395–401, July 1970.
[18] W. W. Peterson and E. J. Weldon, Jr., Error-Correcting Codes. Cambridge, MA: MIT Press,
2 ed., 1972.
[19] S. Lin and D. J. Costello, Error Control Coding: Fundamentals and Applications. Englewood
Cliffs, NJ: Prentice-Hall Inc., 1983.
[20] R. C. Baumann, “Soft errors in advanced semiconductor devices—Part I: The three radiation
sources,” IEEE Trans. Device and Materials Reliability, vol. 1, pp. 17–22, Mar. 2001.
[21] W. Kuo, W.-T. K. Chien, and T. Kim, Reliability, Yield, and Stress Burn-in. Boston: Kluwer
Academic Publishers, 1998.
[22] C. Cheng, C.-T. Huang, J.-R. Huang, C.-W. Wu, C.-J. Wey, and M.-C. Tsai, “BRAINS: A
BIST complier for embedded memories,” in Proc. IEEE Int. Symp. Defect and Fault Tolerance
in VLSI Systems (DFT), (Yamanashi), pp. 299–307, Oct. 2000.
[23] C.-L. Su, R.-F. Huang, and C.-W. Wu, “A processor-based built-in self-repair design for embedded
memories,” in Proc. 12th IEEE Asian Test Symp. (ATS), (Xian), pp. 366–371, Nov.
2003.
[24] W. K. S. Walker, C. E. Sundberg, and C. J. Black, “A reliable spaceborne memory with
a single error correction and erasure correction scheme,” IEEE Trans. Computers, vol. 28,
pp. 493–500, July 1979.
[25] D. C. Bossen and M. Y. Hsiao, “A system solution to the memory soft error problem,” IBM
J. Research and Development, vol. 24, pp. 390–397, May 1980