Please use this identifier to cite or link to this item:
Algorithm and Architecture Design of HDTV1080p Stereo Video Inter Prediction Processor in H.264/AVC Stereo High Profile
|關鍵字:||stereo video coding;立體影像編碼;disparity estimation;motion estimation;視差估測;移動估測||出版社:||電機工程學系所||引用:|| Video Codec for Audiovisual Services at p 64 × Kbit/s, ITU-T Recommendation H.261, March, 1993.  Video Coding for Low Bit Rate Communication, ITU-T Recommendation H.263, February, 1998.  Information Technology - Coding of Moving Pictures and Associated Audio for Digital Storage Media at up to about 1.5Mbit/s - Part 2: Video, ISO/IEC 11172-2, 1993.  Information Technology - Generic Coding of Moving Pictures and Associated Audio Information: Video, ISO/IEC 13818-2 and ITU-T Recommendation H.262, 1996.  Information Technology - Coding of Audio-Visual Objects - Part 2: Visual, ISO/IEC 14496-2, 1999.  Joint Video Team, Draft ITU-T Recommendation and Final Draft International Standard of Joint Video Specification, ITU-T Recommendation H.264 and ISO/IEC 14496-10 AVC, May 2003.  M. E. Lukacs, “Predictive coding of multi-viewpoint image sets,” in Proc, IEEE International Conference Acoustics Speech Signal Processing, Tokyo, Japan, vol. 1, pp. 521-524, January 1986.  I. Dinstein, G. Guy, J. Rabany, J. Tzelgov, and A. Henik, “On the compress of stereo image: Preliminary results,” Signal Processing, Image Communications, no. 4, pp. 373-382, August 1989.  M. G. Perkins “Data compression of stereo pairs,” IEEE Transactions on Communications, vol. 40, no. 4, pp. 684-696, April 1992.  MPEG-2 Group, Propose Draft Amendment No.3 to 13813-2 (Multi-View Profile), ISO/IEC JTC 1/SC 29/WG11 N1088. MPEG-2, 1995.  ITU-T and ISO/IEC JTC 1, Final draft amendment 3, Amendment 3 to ITU-T Recommendation H262 and ISO/IEC 13818-2 (MPEG-2 Video), ISO/IEC JTC 1/SC29/WG 11 (MPEG) Doc. N1366, September 1996.  A. Puri, R.V. Kollarits, and B. G. Haskell, “Stereoscopic video compression using temporal scalability,” in Proc. SPIE Conf. Visualization Communication Image Process., vol. 2501, pp. 745-756, May 1995.  X. Chen and A. Luthra, “MPEG-2 multi-view profile and its application in 3DTV,” in Proc, SPIE IS&T Multimedia Hardware Architectures, San Diego, CA, vol. 3021, pp. 212-223, February 1997.  J.-R. Ohm, “Stereoscopic video compression using temporal scalability,” in Proc. SPIE Conf. Stereoscopic Displays Virtual Reality System Vi, San Jose, CA, Jan. 1999, DOI:10.1117/12.349.85.  J.-R. Ohm and K. Muller, “Incomplete 3-D multiview representation of video objects” IEEE Transactions on Circuit and Systems for video technology, vol. 9, no. 2, pp389-4, August 2002.  MPEG Video Sub-Group Chair (J.-R. Ohm), submissions received in CfP on multiview video coding, Bangkok, Thailand, ISO/IEC JTC 1/SC 29/WG 11 (MPEG) Doc. M12969, January 2006.  J.-R. Ohm, “Stereo/multiview video encoding using the MPEG family of standards,” in Proc. SPIE Conf. Stereoscopic Displays Virtual Reality Syst. VI, San Jose, CA, January 1999, DOI:10.1117/12.349285.  R. Li, B. Zeng, and M. L. Liou, “A new three-step search algorithm for block motion estimation,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 4, no. 4, pp. 438 - 442, August 1994.  J. Y. Tham, S. Ranganath, M. Ranganath, A. A. Kassim, “A novel unrestricted center-biased diamond search algorithm for block motion estimation,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 8, no. 4, August 1998.  Shan Zhu and Kai-Kuang Ma, “A new diamond search algorithm for fast block-matching motion estimation,” IEEE Transactions on Image Processing, vol. 9, no. 2, February 2000  M. J. Chen, L. G. Chen, and T. D. Chiueh, “One-dimensional full search motion estimation algorithm for video coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 4, no. 5, 1994, pp. 504 - 509, June 1994.  L. M. Po and W. C. Ma, “A novel four-step search algorithm for fast block motion estimation,” IEEE Transactions on Circuits and Systems for Video Technology., vol. 6, no. 3, pp. 313 - 317, June 1996.  “A fast hierarchical motion vector estimation algorithm using mean pyramid,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 5, no. 4, August 1995.  W. Li and E. Salari “Successive elimination algorithm for motion estimation,” IEEE Transactions on Image Processing, vol. 4, no. 1, January 1995.  X. Q. Gao, C. J. Duanmu, C. R. Zou, “A multilevel successive elimination algorithm for block matching motion estimation,” IEEE Transactions on Image Processing, vol. 9, no. 3, March 2000.  Y.-W. Huang, S.-Y. Chien, B.-Y. Hsieh, L.-G. Chen, “Global elimination algorithm and architecture design for fast block matching motion estimation,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 14, no. 6, June 2004.  L.-F. Ding, S.-Y. Chien, and L.-G. Chen, “Joint prediction algorithm and architecture for stereo video hybrid coding systems,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 16, no. 11, pp. 1324-1337 November 2006.  A. B. B. Adikari, W. A. C. Fernando, H. K. Arachchi, K. K. Loo, “A H.264 Compliant Stereoscopic video codec,” Canadian Conference on Electrical and Computer Engineering, Page(s):1614 - 1617, May 2005.  H. H. Cheng, Y. C. Chen “Design and Analysis of Disparity Prediction Architecture for H.264 based Stereo Video Coding,” MS.D. dissertation, Dept Elect. Eng., Nation Tsing Hua Univ., Hsinchu, Taiwan R.O.C., 2007.  L. Shen, Z. Liu, S. Liu, Z. Zhang, and P. An, “Selective disparity estimation and variable size motion estimation based on motion homogeneity for multi-view Coding,” IEEE Transactions on Broadcasting, vol. 55, no. 4, pp. 761-766, November 2009.  L. Shen, Z. Liu, T. Yan, Z. Zhang, and P. An, “View-adaptive motion estimation and disparity estimation for low complexity multiview video coding,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 20, no. 6, pp. 925-930, June 2010.  W. Zhu, X. Tian, F. Zhou and Y. Chen, “Fast inter mode decision based on textural segmentation and correlations for multiview video coding,” IEEE Transaction on Consumer Electronics, vol. 56, no. 3, pp. 1696-1704, October 2010.  I.-M. Pao, M.-T. Sun, “Modeling DCT coefficients for fast video encoding” IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, no. 4, pp. 608 - 616, June 1999.  Lam E.Y., Goodman. J.W., “A mathematical analysis of the DCT coefficient distributions for images,” IEEE Transactions on Image Processing, vol. 9, no. 10, pp. 1661 - 1666, August 2000.  Lam E.Y., “Analysis of the DCT coefficient distributions for document coding,” IEEE Signal Processing Letters, vol. 11, no. 2, part 1, pp. 97 -100, January 2004.  J.-C.Tuan, T.-S. Chang, and C.-W. Jen, “On the data reuse and memory bandwidth analysis for full-search block-matching vlsi architecture,” IEEE Transactions on Circuits and Systems for Video Technology., vol. 12, no. 1, pp. 61-72, August 2002.||摘要:||
晶片實驗結果採用TSMC Arm 0.09mm 1P9M 標準元件製程，頻率可以達到250Mhz 達到HDTV 1080p即時立體影像標準，功率消耗達到171mW。
Three-dimensional (3D) video offers a high-quality and immersive multimedia to human eyes via sender transmission. In 3D video systems, stereo video coding plays an important role because stereo video doubles the amount of data, comparing with the tradition 2D video. Therefore, stereo video coding must solve this problem so it can effectively reduce the original large amount of date, whereas maintain the good quality in stereo video. For this reason, this thesis stereo video coding will be divided into two stages. At the first stage, we propose fast inter coding algorithm applied in H.264/AVC stereo video coding system; and at the second stage, we propose fast inter coding architecture applied in H.264/AVC stereo video coding.
In terms of algorithm, we propose a fast motion/disparity estimation algorithm based on the coarse-to-fine technique. The proposed algorithm is applied to integer motion/disparity estimator of the H.264 stereo encoder. It performs the local full search on pixels around the selected candidates to obtain the 41 MVs or DVs. The proposed algorithm that compared with full search block match algorithm can keep the good quality and save the amount of computation algorithm. Moreover, we propose an early disparity termination algorithm base on model-based scheme. The threshold values are determined by some equations. When motion estimation is finished, we compare the SAD value of some points with threshold values. If SAD value is smaller than threshold value, disparity will be terminated. This algorithm can also increase with the quality and reduce the computation amount.
In the Hardware, combining the proposed coarse-to-fine algorithm and module-based early disparity termination algorithm, we propose fast inter coding architecture applied in H.264/AVC stereo video coding. This architecture also allows us to use the same computation unit to estimate left view motion and right view motion/disparity. We also use memory preload technology to save stereo video coding time.
The implementation result shows that the proposed Stereo inter prediction processor has 130K logic gates with core size 2.8 × 2.8 mm2. Moreover, In the TSMC 0.09mm 1P9M CMOS process, the H.264/AVC Stereo High Profile HDTV1080p applications at 250 MHz.
|Appears in Collections:||電機工程學系所|
Show full item record
TAIR Related Article
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.