February Fourier Talks 2006
Pankaj Topiwala
Title:
Advanced Video Coding, and the Influence of Wavelets
Abstract:
Lossy video coding is by now a maturing subject, albeit still not an exact
science. It is largely predicated on the structure of image coding,
which is a more mature subject. While no global optimization approach
yet exists for the problem of image coding, in practice it has now
stabilized into a 3-stage process of transform, quantization, and entropy
coding. Video coding goes a step further, and incorporates a motion
estimation/compensation loop, giving it both temporal and spatial
decorrelation. While the follow-on stages of quantization and entropy
coding have seen improvements in recent designs, it is primarily the
subject of advanced spatial and temporal decorrelation that has been the
focus of recent designs, especially the latest international standard:
ITU/H.264 | ISO/IEC/MPEG-4 Pt. 10 (AVC), released in July, 2003, and
updated July 2004. A scalable extension of the standard is due by Dec.,
2006. This talk develops some of the main design features of the H.264
video coding standard, and compares it to several previous designs, and
highlights some of its advanced features. Ironically, while wavelet
concepts are not directly visible in the design, and DCT-based designs
still prevail, the methods of lifting, first developed for wavelets, have
played an important role in both the base design as well as the scalable
extension. And wavelet-based video coding methods remain as competitors
to these designs, ever ready to replace the DCT-based structures should
they come up short. And the dream of someday achieving a powerful, fully
embedded video codec lives on.