MPEG-7 is a multimedia content description standard, formally called the Multimedia Content Description Interface and standardized as ISO/IEC 15938. A description is associated with the content itself, to allow fast and efficient searching for material that is of interest to the user. Unlike MPEG-1, MPEG-2 and MPEG-4, it is therefore not a standard that deals with the actual encoding of moving pictures and audio. It uses XML to store metadata, which can be attached to timecode in order to tag particular events or, for example, to synchronize lyrics to a song.
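
As a rough illustration of timecoded XML metadata, the following Python sketch looks up which lyric line is active at a given playback time. The element and attribute names (Description, Segment, Text, start, duration) and the file name are hypothetical simplifications chosen for readability; they are not the element names defined by the MPEG-7 schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified markup: NOT valid against the real MPEG-7 schema.
# Times are given in seconds to keep the example short.
DESCRIPTION_XML = """
<Description media="song.mp3">
  <Segment start="0.0" duration="4.5"><Text>First line of the lyrics</Text></Segment>
  <Segment start="4.5" duration="3.0"><Text>Second line of the lyrics</Text></Segment>
  <Segment start="12.0" duration="5.0"><Text>Chorus</Text></Segment>
</Description>
"""


def text_at(root, seconds):
    """Return the text of the segment that covers `seconds`, or None."""
    for segment in root.iter("Segment"):
        start = float(segment.get("start"))
        end = start + float(segment.get("duration"))
        # Half-open interval [start, end) so adjacent segments do not overlap.
        if start <= seconds < end:
            return segment.findtext("Text")
    return None


root = ET.fromstring(DESCRIPTION_XML)
print(text_at(root, 5.0))  # Second line of the lyrics
print(text_at(root, 9.0))  # None (gap between segments)
```

Because the description lives beside the media rather than inside it, the same lookup works regardless of how the audio itself is encoded.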

It was designed to standardize:

  • a set of Description Schemes ("DS") and Descriptors ("D")
  • a language to specify these schemes called the Description Definition Language ("DDL")
  • a scheme for coding the description

The combination of MPEG-4 and MPEG-7 has sometimes been referred to as MPEG-47.

MPEG-7 is intended to complement the previous MPEG standards by standardizing multimedia content descriptions: it represents information about the content, not the content itself ("the bits about the bits"). MPEG-7 can be used independently of the other MPEG standards; the description might even be attached to an analog movie. The representation defined within MPEG-4, i.e. the representation of audio-visual data in terms of objects, is however very well suited to what will be built on the MPEG-7 standard. This object-based representation is basic to the process of categorization. In addition, MPEG-7 descriptions could be used to improve the functionality of previous MPEG standards.

With these tools (Descriptors, Description Schemes and the DDL), an MPEG-7 Description can be built and deployed. According to the requirements document,[1] “a Description consists of a Description Scheme (structure) and the set of Descriptor Values (instantiations) that describe the Data.” A Descriptor Value is “an instantiation of a Descriptor for a given data set (or a subset thereof).” The Descriptor is the syntactic and semantic definition of the content. Extraction algorithms are outside the scope of the standard because their standardization is not required to allow interoperability.
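
The relationship between these terms can be sketched as a small conceptual model in Python. This is only a toy illustration of the quoted definitions, not the standard's data model: the class layout, the type check, and the names SimpleVideo, Title and DurationSeconds are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Descriptor:
    """Definition of a feature: its name and the type of its values."""
    name: str
    value_type: type


@dataclass(frozen=True)
class DescriptorValue:
    """An instantiation of a Descriptor for a given data set."""
    descriptor: Descriptor
    value: object

    def __post_init__(self):
        # Reject values that do not match the Descriptor's definition.
        if not isinstance(self.value, self.descriptor.value_type):
            raise TypeError(f"{self.descriptor.name} expects "
                            f"{self.descriptor.value_type.__name__}")


@dataclass(frozen=True)
class DescriptionScheme:
    """The structure: which Descriptors a Description may contain."""
    name: str
    descriptors: tuple


@dataclass
class Description:
    """A Description Scheme plus the Descriptor Values that describe the data."""
    scheme: DescriptionScheme
    values: list = field(default_factory=list)

    def add(self, descriptor, value):
        if descriptor not in self.scheme.descriptors:
            raise ValueError(f"{descriptor.name} is not part of {self.scheme.name}")
        self.values.append(DescriptorValue(descriptor, value))


# Hypothetical descriptors for a short video clip.
title = Descriptor("Title", str)
duration = Descriptor("DurationSeconds", float)
scheme = DescriptionScheme("SimpleVideo", (title, duration))

description = Description(scheme)
description.add(title, "Holiday footage")
description.add(duration, 93.5)
```

Note that the model says nothing about how the title or duration were obtained, which mirrors the point that extraction is left to the implementer.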

