
MPEG-7

Article snapshot taken from Wikipedia under the Creative Commons Attribution-ShareAlike license.

MPEG-7 is a multimedia content description standard. It was standardized in ISO/IEC 15938 (Multimedia Content Description Interface). The description is associated with the content itself, to allow fast and efficient searching for material that is of interest to the user. MPEG-7 is formally called Multimedia Content Description Interface. Thus, it is not a standard which deals with the actual encoding of moving pictures and audio, like MPEG-1, MPEG-2 and MPEG-4. It uses XML to store metadata, and can be attached to timecode in order to tag particular events, or synchronise lyrics to a song, for example.
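As a rough sketch of how such a description might look in practice, the Python snippet below assembles a minimal MPEG-7-style XML fragment that anchors a free-text annotation to a timecode. The element names and the urn:mpeg:mpeg7:schema:2001 namespace follow common MPEG-7 usage, but the exact structure here is illustrative rather than a normative example from ISO/IEC 15938.

```python
import xml.etree.ElementTree as ET

# Illustrative, simplified MPEG-7-style description: a video segment
# annotated with a free-text note ("Chorus starts") anchored to a timecode.
NS = "urn:mpeg:mpeg7:schema:2001"  # namespace commonly used for MPEG-7 schemas
ET.register_namespace("mpeg7", NS)

root = ET.Element(f"{{{NS}}}Mpeg7")
description = ET.SubElement(root, f"{{{NS}}}Description")
content = ET.SubElement(description, f"{{{NS}}}MultimediaContent")
video = ET.SubElement(content, f"{{{NS}}}Video")

segment = ET.SubElement(video, f"{{{NS}}}VideoSegment")
annotation = ET.SubElement(segment, f"{{{NS}}}TextAnnotation")
ET.SubElement(annotation, f"{{{NS}}}FreeTextAnnotation").text = "Chorus starts"

media_time = ET.SubElement(segment, f"{{{NS}}}MediaTime")
ET.SubElement(media_time, f"{{{NS}}}MediaTimePoint").text = "T00:01:05"  # timecode anchor
ET.SubElement(media_time, f"{{{NS}}}MediaDuration").text = "PT15S"       # 15-second span

print(ET.tostring(root, encoding="unicode"))
```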


It was designed to standardize a set of Descriptors and Description Schemes for describing multimedia content, a Description Definition Language (DDL) for specifying those schemes, and a scheme for coding descriptions. The combination of MPEG-4 and MPEG-7 has sometimes been referred to as MPEG-47. MPEG-7 is intended to complement the previous MPEG standards by standardizing multimedia metadata -- information about the content, not the content itself. MPEG-7 can be used independently of the other MPEG standards; the description might even be attached to an analog movie.

A web browser might fetch a webpage or a new VRML file from the Internet when the user clicks on the specific graphical component. Animations, sounds, lighting, and other aspects of the virtual world can interact with the user or may be triggered by external events such as timers. A special Script node allows the addition of program code (e.g., written in Java or ECMAScript) to a VRML file. VRML files are commonly called "worlds" and have the .wrl extension (for example, island.wrl). VRML files are in plain text and generally compress well using gzip, useful for transferring over the Internet more quickly (some gzip-compressed files use the .wrz extension). Many 3D modeling programs can save objects and scenes in VRML format. The Web3D Consortium has been formed to further the collective development of the format.
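Because a VRML world is just plain text, it is easy to see what one looks like on disk. The sketch below is an assumed example rather than one taken from the article: it writes a tiny hand-made VRML97 world containing a single red sphere to island.wrl, then a gzip-compressed copy to island.wrz.

```python
import gzip
from pathlib import Path

# A minimal VRML97 "world": plain UTF-8 text describing a single red sphere.
# (Illustrative only; real worlds are usually exported from 3D modeling tools.)
world = """#VRML V2.0 utf8
Shape {
  appearance Appearance {
    material Material { diffuseColor 1 0 0 }
  }
  geometry Sphere { radius 1 }
}
"""

Path("island.wrl").write_text(world)       # the conventional .wrl extension

# Plain text compresses well; a gzipped copy conventionally uses .wrz.
with gzip.open("island.wrz", "wt") as f:
    f.write(world)
```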

H-Anim is a standard for animated humanoids, which is based around VRML, and later X3D. The initial version 1.0 of the H-Anim standard was scheduled for submission at the end of March 1998. VRML has never seen much serious widespread use. One reason for this may have been the lack of available bandwidth. At the time of VRML's popularity, a majority of users, both business and personal, were using slow dial-up Internet access.

MPEG-4 provides a series of technologies for developers, for various service-providers and for end users. The MPEG-4 format can perform various functions. MPEG-4 provides a large and rich set of tools for encoding. Subsets of the MPEG-4 tool sets have been provided for use in specific applications. These subsets, called 'Profiles', limit the size of the tool set a decoder is required to implement.

Initially, MPEG-4 was aimed primarily at low-bit-rate video communications; however, its scope as a multimedia coding standard was later expanded. MPEG-4 is efficient across a variety of bit rates ranging from a few kilobits per second to tens of megabits per second. MPEG-4 provides functions such as improved coding efficiency over MPEG-2, the ability to encode mixed media data (video, audio, speech), error resilience for robust transmission, and the ability to interact with the audio-visual scene generated at the receiver.

VRML (Virtual Reality Modeling Language, pronounced vermal or by its initials, originally, before 1995, known as the Virtual Reality Markup Language) is a standard file format for representing 3-dimensional (3D) interactive vector graphics, designed particularly with the World Wide Web in mind. It has been superseded by X3D. VRML is a text file format where, e.g., vertices and edges for a 3D polygon can be specified along with the surface color, UV-mapped textures, shininess, transparency, and so on. URLs can be associated with graphical components.

A free cross-platform runtime implementation of VRML is available in OpenVRML. Its libraries can be used to add both VRML and X3D support to applications, and a GTK+ plugin is available to render VRML/X3D worlds in web browsers. In the 2000s, many companies like Bitmanagement improved the quality level of virtual effects in VRML to the quality level of DirectX 9.0c, but at the expense of using proprietary solutions. All the main features for game modeling are already complete. They include multi-pass rendering with low-level settings for Z-buffer, BlendOp, AlphaOp, Stencil, multi-texturing, shaders with HLSL and GLSL support, realtime Render To Texture, Multiple Render Targets (MRT) and post-processing. Many demos show that VRML already supports lightmaps, normal maps, SSAO, CSM and realtime environment reflection along with other virtual effects. This example shows the same scene as X3D § Example.

The representation that is defined within MPEG-4, i.e. the representation of audio-visual data in terms of objects, is however very well suited to what will be built on the MPEG-7 standard. This representation is basic to the process of categorization. In addition, MPEG-7 descriptions could be used to improve the functionality of previous MPEG standards. With these tools, we can build an MPEG-7 Description and deploy it.

MPEG-7 (ISO/IEC 15938) consists of different parts. Each part covers a certain aspect of the whole specification. An MPEG-7 architecture requirement is that the description must be separate from the audiovisual content. On the other hand, there must be a relation between the content and the description. Thus the description is multiplexed with the content itself.

In order to restrict computational complexity, one or more 'Levels' are set for each Profile. A Profile and Level combination allows a codec builder to implement only the subset of the standard needed, while maintaining interworking with other MPEG-4 devices that use the same combination, and allows conformance to the standard to be checked. MPEG-4 consists of several standards, termed "parts"; each part covers a certain aspect of the whole specification. Profiles are also defined within the individual "parts", so an implementation of a part is ordinarily not an implementation of an entire part. MPEG-1, MPEG-2, MPEG-7 and MPEG-21 are other suites of MPEG standards.



VRML's capabilities remained largely the same while realtime 3D graphics kept improving. The VRML Consortium changed its name to the Web3D Consortium, and began work on the successor to VRML, X3D. SGI ran a web site at vrml.sgi.com on which was hosted a string of regular short performances of a character called "Floops" who was a VRML character in a VRML world. Floops was a creation of a company called Protozoa.

To address this issue, there were many attempts to map the MPEG-7 XML Schema to the Web Ontology Language (OWL), which is a structured data equivalent of the terms of the MPEG-7 standard (MPEG-7Ontos, COMM, SWIntO, etc.). However, these mappings did not really bridge the "Semantic Gap," because low-level video features alone are inadequate for representing video semantics. In other words, annotating an automatically extracted video feature, such as color distribution, does not provide the meaning of the actual visual content.

Most of the features included in MPEG-4 are left to individual developers to decide whether or not to implement. This means that there are probably no complete implementations of the entire MPEG-4 set of standards. To deal with this, the standard includes the concept of "profiles" and "levels", allowing a specific set of capabilities to be defined in a manner appropriate for a subset of applications.
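To make the profile-and-level idea concrete, here is a small, hypothetical sketch: a conformance check rejects streams whose parameters exceed the limits of the profile/level combination a decoder implements. The profile names and numeric limits below are illustrative placeholders, not values taken from ISO/IEC 14496.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class LevelLimits:
    """Upper bounds a decoder at a given profile/level must support (illustrative numbers)."""
    max_width: int
    max_height: int
    max_bitrate_kbps: int

# Hypothetical table: the profile restricts the tool set, the level caps complexity.
PROFILE_LEVELS = {
    ("SimpleProfile", 1): LevelLimits(176, 144, 64),
    ("SimpleProfile", 3): LevelLimits(352, 288, 384),
    ("AdvancedSimpleProfile", 5): LevelLimits(720, 576, 8000),
}

def stream_fits(profile: str, level: int, width: int, height: int, bitrate_kbps: int) -> bool:
    """Return True if a stream with these parameters fits the given profile and level."""
    limits = PROFILE_LEVELS[(profile, level)]
    return (width <= limits.max_width
            and height <= limits.max_height
            and bitrate_kbps <= limits.max_bitrate_kbps)

# A CIF-sized stream at 300 kbit/s fits "SimpleProfile" level 3 in this toy table.
print(stream_fits("SimpleProfile", 3, 352, 288, 300))  # True
```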

In October 1995, at Internet World, Template Graphics Software (TGS) demonstrated a 3D/VRML plug-in for the beta release of Netscape 2.0 by Netscape Communications. In 1997, a new version of the format was finalized as VRML97 (also known as VRML2 or VRML 2.0) and became an ISO/IEC standard. VRML97 was used on the Internet on some personal homepages and sites such as "CyberTown", which offered 3D chat using Blaxxun Software, as well as Sony's SAPARi program, which was pre-installed on Vaio computers from 1997 to 2001.

VRML and its successor, X3D, have been accepted as international standards by the International Organization for Standardization (ISO) and the International Electrotechnical Commission (IEC). The first version of VRML was specified in November 1994. This version was specified from, and very closely resembled, the API and file format of the Open Inventor software component, originally developed by SGI.

Uses of MPEG-4 include compression of audiovisual data for Internet video and CD distribution, voice (telephone, videophone) and broadcast television applications. The MPEG-4 standard was developed by a group led by Touradj Ebrahimi (later the JPEG president) and Fernando Pereira. MPEG-4 absorbs many of the features of MPEG-1 and MPEG-2 and other related standards, adding new features such as (extended) VRML support for 3D rendering, object-oriented composite files (including audio, video and VRML objects), support for externally specified digital rights management and various types of interactivity.

MPEG-4 is a group of international standards for the compression of digital audio and visual data, multimedia systems, and file storage formats. It was originally introduced in late 1998 as a group of audio and video coding formats and related technology agreed upon by the ISO/IEC Moving Picture Experts Group (MPEG) (ISO/IEC JTC 1/SC29/WG11) under the formal standard ISO/IEC 14496 – Coding of audio-visual objects.

According to the requirements document, "a Description consists of a Description Scheme (structure) and the set of Descriptor Values (instantiations) that describe the Data." A Descriptor Value is "an instantiation of a Descriptor for a given data set (or subset thereof)." The Descriptor is the syntactic and semantic definition of the content. Extraction algorithms are outside the scope of the standard because their standardization is not required to allow interoperability.
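Read as a data model, that definition is easy to sketch. The toy classes below (names invented for illustration, not taken from the standard's schema) show how a Description pairs the structure given by a Description Scheme with the concrete Descriptor Values that instantiate it.

```python
from dataclasses import dataclass
from typing import Any, List

@dataclass
class Descriptor:
    """Syntactic and semantic definition of one content feature (e.g. a dominant colour)."""
    name: str
    datatype: type

@dataclass
class DescriptorValue:
    """An instantiation of a Descriptor for a given data set."""
    descriptor: Descriptor
    value: Any

@dataclass
class Description:
    """A Description: a Description Scheme (structure) plus the Descriptor Values filling it."""
    scheme: List[Descriptor]         # the structure: which Descriptors apply
    values: List[DescriptorValue]    # the instantiations describing one piece of content

dominant_colour = Descriptor("DominantColor", str)
running_time = Descriptor("MediaDuration", int)

clip_description = Description(
    scheme=[dominant_colour, running_time],
    values=[
        DescriptorValue(dominant_colour, "#1e90ff"),  # illustrative value
        DescriptorValue(running_time, 5400),          # seconds, illustrative
    ],
)
```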

On the right side you can see this relation between description and content. MPEG-7 uses tools such as Descriptors (D), Description Schemes (DS), the Description Definition Language (DDL) and system tools. On the right side you can see the relation between the MPEG-7 tools. There are many applications and application domains which will benefit from the MPEG-7 standard. A few application examples are digital libraries (such as image catalogues and music dictionaries), multimedia directory services, broadcast media selection and multimedia editing. The MPEG-7 standard was originally written in XML Schema (XSD), which constitutes semi-structured data. For example, the running time of a movie annotated using MPEG-7 in XML is machine-readable data, so software agents will know that the number expressing the running time is a positive integer, but such data is not machine-interpretable (cannot be understood by agents), because it does not convey semantics (meaning), known as the "Semantic Gap."
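The machine-readable/machine-interpretable distinction can be seen in a few lines: an agent can pull the running time out of MPEG-7-flavoured XML and check that it is a positive number, but nothing in the markup itself tells the agent what a running time means. The fragment and element name below are illustrative.

```python
import re
import xml.etree.ElementTree as ET

# Illustrative MPEG-7-style fragment giving a running time of 1 h 30 min.
fragment = "<MediaDuration>PT1H30M</MediaDuration>"

duration_text = ET.fromstring(fragment).text                # machine-readable: trivial to extract
match = re.fullmatch(r"PT(?:(\d+)H)?(?:(\d+)M)?", duration_text)
hours, minutes = (int(g or 0) for g in match.groups())
total_minutes = hours * 60 + minutes

# The agent now knows the value is a positive integer (90 minutes), but the XML
# alone carries no semantics: nothing states that this number is the running time
# of a movie, which is the gap the OWL/RDF mappings try to close.
print(total_minutes)
```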



In a March 1998 ACM essay, "Playfulness in 3D Spaces -- Why Quake is better than VRML, and what it means for software design", Clay Shirky sharply criticised VRML as a "technology in search of a problem", whereas "Quake does something well instead of many things poorly... The VRML community has failed to come up with anything this compelling -- not despite ..."

The majority of patents used for the MPEG-4 Visual format are held by three Japanese companies: Mitsubishi Electric (255 patents), Hitachi (206 patents), and Panasonic (200 patents).

MPEG-4 contains patented technologies, the use of which requires licensing in countries that acknowledge software algorithm patents. Over two dozen companies claim to have patents covering MPEG-4. MPEG LA licenses patents required for MPEG-4 Part 2 Visual from a wide range of companies (audio is licensed separately) and lists all of its licensors and licensees on the site. New licenses for MPEG-4 System patents are under development and no new licenses are being offered while holders of its old MPEG-4 Systems license are still covered under the terms of that license for the patents listed.

The term VRML was coined by Dave Raggett in a paper called "Extending WWW to support Platform Independent Virtual Reality" submitted to the First World Wide Web Conference in 1994, and first discussed at the WWW94 VRML BOF established by Tim Berners-Lee, where Mark Pesce presented the Labyrinth demo he developed with Tony Parisi and Peter Kennard. VRML was introduced to a wider audience in the SIGGRAPH course "VRML: Using 3D to Surf the Web" in August 1995.

Version 2.0 development was guided by the ad hoc VRML Architecture Group (VAG). A working draft was published in August 1996. Formal collaboration between the VAG and SC24 of ISO/IEC began in 1996 and VRML 2.0 was submitted to ISO for adoption as an international standard. The current and functionally complete version is VRML97 (ISO/IEC 14772-1:1997). VRML has now been superseded by X3D (ISO/IEC 19775-1).

The format was championed by SGI's Cosmo Software; when SGI restructured in 1998, the division was sold to the VREAM Division of Platinum Technology, which was then taken over by Computer Associates, which did not develop or distribute the software. To fill the void a variety of proprietary Web 3D formats emerged over the next few years, including Microsoft Chrome and Adobe Atmosphere, neither of which is supported today.

VRML experimentation was primarily in education and research where an open specification is most valued. It has now been re-engineered as X3D. The MPEG-4 Interactive Profile (ISO/IEC 14496) was based on VRML (now on X3D), and X3D is largely backward-compatible with it. VRML is also widely used as a file format for interchange of 3D models, particularly from CAD systems.

AAC (Advanced Audio Coding) was standardized as an adjunct to MPEG-2 (as Part 7) before MPEG-4 was issued. MPEG-4 is still an evolving standard and is divided into a number of parts. Companies promoting MPEG-4 compatibility do not always clearly state which "part" level compatibility they are referring to. The key parts to be aware of are MPEG-4 Part 2 (including Advanced Simple Profile, used by codecs such as DivX, Xvid, Nero Digital, RealMedia, 3ivx, H.263 and by QuickTime 6) and MPEG-4 Part 10 (MPEG-4 AVC/H.264 or Advanced Video Coding, used by the x264 encoder, Nero Digital AVC, QuickTime 7, Flash Video, and high-definition video media like Blu-ray Disc).
