The Semantic Web


Semantic Web Cube

Many digital media resources are human-readable only, which cannot be processed automatically and are inefficient when searching for related information. A large share of datasets are isolated data silos that cannot be accessed publicly, are bound to a proprietary file format, and are not linked to one another. These limitation can be addressed by organizing and publishing data using powerful formats that add structure and meaning to the content and link related data to one another so that software agents can use the data more easily and efficiently.

For example, a typical web page contains structuring elements, formatted text, and some even multimedia objects. By default, the headings, texts, links, and other web site components created by the web designer are meaningless to computers. While browsers can display web documents based on the markup, only the human mind can interpret the meaning of information, so there is a huge gap between what computers and humans understand. Even if alternate text is specified for images (alt attribute with descriptive value on the img or figure elements), the data is not structured or linked to related data, and human-readable words of conventional web page paragraphs are not associated with any particular software syntax or structure. Without context, the information provided by web sites can be ambiguous to search engines.

In contrast to the conventional Web (the “Web of documents”), the Semantic Web includes the “Web of Data,” which connects “things” (representing real-world humans and objects) rather than documents meaningless to computers. The machine-readable datasets of the Semantic Web are used in a variety of web services, such as search engines, data integration, resource discovery and classification, cataloging, intelligent software agents, content rating, and intellectual property right descriptions, museum portals, community sites, podcasting, Big Data processing, business process modeling, and medical research. On the Semantic Web, data can be retrieved from seemingly unrelated fields automatically, in order to combine them, find relations, and make discoveries.

Note that the word semantic is used on the Web in other contexts as well. For example, in HTML5 there are semantic (in other words, meaningful) structuring elements, but this expression refers to the “meaning” of elements. In this context, the word semantic contrasts the “meaning” of elements, such as that of section (a thematic grouping), with the generic elements of older HTML versions, such as the “meaningless” div. The semantics of markup elements should not be confused with the semantics (in other words, machine-processability) of metadata annotations and web ontologies used on the Semantic Web. The latter can provide far more sophisticated data than the meaning of a markup element.

Main Components of the Semantic Web

Semantic Web Stack
The architecture of the Semantic Web

Core Semantic Web Features

  • Machine-interpretable, structured, interlinked, open access data repositories
  • Globally edited adaptive information resources
  • Unique web resource identifiers for every bit of information (e.g., each table data cell of a table has a unique identifier)

Further Reading