HTML: The structure of web pages

TLDR

This chapter covers the fundamentals of HTML as the structural foundation of web pages, explaining how it evolved from a language for sharing academic papers to the backbone of modern web applications. It describes HTML's basic syntax as an XML-based markup language consisting of nested tags, details essential elements like head and body, and explains how browsers handle URLs and render different types of media content.

HTML started as a portable markup language to share academic papers with other people on the same network. Browsers were co-developed as a way to (1) fetch those files, and (2) display them.

While the web has changed a lot since then, those core technologies haven’t.

A basic page

The most basic HTML page you could write looks something like this:


<!DOCTYPE html>
<html lang="en">
  <head>
    <title>Demo app</title>
  </head>
  <body>
    <p>Hello, world!</p>
  </body>
</html>

In this example, we’re creating a page that has a single paragraph, <p>, with the text “Hello, world!”.

HTML is a subset of XML, that is to say it encodes the document as a tree of tags; tags can be <opening>, </closing>, or <both />. Every opening tag must have a closing tag, and opening and self-closing tag (what we referred to as “both”) can have attributes that change the behavior of the tag (for example, <img width="800" /> is an image with a width of 800px). You can even define your own attributes if you prefix them with data- (this is used for scripting and is covered later).

A tag nested within another tag is referred to as a child, and the outer tag is referred to as the parent:


<parent>
  <child />
</parent>

Building on that, the entire nomenclature for family trees applies (there are grandparents, siblings, descendants, etc…)

HTML documents begin with a <!DOCTYPE html> declaration that tells the parser the document is HTML (much like a shebang in a scripting language), and consist of at most two children:

head: defining the metadata of the document (such as the tab’s title)
body: defining the contents of the document (such as the paragraph with the text hello world to show)

Tags, tags, tags

If you were to only know 2 other tags beyond <html>, <title>, and <body> they would definitely be:

<div>: an arbitrary block of content
<span>: a way to select text within a block

We’ll talk about both of these more in the CSS section.

There are other common tags to add functionality to the page: <img> for images, <video> for videos, <textarea> for long-form text-input, but we don’t need to understand those yet. There are also other tags that encode the purpose of the element: <main>, <article>, <header> (these act the same as a div, but allow bots and accessibility software to more easily read the webpage). My advice with these semantic elements is to use them where you can, but not to stress about them; there are still many, many websites that get along just fine with divs.

Try it yourself

To play with this further, you can use a service like CodeSandbox, which allows you to define and render an HTML file (and allows you to define CSS and JS files, but more on that later).

<!DOCTYPE html>
<html lang="en">
  <head>
    <title>Demo app</title>
  </head>
  <body>
    <p>Hello, world!</p>
  </body>
</html>

Open Sandbox

The browser and URLs

When you navigate to a URL like google.com/foo/bar.html, the browser requests the shared file foo/bar.html from the server at google.com; if you specify a folder path without a file, the browser will automatically request index.html on your behalf (specifically, the requests google.com/foo/ and google.com/foo/index.html are the same).

URLs can also contain query parameters, which are used to pass information to the server. For example, google.com/search?q=hello will pass “hello” to Google as an argument as it generates the search page.

While most of this is probably familiar to anyone who’s worked with web services before, one part that’s unique to the browser is the hashtag (#): an artifact of the browser’s origins as a document viewer. If there is a hashtag, the browser will scroll to the corresponding a element with that ID; for example, navigating to #try-it-yourself will scroll to the element <a id="try-it-yourself">.

Embedding media

In addition to HTML, browsers can render and embed a variety of media files to make documents more rich. For example, they can handle a variety of images:

JPEG: Lossy image files
PNG: Lossless image files
WEBP: Google-optimized image files
SVG: Vector files, structured as XML

They can also natively embed audio and video through MP4s.

<!DOCTYPE html>
<html lang="en">
  <head>
    <title>Demo app</title>
  </head>
  <body>
    <img src="https://placehold.co/80x80.png" />
  </body>
</html>

Open Sandbox