How does HotJar record your screen?

I was blown away when I first saw tools like HotJar. You could see everything your users were doing! The only thing missing was a secret recording of their webcam!

How does it work? Capturing mouse and keyboard events is easy, but how can you record exactly what the user sees? Browsers have a Screen Capture API, but these tools sure don’t use that. They need to work efficiently in the background without extra permissions.

PostHog session replay is a modern, open-source implementation. It uses rrweb, a library to “record and replay the web”. All the magic is in there.

Here’s a first attempt, which sends the DOM as HTML to your recording endpoint once per second:

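// A random ID that groups this session's snapshots together.
const sessionId = Math.random();

// Serialize the whole document to a string.
function snapshot() {
  return (new XMLSerializer()).serializeToString(document);
}

// POST the current snapshot to the recording endpoint.
function sendSnapshot() {
  fetch(`/recordings/${sessionId}`, {
    method: 'POST',
    body: snapshot()
  });
}

setInterval(sendSnapshot, 1000);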

Problems with this naive implementation:

  1. It doesn’t capture everything.
  2. It misses some changes, and captures the rest too late.
  3. It’s very inefficient (in CPU, network, and storage).

What other state is there to capture? The HTML has references to external resources, like images and stylesheets. To capture the image data, rrweb draws the image to a canvas. And to capture a stylesheet, we can consult Document.styleSheets. We also need the window.innerWidth and window.innerHeight, and the scroll offsets for anything with a scrollbar.
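
The extra state described above could be gathered roughly as follows. This is a minimal sketch, not rrweb's actual code: the function names `captureViewport`, `captureStylesheets`, and `imageToDataURL` are made up for illustration, and each takes the window or document as a parameter so it can be exercised in isolation.

```javascript
// Hypothetical helpers (names are illustrative, not part of rrweb).

// Viewport size and top-level scroll offsets.
function captureViewport(win) {
  return {
    width: win.innerWidth,
    height: win.innerHeight,
    scrollX: win.scrollX,
    scrollY: win.scrollY,
  };
}

// CSS text of every stylesheet we are allowed to read.
function captureStylesheets(doc) {
  const sheets = [];
  for (const sheet of Array.from(doc.styleSheets)) {
    try {
      const rules = Array.from(sheet.cssRules, (rule) => rule.cssText);
      sheets.push({ href: sheet.href, cssText: rules.join('\n') });
    } catch (err) {
      // Reading cssRules of a cross-origin stylesheet throws a SecurityError,
      // so record the URL only and leave the text empty.
      sheets.push({ href: sheet.href, cssText: null });
    }
  }
  return sheets;
}

// Pixel data of a loaded <img>, as a data: URL.
function imageToDataURL(img, doc) {
  const canvas = doc.createElement('canvas');
  canvas.width = img.naturalWidth;
  canvas.height = img.naturalHeight;
  canvas.getContext('2d').drawImage(img, 0, 0);
  // toDataURL throws if the image was cross-origin without CORS approval,
  // because the canvas is then "tainted".
  return canvas.toDataURL();
}
```

In a page you would call these as `captureViewport(window)`, `captureStylesheets(document)`, and `imageToDataURL(img, document)`. Scrollable elements would need the same treatment using their `scrollLeft` and `scrollTop`.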

To capture all changes instantly, we can use the MutationObserver API. This lets us replace setInterval with something like:

const observer = new MutationObserver(sendSnapshot);
observer.observe(
  document.documentElement,
  // characterData is needed to also catch edits to text nodes
  { attributes: true, childList: true, characterData: true, subtree: true }
);

Finally, we can make this more efficient by capturing changes rather than snapshots. The MutationObserver callback gets a list of MutationRecords. In theory, they can be applied to a snapshot to get an updated DOM. We can send these deltas to our recording API. We’ll also need to send any external resources that the updated nodes refer to.
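
A MutationRecord points at live DOM nodes, so it can't be sent over the network as-is. One way to handle this, sketched here under assumptions, is to give every node a numeric ID and describe each change in terms of those IDs. The names `nodeId` and `serializeMutation` and the shape of the delta objects are invented for this example and are not rrweb's real format.

```javascript
// Hypothetical sketch: turn MutationRecords into JSON-friendly deltas.
const nodeIds = new WeakMap();
let nextNodeId = 1;

// Stable numeric ID for a node (null stays null).
function nodeId(node) {
  if (node == null) return null;
  if (!nodeIds.has(node)) nodeIds.set(node, nextNodeId++);
  return nodeIds.get(node);
}

function serializeMutation(record) {
  const target = nodeId(record.target);
  switch (record.type) {
    case 'attributes':
      return {
        type: 'attributes',
        target,
        name: record.attributeName,
        // null means the attribute was removed.
        value: record.target.getAttribute(record.attributeName),
      };
    case 'characterData':
      return { type: 'characterData', target, text: record.target.data };
    case 'childList':
      return {
        type: 'childList',
        target,
        removed: Array.from(record.removedNodes, nodeId),
        added: Array.from(record.addedNodes, (node) => ({
          id: nodeId(node),
          // Elements have outerHTML; text and comment nodes only have textContent.
          html: node.outerHTML ?? node.textContent,
        })),
        // ID of the sibling the new nodes were inserted before, if any.
        before: nodeId(record.nextSibling),
      };
    default:
      throw new Error(`Unknown mutation type: ${record.type}`);
  }
}
```

Inside the observer callback, `records.map(serializeMutation)` would then give an array that can be passed to `JSON.stringify` and POSTed. For the replayer to resolve the IDs, the initial snapshot would have to carry the same node-to-ID mapping, which this sketch leaves out.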

Tagged #programming, #web.


This page copyright James Fisher 2024. Content is not associated with my employer.