{"body":{"version":"https://jsonfeed.org/version/1.1","title":"Daniël Illouz","description":"Writing about things I learned and find interesting","home_page_url":"https://www.danillouz.dev","feed_url":"https://www.danillouz.dev/posts.json","favicon":"https://www.danillouz.dev/favicon-32.png","authors":[{"name":"Daniël Illouz"}],"language":"en-US","items":[{"id":"https://www.danillouz.dev/posts/sqlite-cli/","url":"https://www.danillouz.dev/posts/sqlite-cli/","title":"SQLite CLI","summary":"Learning about the SQLite Command Line Interface.","content_html":"
SQLite provides a Command Line Interface (CLI) program named sqlite3, and it's already installed on most operating systems.
The CLI can be run with or without command line options (flags).
\nWhen a flag is provided, it must be prefixed with - or --. For example, -version and --version do the same thing:
sqlite3 -version\n
\nWhen sqlite3 is run without flags, it will connect to a temporary in-memory database (which will be deleted on exit) in interactive mode:
sqlite3\n\nSQLite version 3.37.0 2021-12-09 01:34:53\nEnter \".help\" for usage hints.\nConnected to a transient in-memory database.\nUse \".open FILENAME\" to reopen on a persistent database.\n\nsqlite>\n
\nWhen in interactive mode, the prompt is sqlite> and it reads text input from the keyboard: SQL statements, and dot commands like .open (where some dot commands also accept flags). But it's also possible to redirect sqlite3 I/O (input/output) to files or other commands.
To see how to use the CLI (and print all available CLI flags):
\nsqlite3 -help\n
\nTo print all available dot commands (in interactive mode):
\nsqlite> .help\n
\nTo see how to use a dot command (in interactive mode), and print available dot command flags, run .help DOT_COMMAND. For example:
sqlite> .help .import\n
\nWhen a filename is provided to the sqlite3 command, it will either create a new database or open an existing database in interactive mode:
sqlite3 mydb\n
\nIn interactive mode, a connection to a new or existing database can always be created via the .open dot command. And to connect to a temporary in-memory database, use :memory: as the database file name.
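\nFor example, to switch the current interactive session to a fresh in-memory database:
\nsqlite> .open :memory:\n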
To destroy any data in an existing database run .open -new FILENAME. For example:
sqlite> .open -new existingdb\n
\nTo open a database in read-only mode use the -readonly flag:
sqlite3 -readonly mydb\n
\nThis also works in interactive mode:
\nsqlite> .open -readonly myotherdb\n
\nTo see all databases in interactive mode:
\nsqlite> .databases\n
\nTo see all tables (including attached databases) in interactive mode:
\nsqlite> .tables\n
\nTo see all indexes in interactive mode:
\nsqlite> .indexes\nsqlite> .indexes tablename\n
\nTo see the complete schema of the database (including attached databases) in interactive mode:
\nsqlite> .schema\nsqlite> .schema tablename\n
\nIn interactive mode the .read dot command can be used to read SQL statements (and dot commands) from a file:
sqlite> .read script.sql\n
\nIf the argument to .read begins with the pipe symbol (|), then instead of opening the argument as a file, it runs the argument as a command, and uses the output of that command as its input. This can be useful to run scripts that generate SQL.
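\nFor example, a quick sketch that executes the SQL printed by a (hypothetical) generate-inserts.sh script:
\nsqlite> .read |./generate-inserts.sh\n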
By default sqlite3 sends all output to \"standard output\", but this can be changed via the .output and .once dot commands in interactive mode.
To output all query results to a file:
\nsqlite> .mode list\nsqlite> .separator ,\nsqlite> .output books_and_authors.txt\nsqlite>\nsqlite> SELECT * FROM books;\nsqlite> SELECT * FROM authors;\nsqlite>\nsqlite> .exit\n
\nTo do the above just once, use the .once dot command instead.
If the argument to .output or .once begins with the pipe symbol (|), then it runs the argument as a command, and the output is sent to that command.
For example:
\nsqlite> .once | open -f\nsqlite> SELECT * FROM books;\n
\nThe readfile() function loads file content as a BLOB in interactive mode. For example:
sqlite> CREATE TABLE images(\nsqlite> name TEXT,\nsqlite> type TEXT,\nsqlite> img BLOB\nsqlite> );\nsqlite>\nsqlite> INSERT INTO images(name,type,img)\nsqlite> VALUES('icon','png',readfile('icon.png'));\n
\nThe writefile() function writes a column value to a file in interactive mode. For example:
sqlite> SELECT writefile('icon.png',img) FROM images WHERE name='icon';\n
\nTo import a CSV file into a table in interactive mode:
\nsqlite> .import -csv file.csv tablename\n
\nAnd to import into a table that's not part of the \"main\" database, the -schema flag can be used. This specifies that the table is part of another \"schema\" (useful for attached databases or to import into a temporary table).
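\nFor example, a sketch that imports a CSV file into a temporary table (file.csv and temptable are placeholder names; temp is the schema name SQLite uses for temporary tables):
\nsqlite> .import -csv -schema temp file.csv temptable\n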
To export results to a CSV file in interactive mode:
\nsqlite> .headers on\nsqlite> .mode csv\nsqlite> .once ~/data.csv\nsqlite>\nsqlite> SELECT * FROM tablename;\nsqlite>\nsqlite> .exit\n
\nDump (converts entire database content into a single UTF-8 text file):
\nsqlite3 mydb .dump | gzip -c > mydb.dump.gz\n
\nRestore:
\nzcat mydb.dump.gz | sqlite3 mydb\n
\nAn .sqliterc resource file can be created in the \"home directory\" to configure dot command settings. For example, to change the output format for all queries:
.mode box\n
\nAfter creating the .sqliterc file, it will be loaded on startup:
sqlite3 mydb\n\n-- Loading resources from /Users/daniel/.sqliterc\nSQLite version 3.37.0 2021-12-09 01:34:53\nEnter \".help\" for usage hints.\n\nsqlite>\n
\nIt's possible to \"bypass\" interactive mode and run SQL statements directly when using the sqlite3
command via the last argument:
sqlite3 mydb \"SELECT * FROM table;\"\n
\nAnd by using CLI flags like -cmd it's possible to shorten certain actions.
sqlite3 -csv -cmd \".import ~/data.csv data\" :memory: \"SELECT * FROM data;\"\n
\nsqlite3 -csv -header mydb \"SELECT * FROM books;\" > ~/books.csv\n
\n","date_published":"2023-07-15T00:00:00.000Z","date_modified":"2023-07-16T00:00:00.000Z","tags":["cli","sqlite"]},{"id":"https://www.danillouz.dev/posts/caddy-ca/","url":"https://www.danillouz.dev/posts/caddy-ca/","title":"Caddy local CA","summary":"Firefox does not recognize Caddy's local Certificate Authority by default.","content_html":"import { Image } from \"astro:assets\"
\nWhen running Caddy locally, it will also generate its own local Certificate Authority (CA). Caddy will use this CA to sign certificates for local HTTPS.
\nThis is pretty cool! But Caddy's local HTTPS does not work in Firefox by default. When running Caddy on localhost, Firefox will show the error code SEC_ERROR_UNKNOWN_ISSUER when visiting https://localhost (other browsers like Safari don't have this issue).
Turns out that Firefox does not recognize Caddy's local CA by default. And you have to manually import Caddy's local root certificate into Firefox.
\n1. Open Firefox and go to about:preferences#privacy.
2. Scroll down to the Security > Certificates section, and click View Certificates.
3. Go to the Authorities tab, and click Import.
4. Select Caddy's local root certificate, which on macOS is located at ~/Library/Application\\ Support/Caddy/pki/authorities/local/root.crt.
5. Check the Trust this CA to identify websites checkbox, and click OK.
Caddy Local Authority should now be listed in the Authorities tab.
\nI recently started using Obsidian and I like it a lot! One thing I was missing though, was a way to quickly save (i.e. \"clip\") a webpage to Obsidian from my browser. So I was happy to find Stephan Ango's Obsidian web clipper, which does just that (thanks Stephan!).
\nStephan's web clipper works pretty well, but I wanted slightly different behavior. And since the web clipper is an open source bookmarklet, it was easy for me to modify.
\nA bookmarklet is a browser bookmark that runs some JavaScript code every time you click it.
\nYou can create a bookmarklet by creating a new bookmark in your browser, but instead of providing a link to a website, you give it a javascript URI. For example:
javascript: alert(\"Go eat ice cream!\")\n
\nSo the bookmarklet above would show a \"Go eat ice cream!\" alert every time you click it (and I highly recommend you install it).
\nMy version of the bookmarklet is based on Stephan Ango's Obsidian web clipper, so it does pretty much the same thing, but with these differences: clippings of entire webpages and clippings of text selections are stored in separate Obsidian folders, named Clippings and Clippings/Quotes respectively.
Drag this link to your bookmarks: Clip to Obsidian.
Visit a webpage:
\na. To clip an entire webpage: click the bookmark.
\nb. To only clip part of a webpage: first select some text (can include images), then click the bookmark.
\nFeel free to remix the code below. And after changing the code, you can turn it into a bookmarklet with Make Bookmarklets.
\n/**\n * Obsidian web clipper (bookmarklet).\n *\n * Based on Stephan Ango's \"Obsidian web clipper\".\n * @see {@link https://stephanango.com/obsidian-web-clipper}\n *\n * Uses jsDelivr to import npm dependencies as ESM modules.\n * @see {@link https://www.jsdelivr.com/?docs=esm}\n *\n * Made into a bookmarklet with \"Make Bookmarklets\".\n * @see {@link https://make-bookmarklets.com/}\n */\nPromise.all([\n // Dependencies.\n import(\"https://cdn.jsdelivr.net/npm/@mozilla/readability/+esm\"),\n import(\"https://cdn.jsdelivr.net/npm/turndown/+esm\"),\n import(\n \"https://cdn.jsdelivr.net/npm/text-fragments-polyfill/dist/fragment-generation-utils.js/+esm\"\n ),\n\n // Config.\n Promise.resolve({\n // Clippings of entire webpages will be stored as separate notes in\n // this Obsidian folder.\n folderName: \"Clippings\",\n\n // Clippings of selections will be stored in this Obsidian folder,\n // where clippings of the same webpage will be appended to the same\n // Obsidian note.\n selectionFolderName: \"Clippings/Quotes\",\n }),\n])\n .then(([readabilityJs, turndownJs, textFragmentsPolyfillJs, config]) => {\n const { Readability } = readabilityJs.default\n const { default: Turndown } = turndownJs\n const { generateFragment } = textFragmentsPolyfillJs\n\n const selection = _getSelection(generateFragment)\n\n /**\n * Readability removes clutter from web pages.\n * It's the same library that's used in Firefox's Reader View.\n * @see {@link https://www.npmjs.com/package/@mozilla/readability}\n */\n const {\n byline: author,\n content,\n excerpt,\n title,\n } = new Readability(window.document.cloneNode(true)).parse()\n\n /**\n * Converts HTML to Markdown.\n * @see {@link https://www.npmjs.com/package/turndown}\n */\n const markdown = new Turndown({\n headingStyle: \"atx\",\n hr: \"---\",\n bulletListMarker: \"-\",\n codeBlockStyle: \"fenced\",\n }).turndown(selection.html || content)\n\n const obsidianContent = _makeObsidianNoteContent({\n author,\n body: markdown,\n excerpt,\n selection,\n title,\n url: window.document.URL,\n })\n\n const obsidianUri = _makeObsidianUri({\n config,\n content: obsidianContent,\n selection,\n title,\n })\n\n window.document.location.href = obsidianUri\n })\n .catch((error) => {\n alert(\n \"Failed to clip to Obsidian\" +\n \"\\n\\n\" +\n error +\n \"\\n\\n\" +\n \"(see the browser developer console for more details)\"\n )\n })\n\nfunction _getSelection(generateFragmentFn) {\n if (typeof window.getSelection === \"undefined\") {\n return {\n hasSelection: false,\n html: \"\",\n textFragment: \"\",\n }\n }\n\n const sel = window.getSelection()\n if (!sel || sel.rangeCount < 1) {\n return {\n hasSelection: false,\n html: \"\",\n textFragment: \"\",\n }\n }\n\n const { status, fragment } = generateFragmentFn(sel)\n const textFragment = _makeTextFragmentDirective(status, fragment)\n const container = window.document.createElement(\"div\")\n for (let i = 0, len = sel.rangeCount; i < len; ++i) {\n container.appendChild(sel.getRangeAt(i).cloneContents())\n }\n const html = container.innerHTML\n return {\n hasSelection: Boolean(html),\n html,\n textFragment,\n }\n}\n\n/**\n * Makes the text fragment directive to highlight a text selection.\n *\n * Only Chromium/Safari browsers support text fragments.\n * @see {@link https://web.dev/text-fragments/}\n *\n * But a browser extension can be installed to polyfill the functionality.\n * @see {@link https://github.com/GoogleChromeLabs/link-to-text-fragment}\n */\nfunction _makeTextFragmentDirective(status, fragment) {\n if (status 
!== 0) {\n /**\n * Non-0 status means error.\n * @see {@link https://github.com/GoogleChromeLabs/link-to-text-fragment/blob/main/fragment-generation-utils.js#L779}\n */\n return \"\"\n }\n\n const prefix = fragment.prefix\n ? `${encodeURIComponent(fragment.prefix)}-,`\n : \"\"\n const suffix = fragment.suffix\n ? `,-${encodeURIComponent(fragment.suffix)}`\n : \"\"\n const start = encodeURIComponent(fragment.textStart)\n const end = fragment.textEnd ? `,${encodeURIComponent(fragment.textEnd)}` : \"\"\n\n return `#:~:text=${prefix}${start}${end}${suffix}`\n}\n\n/**\n * Makes the Obsidian note content.\n *\n * For webpage clippings only (i.e. not webpage selection clippings):\n *\n * Uses YAML front matter to add metadata about the clipping to the note.\n * @see {@link https://help.obsidian.md/Editing+and+formatting/Metadata}\n *\n * Uses a comment to link to a daily note (you can't link to other notes\n * in the front matter).\n * @see {@link https://help.obsidian.md/Editing+and+formatting/Basic+formatting+syntax#Comments}\n */\nfunction _makeObsidianNoteContent({\n author,\n body,\n excerpt,\n selection,\n title,\n url,\n}) {\n // NOTE: I'm stripping the query/hash params because I just need to\n // link back to the clipping source. But it could be that a page is\n // using those params to show specific content, which will be \"lost\"\n // after stripping. So might have to revisit this..\n let cleanUrl = new URL(url)\n cleanUrl.search = \"\"\n cleanUrl.hash = \"\"\n cleanUrl = cleanUrl.toString()\n\n const now = new Date()\n\n if (selection.hasSelection) {\n /**\n * `locales` is set to `undefined` to use the default locale.\n * @see {@link https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Date/toLocaleDateString}\n */\n const prettyDate = now.toLocaleDateString(undefined, {\n weekday: \"short\",\n year: \"numeric\",\n month: \"short\",\n day: \"numeric\",\n hour: \"numeric\",\n minute: \"numeric\",\n })\n return `> [!quote] ${prettyDate} • [Source](${cleanUrl}${selection.textFragment})\n\n${body}\n\n---\n\n`\n } else {\n const summary = excerpt !== title ? excerpt : \"\"\n const [yyyy_mm_dd] = now.toISOString().split(\"T\")\n const titleLink = `[${title}](${cleanUrl})`\n return `---\naliases:\nclipping_author: ${author ? author : \"\"}\nclipping_url: ${cleanUrl}\ncreated_at_unix: ${Math.round(Date.now() / 1000)} \nsummary: ${summary}\n---\n\n%%\ndates:: [[${yyyy_mm_dd}]]\nrelated::\n%%\n\n#clipping\n\n> [!info]\n> ${titleLink}${author ? \" by \" + author : \"\"}\n\n${body}\n`\n }\n}\n\n/**\n * Makes the Obsidian URI to create/append to a note.\n * @see {@link https://help.obsidian.md/Advanced+topics/Using+Obsidian+URI#Action+%60new%60}\n */\nfunction _makeObsidianUri({ config, content, selection, title }) {\n const folderName = selection.hasSelection\n ? 
config.selectionFolderName\n : config.folderName\n\n // NOTE: characters \":\", \"/\" and \"\\\" are not allowed in file names.\n const fileName = title\n .replace(/:/g, \"\")\n .replace(/\\//g, \"-\")\n .replace(/\\\\/g, \"-\")\n\n const query = {\n content,\n file: `${folderName}/${fileName}`,\n }\n\n if (selection.hasSelection) {\n // NOTE: \"boolean\" params trigger with any truthy value, like\n // `append=false`.\n query.append = \"true\"\n }\n\n // NOTE: URLSearchParams().toString() encoding leads to unexpected\n // behavior, so use `encodeURIComponent()` instead.\n const queryString = Object.entries(query)\n .map(([k, v]) => `${k}=${encodeURIComponent(v)}`)\n .join(\"&\")\n\n return `obsidian://new?${queryString}`\n}\n
\n","date_published":"2023-06-18T00:00:00.000Z","tags":["bookmarklet","obsidian","web-clipper"]},{"id":"https://www.danillouz.dev/posts/example-com/","url":"https://www.danillouz.dev/posts/example-com/","title":"example.com","summary":"IANA reserved domains.","content_html":"There are a few domains that are reserved by IANA. Reserved means that these domains can't be registered by anyone, and can't be transferred. One of them is example.com.
\nSuch domains are sometimes called special use domain names, and the full list can be found here.
\nThe most notable special TLDs are:
\ntest
example
invalid
localhost
RFC 2606 specifies best practices on how to use the special domains. It recommends the following: use test for testing (e.g. DNS related code and configuration), example for documentation and examples, invalid for constructing domain names that are certain to be invalid, and localhost for the loopback address.
\n\n\nThere's also RFC 6761 with more information, like how DNS servers should handle these domains.
\n
Special domains guarantee deterministic behavior in tests and documentation.
\nLet's say I make up a domain for (local) testing, where I expect certain behavior (e.g. it must resolve, or it must fail). It could happen that at some point the domain becomes available, gets registered, and now my test will behave unexpectedly.
\nThis is for example what happened with the dev TLD! It was sometimes used for testing (locally), but then Google \"bought\" it.
\nMy understanding of DNS was always pretty basic. But since I started working more with hosting infrastructure, I've learned a lot more about it. I think DNS is really cool, but it is complicated. DNS has a lot of moving parts and terminology you need to know about to really get it. So I decided to write a bit about this. Mostly to capture and solidify my learnings, but maybe it can also be useful to others.
\nIn this post I'll cover what problem DNS solves, what DNS is, and how DNS works when you visit a website in your browser. This post gets a bit technical at times, but I try not to assume any prior knowledge, so you can (hopefully) also follow along if you're new to the topic.
\nThe internet is a massive system of interconnected computer networks. And devices connected to this network communicate with each other by sending \"packets of data\". But to make sure that these packets are routed to the correct destination, a protocol must be followed.
\nWhat's a protocol? It's basically a set of rules that need to be followed to achieve \"something\".
\nFor example, to mail a letter, the protocol is that you must:
\nBut if one of these rules is broken (e.g. because phone numbers were used for rule 2 above) the letter will not be delivered.
\nIt's a bit like that on the internet, but the protocol that's used is called the Internet Protocol (IP). And instead of using mail addresses to deliver mail to the correct destination, IP addresses must be used to deliver packets of data to the correct destination[^1].
\n[^1]: The IP protocol is basically the addressing system of the internet, but there's more needed to deliver packets from source to destination. The exact details are out of scope for this post, but there's also a transport protocol needed to define rules how data is sent and received. Ultimately there are multiple protocols needed which are \"layered\" on top of each other, like TCP/IP.
\nIP addresses are unique identifiers. For example, if a device wants to visit this website it must (at the time of this writing) go to the IP address 76.76.21.21[^2].
[^2]: This is an IPv4 address, and IP version 4 has been around since 1983. It works great, but we're running out of unique IPv4 addresses because nowadays even toasters must connect to the internet. This is where IPv6 comes in: IPv6 uses more characters to make sure all toasters are covered! For example, 2606:4700::6810:84e5 is an IPv6 address. But IPv6 is not completely adopted yet, so it's still common to use IPv4.
IP addresses work great for machines and robots because they love numbers. But us humans usually have difficulty remembering them, and we prefer using a more memorable domain name instead.
\nBut on the internet IP addresses must be used, so how can you for example type a domain name in a web browser, and somehow still end up at the correct IP address? Well, this is the main problem that DNS solves: DNS can look up the IP address of a domain name.
\nPractically speaking, DNS is like a phone book[^3] for the internet.
\n[^3]: In case you don't know, a phone book is literally a book of phone numbers. And a long time ago, they were used to find a phone number for a person or business when you only knew their name. (Yes, people would actually call each other!)
\nTechnically speaking, DNS is a distributed naming system that consists of many servers spread across the globe.
\nI like to think about DNS as a very large partitioned database that organizes, stores, and retrieves information about domain names. To do all of this, DNS has the following main components:
\nThe domain name space is a conceptual model that organizes all domain names on the internet, and it can be visualized as a hierarchical structure that looks like a tree.
\nThis hierarchy is reflected in domain names themselves: each part of a domain name separated by a . (dot) is called a label, and every domain name ends with a . (dot), also called the root domain[^4].
[^4]: The root domain is typically not specified. For example, you'd usually type github.com in your browser instead of github.com. (note the trailing dot). But you can absolutely do this! And when you do explicitly provide the root, the domain name is referred to as a Fully Qualified Domain Name (FQDN).
For example, the labels of the domain names:
\nwww.framer.com
github.com
www.danillouz.dev
en.wikipedia.org
Can be visualized in the domain name space like this:
\n\nBy following the tree of the domain name space from top-to-bottom, the labels of a domain name go from most generic (.
) to most specific (e.g. www
). And depending on what \"level\" these labels sit in the tree, they are referred to differently.
When reading a domain name from left-to-right, the labels go from most specific to most generic: the very last label is the top-level domain (TLD)[^5], like com or org, or a country code TLD like uk or nl. The label before the TLD is the second-level domain (2LD), the one before that the third-level domain (3LD), and so on.
[^5]: There are 6 types of TLDs: country code (ccTLD), generic (gTLD), generic restricted (grTLD), infrastructure (ARPA), sponsored (sTLD), and test (tTLD) top-level domains.
\nFor example, for the domain name www.bbc.co.uk:
uk is the TLD (ccTLD).
co is the 2LD.
bbc is the 3LD.
www is the 4LD.
Each label in the domain name space will usually have some information associated with it (e.g. an IP address). This information is stored in text files called resource records (usually called DNS records), and DNS servers that store resource records are called name servers.
\nThere are different kinds of resource records, and I won't cover all of them in this post, but 3 important ones are:
\nName servers are grouped together into DNS zones and each zone has an operator: an organization responsible for managing a specific part of the domain name space.
\nDNS zones usually don't map to domain names or DNS servers exactly, so they can be a bit ambiguous. But they will usually map to level(s) of the domain name space tree (like the root zone, but more on that later). This means that zones (i.e. name servers) only store parts of the information in the domain name space.
\nI like to think about DNS zones as partitions of the entire database. DNS needs to store a lot of information (and make it globally available), so it splits up its database into zones. And to make sure the system as a whole scales and runs reliably, each zone has an operator that's responsible for it.
\nName servers only store part of the domain name space, so how can DNS retrieve information for every name in the domain name space? Well, most name servers just point to other name servers, and it's up to a different kind of DNS server called a resolver (also called a recursor) to follow these \"pointers\" and retrieve resource records.
\nSo far we've covered the main components of DNS, but to understand how it works we first need to explicitly identify the different kind of DNS servers and how they interact with each other.
\nThere are 4 different kind of servers needed to make DNS work:
\n[^6]: There are 13 clusters of hundreds of physical DNS root servers, distributed all over the globe. And you can see them (and their location) on root-servers.org.\n[^7]: But you can change this in the network settings of your operating system and use a different resolver, like Cloudflare's 1.1.1.1 or Google's 8.8.8.8.
\n\n\nI used to be really confused about what authoritative name servers are, and how they differ from other name servers. But an authoritative name server is just the name server that \"knows\" the information being queried by a resolver. So it actually depends on the query type which name server is authoritative. For example, root name servers are authoritative for the root zone, TLD name servers are authoritative for a TLD zone, and when querying the A record for a domain name, the name server that stores the IPv4 address is authoritative.
\n
With that covered, we can finally answer the question below.
\nThe following occurs when a browser uses DNS to look up the IP address of a domain name:
\n\n\nNote that the steps above happen for uncached queries. Since there can be a lot steps needed to look up information for a domain name, resolvers will cache the results of queries. So when a query is made for a domain name that was recently looked up, the resolver can skip (some of) the steps above and return the cached result immediately. Caching can happen at every step above, on the name servers, resolver, on the browser and operating system.
\n
We now know that DNS is basically a very large database that's split up into zones, and that zones are managed by operators. But how do operators work together? How do operators know about changes that occur in the domain name space (like when a new domain name is registered)? And who oversees all of this?
\nICANN (Internet Corporation for Assigned Names and Numbers) and IANA (Internet Assigned Numbers Authority) are 2 organizations that help provide stability and consistency on the internet.
\nICANN helps with administration, oversight and maintenance. But delegates some of this to IANA (which is part of ICANN).
\nFor example, ICANN helps make technical decisions on the internet, coordinates adding new TLDs, and operates 1 of the 13 DNS root name servers. While IANA maintains what protocols are used on the internet, coordinates IP addresses globally, and manages the DNS root zone.
\nBesides the root zone database managed by IANA, there are also organizations that manage a database of all 2LDs with a specific TLD. These organizations are called registry operators[^8]. Strictly speaking, the databases they maintain are called registries, but often the operator itself will also be referred to as the registry.
\n[^8]: Registry operators (or registries) are sometimes also called a Network Information Center (NIC).
\nThis means each TLD has a registry. For example, Verisign is the registry for .com
domain names, and Google Registry is the registry for .dev
domain names. Verisign and Google actually manage multiple TLDs, but there are also registries that manage a single TLD.
So how do these registries know about (new) domain names? Well, some registries allow you to directly register a domain name with them. But most registries will partner with a different organization called a domain name registrar.
\nDomain name registrars are companies that allow you to register domain names by paying them a fee. When you register a domain name, you don't actually buy the domain name. But you will hold the \"right\" to use it for a specific amount of time. You then become the registrant of the domain name and will be considered the \"owner\" of it.
\nRegistries allow registrars to partner with them by entering a Registry-Registrar Agreement. But in order to do so, the registrar must meet the requirements[^9] set by the registry (and ICANN). After the agreement is in place, the registrar may offer their customers to register domain names for the specific TLD(s). And every time a domain name is registered, renewed, transferred, or expires, the registrar will notify[^10] the registry--where for some operations registrars also pay registries (and ICANN) a fee[^11].
\n[^9]: These requirements can differ per registry (and some make them available online). For example, most registries require the registrar to be accredited by ICANN. And sometimes registries even set rules that affect which registrants may register a domain name for their TLD (e.g. only US governments may register a .gov
domain name).\n[^10]: Registrars usually use the Extensible Provisioning Protocol (EPP) to interact with registries.\n[^11]: The registrar must pay fees every time a domain name is registered, renewed or transferred. There's the registry fee (as defined in the Registry-Registrant Agreement). And the $0.18 ICANN fee. But there might also be other fees, like a yearly fee of $4000 when the registry is ICANN accredited.
That's it for now! Everything covered in this post is pretty theoretical, and one way to see DNS in action is to query resource records with dig. But I'll save that for another post.
\nBy the way, these are some resources I used to learn more about DNS:
\n(and let me know if you have any good ones to add!)
\n","date_published":"2023-06-03T00:00:00.000Z","tags":["dns","dns-zones","domain-names","iana","icann","internet","ip","name-servers","registrars","registries","resolvers","resource-records","subdomains","tld"]},{"id":"https://www.danillouz.dev/posts/mastodon-alias/","url":"https://www.danillouz.dev/posts/mastodon-alias/","title":"Aliasing your Mastodon handle","summary":"Using a custom domain to alias your Mastodon handle.","content_html":"import { Image } from \"astro:assets\"
\nI'm not very active on social media, but I recently created a Mastodon account.
\nI'm still learning about the fediverse. So I was reading the docs a bit, and that's when I stumbled upon WebFinger.
\nI'd never heard of it before, but Mastodon uses WebFinger to figure out the location of an account. So it can for example resolve the account danillouz@mastodon.social to the location https://mastodon.social/@danillouz.
This location information is returned by a WebFinger endpoint. And this made me wonder. Could my site, hosted on a custom domain, return this information as well, so that I could use my custom domain as an \"alias\" for my Mastodon handle? Turns out you can! But there are some caveats.
\nWebFinger is a protocol[^1] that allows information about people or entities to be discovered over HTTP. It basically resolves some sort of URI identifier (like an email address, Mastodon account, or phone number) to a location (i.e. a URL), which can be retrieved by making a WebFinger request.
\n[^1]: RFC 7033 describes the WebFinger protocol.
\nA WebFinger request is an HTTP GET request to a resource. The resource is a well-known URI with a query target. And the query target identifies the entity to get the location for, which is specified via the ?resource= query parameter in the request. The endpoint then returns the location information as JSON.
For example, to get WebFinger information for the Mastodon account danillouz@mastodon.social[^2], you need to make the following request:
[^2]: Mastodon uses the acct: URI scheme as described in RFC 7565.
GET /.well-known/webfinger?resource=acct:danillouz@mastodon.social\nHOST: mastodon.social\n
\n200 OK\nContent-Type: application/json\n\n{\n \"subject\": \"acct:danillouz@mastodon.social\",\n \"aliases\": [\n \"https://mastodon.social/@danillouz\",\n \"https://mastodon.social/users/danillouz\"\n ],\n \"links\": [\n {\n \"rel\": \"http://webfinger.net/rel/profile-page\",\n \"type\": \"text/html\",\n \"href\": \"https://mastodon.social/@danillouz\"\n },\n {\n \"rel\": \"self\",\n \"type\": \"application/activity+json\",\n \"href\": \"https://mastodon.social/users/danillouz\"\n },\n {\n \"rel\": \"http://ostatus.org/schema/1.0/subscribe\",\n \"template\": \"https://mastodon.social/authorize_interaction?uri={uri}\"\n }\n ]\n}\n
\nYou can replace the Mastodon domain and username with your own, to get your information instead:
\nhttps://{MASTODON_DOMAIN}/.well-known/webfinger?resource=acct:{MASTODON_USERNAME}\n
\nOn Mastodon, users have accounts on different servers. Like mastodon.social or mas.to. So even though the handles danillouz@mastodon.social and danillouz@mas.to share the same \"local\" username danillouz, they are different accounts.
And from what I understand, Mastodon's internal implementation can't just use the account handle. It requires the location (provided by WebFinger) to convert an account to a user on its server for things like mentions and search to work.
\nThe RFC mentions that WebFinger information is static:
\n<blockquote>\n<p>The information is intended to be static in nature, and, as such, WebFinger is not intended to be used to return dynamic information like the temperature of a CPU or the current toner level in a laser printer.</p>
\n<cite>\n<p>RFC 7033: Introduction</p>\n</cite>\n</blockquote>
\nSo if you can host some static JSON on your custom domain, you can add a WebFinger endpoint.
\nYou can do this by serving your WebFinger information as static JSON, returned whenever a GET request is made to /.well-known/webfinger?resource=acct:{MASTODON_USERNAME} on your custom domain.[^3]
[^3]: The WebFinger RFC mentions that the Content-Type of a WebFinger response should be application/jrd+json. But it looks like using application/json also works.
With that in place, your custom domain can be used to find your Mastodon account.
\nI'm using Astro, so I just added a static file endpoint:
\nimport type { APIRoute } from \"astro\"\n\nconst MASTODON_USERNAME = \"danillouz\"\nconst MASTODON_DOMAIN = \"mastodon.social\"\n\nexport const GET: APIRoute = async function ({ params, request }) {\n return new Response(\n JSON.stringify({\n body: JSON.stringify({\n subject: `acct:${MASTODON_USERNAME}@${MASTODON_DOMAIN}`,\n aliases: [\n `https://${MASTODON_DOMAIN}/@${MASTODON_USERNAME}`,\n `https://${MASTODON_DOMAIN}/users/${MASTODON_USERNAME}`,\n ],\n links: [\n {\n rel: \"http://webfinger.net/rel/profile-page\",\n type: \"text/html\",\n href: `https://${MASTODON_DOMAIN}/@${MASTODON_USERNAME}`,\n },\n {\n rel: \"self\",\n type: \"application/activity+json\",\n href: `https://${MASTODON_DOMAIN}/users/${MASTODON_USERNAME}`,\n },\n {\n rel: \"http://ostatus.org/schema/1.0/subscribe\",\n template: `https://${MASTODON_DOMAIN}/authorize_interaction?uri={uri}`,\n },\n ],\n }),\n })\n )\n}\n
\nNote that the static JSON endpoint I added will only serve the WebFinger information when making the request:
\nGET /.well-known/webfinger.json\nHost: www.danillouz.dev\n
\nBut Mastodon will actually make the following request:
\nGET /.well-known/webfinger?resource=acct:danillouz@mastodon.social\nHost: www.danillouz.dev\n
\nSince I have just one Mastodon account, I chose to just ignore the ?resource= query parameter, and redirect all requests from /.well-known/webfinger to /.well-known/webfinger.json.
I'm using Vercel, which supports redirects. So I can achieve the desired redirect by adding the following rule:
\n{\n \"redirects\": [\n {\n \"source\": \"/.well-known/webfinger\",\n \"destination\": \"/.well-known/webfinger.json\"\n }\n ]\n}\n
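\nAfter deploying, a quick way to check the endpoint is to request the well-known path directly (the resource value doesn't matter much here, since the redirect ignores it):
\ncurl \"https://www.danillouz.dev/.well-known/webfinger?resource=acct:hi@danillouz.dev\"\n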
\nNow that my custom domain has a WebFinger endpoint, I can find my Mastodon account by using my custom domain!
\nFor example, searching for hi@danillouz.dev will now give me a hit.
I'm not sure to be honest.
\nLike mentioned before, Mastodon is a bit different, where an account handle consists of two parts:
\ndanillouz
.mastodon.social
.And the docs mention that you should include the server domain when sharing your handle with other people, because otherwise they won't be able to find you easily:
\n<blockquote>\n<p>Mastodon allows you to skip the second part when addressing people on the same server as you, but you have to keep in mind when sharing your username with other people, you need to include the domain or they won't be able to find you as easily.</p>
\n<cite>\n<p>Mastodon docs: Your username and your domain</p>\n</cite>\n</blockquote>
\nSo in theory, setting up an alias allows you to create a handle that does not change when migrating to a different Mastodon server. And it might make your account easier to find if people know your custom domain.
\nIt's also pretty cool that with the alias you can have a Mastodon handle that includes your custom domain without needing to host your own Mastodon server. But practically speaking, searching for just the local username on different servers also works as far as I can tell.
\nWhen the docs mentioned that you should include the server domain when sharing your handle (because otherwise people won't be able to find you easily) I thought this meant that someone would always have to search for danillouz@mastodon.social on servers other than mastodon.social to find me. But this doesn't appear to be the case. For example, I can search for danillouz on mas.to, and it will find me.
So maybe, aliasing your handle isn't really a good idea?
\nI'm not sure if having an \"extra\" WebFinger endpoint can actually break stuff (can information become stale?). But there are some caveats when using your custom domain as an alias to be aware of.
\nThere might be more, but these are the ones I encountered.
\nI created my account on mastodon.social, and there I can find my account when searching for the alias without problems. But when I tried finding my account using the alias on a different server, I was surprised there was no result.
Turns out that when you're not signed in to a server, the search API will not use WebFinger to resolve the handle!
\nThis is what the search request looks like when I'm signed in:
\nGET /api/v2/search?q=hi@danillouz.dev&resolve=true\nHost: mastodon.social\n
\nAnd this is what the same search request looks like when I'm signed out:
\nGET /api/v2/search?q=hi@danillouz.dev&resolve=false\nHost: mastodon.social\n
\nThe difference is that the query parameter resolve is set to true when signed in, but to false when signed out.
And checking the v2 search API docs, we can see that resolve
controls if a WebFinger lookup should happen or not:
<blockquote>\n<p>Boolean. Attempt WebFinger lookup? Defaults to false.</p>
\n<cite>\n<p>Mastodon docs: Perform a search</p>\n</cite>\n</blockquote>
\nSince I'm redirecting WebFinger requests, I'm returning the same response for all acct: queries. So any[^4] local username can be provided together with my custom domain.
[^4]: Sadly, using emoji doesn't work though.
\nFor example, these all work:
\nhey@danillouz.dev
737@danillouz.dev
lol@danillouz.dev
It was fun to learn a bit more about Mastodon's internals. And while searching around how Mastodon uses WebFinger, I saw others had the same idea to alias their handle. Like Maarten Balliauw and Lindsay Wardell.
\nI think the latter post is pretty cool, where Lindsay is fetching Mastodon posts via RSS to show them on their site. I learned that you can postfix any account or tag with .rss, and Mastodon will give you the RSS feed for it! This reminded me of the Reddit API. So I tried postfixing with .json, and that also works[^5].
For example:
\n[^5]: But as far as I can tell, you won't get the posts in JSON for an account.
\nI especially like the RSS feed functionality, since that allows me to subscribe to accounts and tags from my favourite RSS reader[^6]!
\n[^6]: If you're not familiar with RSS, have a look at aboutfeeds.
\n","date_published":"2023-01-03T00:00:00.000Z","date_modified":"2023-02-03T00:00:00.000Z","tags":["astro","mastodon","vercel","webfinger"]},{"id":"https://www.danillouz.dev/posts/go-handlers/","url":"https://www.danillouz.dev/posts/go-handlers/","title":"Handling Go handlers","summary":"Learning about the HTTP request multiplexer, handlers and middleware in Go.","content_html":"I recently had to hook-up some middleware in a Go service. And while looking into the Go standard library net/http package, I got a bit confused by all the different (but similarly named) types and functions that deal with HTTP handlers.
\nFor example, the http.Handler and http.HandlerFunc types. The http.Handle() and http.HandleFunc() functions. And the http.ServeMux type that also defines Handle() and HandleFunc() methods.
At first I didn't really get the difference. And I didn't understand why middleware in Go is typically a function that accepts and returns an http.Handler. But after some (re)reading and experimentation, it all made sense!
This is what I learned.
\nIn a web server we'd typically have handlers that respond to HTTP requests. And routers that map URL patterns to handlers. But how are these exposed via the standard library?
\nThe net/http package exposes the http.Handler interface:
type Handler interface {\n\tServeHTTP(ResponseWriter, *Request)\n}\n
\nAnd any type that satisfies the http.Handler interface can be used as a handler. Or in other words, any type that implements the ServeHTTP(ResponseWriter, *Request) method can be used to respond to HTTP requests.
As far as I know, the standard library doesn't use the term \"router\". It uses the term HTTP request multiplexer instead. But they are essentially the same thing.
\nThe multiplexer matches the URL path of an incoming request against registered patterns, and calls the handler for the pattern that most closely matches the URL. The standard library exposes http.ServeMux for this purpose.
\nSo if we implement an http.Handler and use it together with an http.ServeMux[^1], we can use Handle() to respond to HTTP requests:
[^1]: http.ServeMux also satisfies the http.Handler interface, as it implements a ServeHTTP(ResponseWriter, *Request) method.
package main\n\nimport (\n \"log\"\n \"net/http\"\n)\n\ntype HomeHandler struct {}\nfunc (h HomeHandler) ServeHTTP(w http.ResponseWriter, r *http.Request) {\n w.Write([]byte(\"Home\"))\n}\n\nfunc main() {\n mux := http.NewServeMux()\n handler := HomeHandler{}\n mux.Handle(\"/\", handler)\n if err := http.ListenAndServe(\":8888\", mux); err != nil {\n log.Fatal(err)\n }\n}\n
\nIn the example above we used the Handle() method to respond to requests. But http.ServeMux also has the HandleFunc() method. So what's the difference?
At first glance it looks like both accept a pattern and a handler. But Handle() requires a handler that satisfies the http.Handler interface. While HandleFunc() accepts any function that defines http.ResponseWriter and *http.Request parameters:
Handle(pattern string, handler Handler)
HandleFunc(pattern string, handler func(ResponseWriter, *Request))
So we can achieve the exact same thing as in the example above with the following:
\npackage main\n\nimport (\n \"log\"\n \"net/http\"\n)\n\nfunc main() {\n mux := http.NewServeMux()\n mux.HandleFunc(\"/\", func (w http.ResponseWriter, r *http.Request) {\n w.Write([]byte(\"Home\"))\n })\n if err := http.ListenAndServe(\":8888\", mux); err != nil {\n log.Fatal(err)\n }\n}\n
\nWe saw in the above examples that http.ServeMux
exposes the Handle()
and HandleFunc()
methods. But it turns out that instead of first creating a multiplexer with http.NewServeMux()
, it's also possible to just use http.Handle() or http.HandleFunc().
For example:
\npackage main\n\nimport (\n \"log\"\n \"net/http\"\n)\n\nfunc main() {\n http.HandleFunc(\"/\", func (w http.ResponseWriter, r *http.Request) {\n w.Write([]byte(\"Home\"))\n })\n if err := http.ListenAndServe(\":8888\", nil); err != nil {\n log.Fatal(err)\n }\n}\n
\nUsing these functions will actually make use of a \"default\" http.ServeMux under the hood. This default multiplexer is defined by the standard library, and named DefaultServeMux[^2].
[^2]: DefaultServeMux is just a ServeMux.
Turns out that a very useful type to know about when working with handlers is http.HandlerFunc.
\nThis type allows us to convert a \"plain\" handler function (i.e. func(ResponseWriter, *Request)) into a \"real\" http.Handler. Which is great, because this makes it more convenient to work with handlers.
So the following won't compile:
\nhandler := func (w http.ResponseWriter, r *http.Request) {\n w.Write([]byte(\"Home\"))\n}\nhttp.Handle(\"/\", handler) // ❌ Does not compile\n
\nBut this will compile:
\nhandler := func (w http.ResponseWriter, r *http.Request) {\n w.Write([]byte(\"Home\"))\n}\nhttp.Handle(\"/\", http.HandlerFunc(handler)) // ✅ Compiles\n
\nNote that http.HandlerFunc(handler) does not invoke http.HandlerFunc (it's a type, not a function!). Rather, it's doing a type conversion[^3], which converts handler with type func(ResponseWriter, *Request) into type http.HandlerFunc.
[^3]: A type conversion is not the same thing as a type assertion.
\nMiddleware are typically small functions which take a request, do something with it, and then pass it to another middleware or the (final) handler.
\nIn Go, middleware will sit \"between\" the multiplexer and the handler responding to the HTTP requests.
\nA few examples of typical middleware use cases are:
\nGenerally speaking, in Go, functions that accept and return an http.Handler
are considered middleware:
func(next http.Handler) http.Handler\n
\nFor example:
\nfunc someMiddleware(next http.Handler) http.Handler {\n return http.HandlerFunc(func (w http.ResponseWriter, r *http.Request) {\n // Do something with `r`.\n\n next.ServeHTTP(w, r)\n })\n}\n
\nWhy does middleware accept and return an http.Handler? This allows us to create a \"chain\" of handlers:
http.Handle(\"/\", middlewareA(middlewareB(middlewareC(handler))))\n
\nBut this can get a bit unreadable. And that's why third-party libraries typically offer a Use() function.
For example, this is how you'd use it with chi:
\nr := chi.NewRouter()\nr.Use(middlewareA, middlewareB, middlewareC)\nr.Get(\"/\", handler)\n
\nTo wrap up, I want to highlight some (sometimes unexpected) behavior I learned about while reading the docs and playing with http.ServeMux.
When registering a handler for a pattern with http.ServeMux, the pattern can either name fixed paths, or subtree paths.
Fixed paths do not have a trailing slash (e.g. /blog or /blog/create). And they are only matched when the URL exactly matches the pattern.
Subtree paths do have a trailing slash (e.g. /, /blog/ or /blog/create/). And they match all paths not matched by other registered patterns. So subtree paths kind of work like \"catch all\" patterns:
mux.HandleFunc(\"/\", homeHandler) // Subtree path\n
\nRequest path | \nCalls homeHandler | \n
---|---|
/ | \n✅ Yes | \n
/blog | \n✅ Yes | \n
/blog/ | \n✅ Yes | \n
/blog/create | \n✅ Yes | \n
/notfound | \n✅ Yes | \n
Note that subtree path patterns will match when not matched by other registered (fixed path) patterns:
\nmux.HandleFunc(\"/\", homeHandler) // Subtree path\nmux.HandleFunc(\"/blog\", blogHandler) // Fixed path\n
\nRequest path | \nCalls homeHandler | \nCalls blogHandler | \n
---|---|---|
/ | \n✅ Yes | \n❌ No | \n
/blog | \n❌ No | \n✅ Yes | \n
/blog/ | \n✅ Yes | \n❌ No | \n
/blog/create | \n✅ Yes | \n❌ No | \n
/notfound | \n✅ Yes | \n❌ No | \n
So, for example, to let handlers match /blog/* URL patterns, a subtree path must be used instead of a fixed path:
mux.HandleFunc(\"/\", homeHandler) // Subtree path\nmux.HandleFunc(\"/blog/\", blogHandler) // Subtree path\n
\nRequest path | \nCalls homeHandler | \nCalls blogHandler | \n
---|---|---|
/ | \n✅ Yes | \n❌ No | \n
/blog | \n❌ No | \n✅ Yes | \n
/blog/ | \n❌ No | \n✅ Yes | \n
/blog/create | \n❌ No | \n✅ Yes | \n
/notfound | \n✅ Yes | \n❌ No | \n
Also note that longer registered path patterns take precedence over shorter ones:
\nmux.HandleFunc(\"/blog/\", blogHandler) // Subtree path\nmux.HandleFunc(\"/blog/create/\", blogCreateHandler) // Subtree path\n
\nRequest path | \nCalls blogHandler | \nCalls blogCreateHandler | \n
---|---|---|
/ | \n❌ No | \n❌ No | \n
/blog | \n✅ Yes | \n❌ No | \n
/blog/ | \n✅ Yes | \n❌ No | \n
/blog/1 | \n✅ Yes | \n❌ No | \n
/blog/create | \n❌ No | \n✅ Yes | \n
/blog/create/1 | \n❌ No | \n✅ Yes | \n
/notfound | \n❌ No | \n❌ No | \n
If a subtree path pattern has been registered with http.ServeMux, and it receives a request path without a trailing slash, it will redirect the request to the \"subtree root\" (i.e. redirect to the request path with the trailing slash).
To prevent this from happening you need to register the pattern for the path without the trailing slash.
\nFor example, when registering /blog/, requests to /blog will redirect to /blog/, unless /blog is also registered.
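\nA quick way to see this behavior (assuming a server like the earlier examples is running on port 8888, with only /blog/ registered) is to request the path without the trailing slash; the response will look roughly like this:
\ncurl -i http://localhost:8888/blog\n\nHTTP/1.1 301 Moved Permanently\nLocation: /blog/\n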
http.ServeMux will \"sanitize\" the URL request path and the Host header.
It will strip the port number and redirect any request containing . or .. elements, or repeated slashes, to a similar but cleaner URL.
http.ServeMux only supports basic prefix matching. So it does not have support for more advanced routing features, like matching on the HTTP request method or path variables.
For such features, you either need to implement that yourself (e.g. check the request method in a handler). Or use a third-party library like chi or gin.
\nI mostly wrote this as a reference for my future self. But perhaps it can be useful to others as well!
\n","date_published":"2022-12-22T00:00:00.000Z","date_modified":"2023-01-29T00:00:00.000Z","tags":["golang","http","middleware","multiplexer","request-handler","router"]},{"id":"https://www.danillouz.dev/posts/audio-transcoding-lambda/","url":"https://www.danillouz.dev/posts/audio-transcoding-lambda/","title":"Audio transcoding with AWS Lambda","summary":"Transcoding short audio files with Amazon Elastic Transcoder or FFmpeg.","content_html":"import { Image } from \"astro:assets\"
\nFor a side project I'm converting WebM audio files to MP3. I initially started doing this with Amazon Elastic Transcoder. But after doing the same with FFmpeg and Lambda Layers, my initial testing showed that the latter is around 10 times cheaper and 2 times faster for short audio recordings (~3 minute / ~3 MB files).
\nIf you just want to read the code, have a look at github.com/upstandfm/audio-transcoder.
\nMy side project is a web app that allows users to record their voice so others can listen to it. In the app I use the MediaStream Recording API (aka Media Recording API) to easily record audio from the user's input device. It works really well, and you don't have to use any external libraries!
\nThere's one catch though. At the time of this writing it only works in Firefox, Chrome and Opera. And it \"sort of\" works in Safari[^1]. Even though that's a bit disappointing, I'm okay with that for my use case.
\n[^1]: In Safari the Media Recording API is hidden behind a feature flag. And not all events are supported.
\nSo after I had built something functional that allowed me to record my voice, it turned out that the audio file I ended up with had to be transcoded if I wanted to listen to it across a wide range of browsers and devices.
\nBefore I can answer that, we need to explore what an audio file is.
\nWe can think of an audio file like a stream of data elements wrapped in a container. This container is formally called a media container format. And it's basically a file format (think file type) that can store different types of data elements (i.e. bits).
\nThe container describes how this data \"coexists\" in a file. Some container formats only support audio, like WAVE (usually referred to as WAV). And others support both audio and video, like WebM.
\nSo a container \"wraps\" data to store it in a file, but information can be stored in different ways. And we'll also want to compress the data to optimize for storage and/or bandwidth by encoding it (i.e. converting it from one \"form\" to another).
\nThis is where a codec (coder/decoder) comes into play. It handles all the processing that's required to encode (compress) and decode (decompress) the audio data.
\nTherefore, in order to define the format of an audio file (or a video file) we need both a container and a codec. For example, when the MPEG-1 Audio Layer 3 codec is used to store only audio data in an MPEG-1 container[^2], we get an MP3 file (even though it's technically still an MPEG format file).
\n[^2]: A container is not always required. WebRTC does not use a container at all. Instead, it streams the encoded audio and video tracks directly from one peer to another, using MediaStreamTrack objects to represent each track.
So what does transcoding mean? It's the process of converting one encoding into another. And if we convert one container format into another, this process is called transmuxing.
\nThere are a lot of codecs available. And each codec will have a different effect on the quality, size and/or compatibility of the audio file[^3].
\n[^3]: If you'd like to learn more about audio codecs, I recommend reading the Mozilla web audio codec guide.
\nYou might be wondering (like I was), if we can record audio directly in the browser and immediately use the result in our app, why do we even have to transcode it?
\nThe answer is: to optimize for compatibility. Because the Media Recording API can not record audio in all media formats.
\nFor example, MP3 has good compatibility across browsers and devices for playback, but is not supported by the Media Recording API. Which formats are supported depends on the browser's specific implementation of said API.
\nWe can use the isTypeSupported method to figure out if we can record in a specific media type by calling it with a MIME type. Run the following code in the web console (e.g. in Firefox) to see it in action:
\nMediaRecorder.isTypeSupported(\"audio/mpeg\") // false\n
\nOkay, MP3 isn't supported. Which format can we use to record in then? It looks like WebM is a good choice:
\nMediaRecorder.isTypeSupported(\"audio/webm\") // true\n
\nAlso note that you can specify the codec in addition to the container:
\nMediaRecorder.isTypeSupported(\"audio/webm;codecs=opus\") // true\n
\nSo if we want to end up with MP3 files of the recordings, we need to transcode (and technically also transmux) the WebM audio recordings.
\nWe'll explore two implementations that both convert a WebM audio file to MP3:
\n\nFor both implementations we'll use the Serverless Framework and Node.js to write the code for the Lambda function that converts an audio file.
\nBefore we get started, make sure you have Node.js installed. And then use npm to install the Serverless Framework globally:
\nnpm i -g serverless\n
\nAdditionally, we'll need two S3 buckets to process and store the converted audio files: an input bucket for the uploaded recordings, and an output bucket for the converted MP3 files.
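\nFor example, a minimal sketch with the AWS CLI, using the bucket names from the IAM policy later in this post (bucket names are globally unique, so you may need to pick different ones):
\naws s3 mb s3://raw.recordings --region eu-west-1\naws s3 mb s3://transcoded.recordings --region eu-west-1\n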
\nAmazon Elastic Transcoder is a fully managed and highly scalable AWS service that can be used to transcode audio and video files.
\nWe can use this service to schedule a transcoding job in a pipeline. The pipeline knows from which bucket to read a file that needs to be converted, and to which bucket the converted file should be written. Whereas the job contains instructions on which file to transcode, and to what format it should be converted.
\nWe'll create a Lambda function that will \"listen\" to the S3 input bucket. And whenever a new object is created in that bucket, Lambda will schedule a transcoder job to create the MP3 file.
\nSo the flow will be like this:
\n\n\nAt the time of this writing AWS CloudFormation has no support for Amazon Elastic Transcoder. So you'll have to use the AWS web console to create and configure your pipeline(s).
\n
We'll go through the following steps to get it up and running:
\nNavigate to the Elastic Transcoder service in the AWS web console. Select a region (we'll use eu-west-1
), and click on \"Create New Pipeline\".
Create the pipeline and take note of the ARN and Pipeline ID. We'll need both to configure the Lambda function later on.
\n\nThe pipeline we created in the previous step requires a preset to work. Presets contain settings we want to be applied during the transcoding process. And lucky for us, AWS already has system presets to convert to MP3 files.
\nIn the web console, click on \"Presets\" and filter on the keyword \"MP3\". Select one and take note of its ARN and Preset ID. We'll also need these to configure the Lambda function.
\n\nAWS will already have created am IAM Role named Elastic_Transcoder_Default_Role
. But in order for the pipeline to read objects from the input bucket and write objects to the output bucket, we need to make sure the role has the required permissions to do so.
Create a new IAM Policy with the following configuration:
\n{\n \"Version\": \"2012-10-17\",\n \"Statement\": [\n {\n \"Effect\": \"Allow\",\n \"Action\": \"s3:GetObject\",\n \"Resource\": \"arn:aws:s3:::raw.recordings/*\"\n },\n {\n \"Effect\": \"Allow\",\n \"Action\": \"s3:PutObject\",\n \"Resource\": \"arn:aws:s3:::transcoded.recordings/*\"\n },\n {\n \"Effect\": \"Allow\",\n \"Action\": \"s3:ListBucket\",\n \"Resource\": \"arn:aws:s3:::transcoded.recordings\"\n }\n ]\n}\n
\nMake sure the resource ARNs of your input and output buckets are named correctly. And after the Policy has been created, attach it to Elastic_Transcoder_Default_Role
.
Create a new project named \"audio-transcoder\". Move into this directory and create a Serverless manifest in the project root:
\nservice: audio-transcoder\n\nprovider:\n name: aws\n runtime: nodejs10.x\n\npackage:\n exclude:\n - ./*\n - ./**/*.test.js\n include:\n - node_modules\n - src\n
\nAdd the Elastic Transcoder Pipeline ID, MP3 Preset ID and region (from step 1 and step 2) as environment variables:
\nservice: audio-transcoder\n\nprovider:\n name: aws\n runtime: nodejs10.x\n environment:\n TRANSCODE_AUDIO_PIPELINE_ID: \"1572538082044-xmgzaa\"\n TRANSCODER_MP3_PRESET_ID: \"1351620000001-300040\"\n ELASTIC_TRANSCODER_REGION: \"eu-west-1\"\n\npackage:\n exclude:\n - ./*\n - ./**/*.test.js\n include:\n - node_modules\n - src\n
\nUse the Elastic Transcoder Pipeline ARN and MP3 Preset ARN (from step 1 and step 2) to configure the Lambda with the required IAM permissions, so it can create transcoder jobs:
\nservice: audio-transcoder\n\nprovider:\n name: aws\n runtime: nodejs10.x\n environment:\n TRANSCODE_AUDIO_PIPELINE_ID: \"1572538082044-xmgzaa\"\n TRANSCODER_MP3_PRESET_ID: \"1351620000001-300040\"\n ELASTIC_TRANSCODER_REGION: \"eu-west-1\"\n iamRoleStatements:\n - Effect: Allow\n Action:\n - elastictranscoder:CreateJob\n Resource:\n - YOUR_PIPELINE_ARN # Replace this with the ARN from step 1\n - YOUR_PRESET_ARN # Replace this with the ARN from step 2\n\npackage:\n exclude:\n - ./*\n - ./**/*.test.js\n include:\n - node_modules\n - src\n
\nAnd finally, add the Lambda function definition. This Lambda will be executed whenever an object is created in the input bucket:
\nservice: audio-transcoder\n\nprovider:\n name: aws\n runtime: nodejs10.x\n environment:\n TRANSCODE_AUDIO_PIPELINE_ID: \"1572538082044-xmgzaa\"\n TRANSCODER_MP3_PRESET_ID: \"1351620000001-300040\"\n ELASTIC_TRANSCODER_REGION: \"eu-west-1\"\n iamRoleStatements:\n - Effect: Allow\n Action:\n - elastictranscoder:CreateJob\n Resource:\n - YOUR_PIPELINE_ARN # Replace this with the ARN from step 1\n - YOUR_PRESET_ARN # Replace this with the ARN from step 2\n\npackage:\n exclude:\n - ./*\n - ./**/*.test.js\n include:\n - node_modules\n - src\n\nfunctions:\n transcodeToMp3:\n handler: src/handler.transcodeToMp3\n description: Transcode an audio file to MP3\n events:\n - s3:\n bucket: \"raw.recordings\"\n event: \"s3:ObjectCreated:*\"\n existing: true\n
\nIn order to match the Lambda function definition in the Serverless manifest, create a file named handler.js
in src
. And export a method named transcodeToMp3
:
\"use strict\"\n\nmodule.exports.transcodeToMp3 = async () => {\n try {\n // Implementation goes here.\n } catch (err) {\n console.log(\"Transcoder Error: \", err)\n }\n}\n
\nIn the previous step we configured the Lambda to be executed whenever an object is created in the input bucket. This means that AWS will call the Lambda with an event
message that contains a list of Records
. And each Record
will contain an s3
object with information about the s3:ObjectCreated
event:
// \"event\" object:\n{\n \"Records\":[\n // \"Record\" object:\n {\n \"s3\":{\n // Contains information about the \"s3:ObjectCreated\" event.\n }\n }\n ]\n}\n
\nThe s3
object will contain a property called key
, which is the \"name\" of the file that was created in the input bucket. For example, if we upload a file named test.webm
to the S3 bucket, the value of key
will be the (URL encoded!) string test.webm
.
You can see the entire event message structure in the AWS S3 docs.
\nAlso be aware that you can get more than one Record
. So always process all of them:
\"use strict\"\n\nmodule.exports.transcodeToMp3 = async (event) => {\n try {\n for (const Record of event.Records) {\n const { s3 } = Record\n if (!s3) {\n continue\n }\n\n const { object: s3Object = {} } = s3\n const { key } = s3Object\n if (!key) {\n continue\n }\n\n const decodedKey = decodeURIComponent(key)\n // TODO: use \"decodedKey\" to schedule transcoder job.\n }\n } catch (err) {\n console.log(\"Transcoder Error: \", err)\n }\n}\n
\nFinally, initialize the transcoder client. And schedule a transcoder job for every created object in the input bucket:
\n\"use strict\"\n\nconst ElasticTranscoder = require(\"aws-sdk/clients/elastictranscoder\")\n\nconst {\n ELASTIC_TRANSCODER_REGION,\n TRANSCODE_AUDIO_PIPELINE_ID,\n TRANSCODER_MP3_PRESET_ID,\n} = process.env\n\nconst transcoderClient = new ElasticTranscoder({\n region: ELASTIC_TRANSCODER_REGION,\n})\n\nmodule.exports.transcodeToMp3 = async (event) => {\n try {\n for (const Record of event.Records) {\n const { s3 } = Record\n if (!s3) {\n continue\n }\n\n const { object: s3Object = {} } = s3\n const { key } = s3Object\n if (!key) {\n continue\n }\n\n const decodedKey = decodeURIComponent(key)\n await transcoderClient\n .createJob({\n PipelineId: TRANSCODE_AUDIO_PIPELINE_ID,\n Input: {\n Key: decodedKey,\n },\n Outputs: [\n {\n Key: decodedKey.replace(\"webm\", \"mp3\"),\n PresetId: TRANSCODER_MP3_PRESET_ID,\n },\n ],\n })\n .promise()\n }\n } catch (err) {\n console.log(\"Transcoder Error: \", err)\n }\n}\n
\nYou can read more about the createJob
API in the AWS JavaScript SDK docs.
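\nBefore deploying, you could smoke test the handler locally with a hand-rolled event. This is an ad-hoc sketch (the file name is made up; it assumes the aws-sdk package is installed locally, your AWS credentials are configured and the environment variables from the manifest are exported, and it will schedule a real job):
\n// smoke-test.js (run with \"node smoke-test.js\" from the project root)\nconst { transcodeToMp3 } = require(\"./src/handler\")\n\ntranscodeToMp3({\n  Records: [{ s3: { object: { key: \"test.webm\" } } }],\n}).then(() => console.log(\"done\"))\n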
In order to upload the Lambda to AWS, make sure you have your credentials configured. And then run the following command from the project root to release the Lambda:
\nsls deploy --region eu-west-1 --stage prod\n
\nWith everything up and running, we can now upload a WebM audio file to the input bucket to schedule a transcoder job. Navigate to the S3 service in the AWS web console:
\nThis action will trigger an s3:ObjectCreated
event. AWS will execute the Lambda function we deployed in the previous step, and it will schedule a transcoder job.
To get more information about a scheduled job, navigate to the Elastic Transcoder service in the AWS web console. Click on \"Jobs\", select your pipeline and click \"Search\". Here you can select a job to get more details about it.
\n\nIf it has status \"Complete\", there should be a file named test.mp3
in the output bucket!
FFmpeg is a cross-platform solution that can be used to convert audio and video files. And since it's a binary, we'll use a Lambda Layer to execute it from the Lambda function.
\nLambda Layers allow us to \"pull in\" extra dependencies into Lambda functions. A layer is basically a ZIP archive that contains some code. And in order to use a layer we first must create and publish one.
\nAfter we publish a layer we can configure any Lambda function to use it[^4]. AWS will then extract the layer to a special directory called /opt
. And the Lambda function runtime will be able to execute it.
[^4]: At the time of this writing a Lambda function can use up to 5 layers at a time.
\nWe're basically \"swapping out\" Amazon Elastic Transcoder with FFmpeg. Other than that the flow is still the same.
\nSo since we're still converting a WebM audio file to MP3 whenever it's uploaded to the input bucket, we can reuse the Lambda from the previous implementation by making these changes:
\nWe'll apply these changes by going through the following steps:
\nThe Serverless Framework makes it very easy to work with layers. To get started create a new project named \"lambda-layers\". Move into this directory and create a Serverless manifest in the project root:
\nservice: lambda-layers\n\nprovider:\n name: aws\n runtime: nodejs10.x\n\npackage:\n exclude:\n - ./*\n include:\n - layers\n\nlayers:\n ffmpeg:\n path: layers\n description: FFmpeg binary\n compatibleRuntimes:\n - nodejs10.x\n licenseInfo: GPL v2+, for more info see https://github.com/FFmpeg/FFmpeg/blob/master/LICENSE.md\n
\nThe layer is named ffmpeg
and the path
property dictates that the layer code will reside in a directory named layers
. Match this structure in the project by creating that directory first.
Move into the layers
directory and download a static build of FFmpeg from johnvansickle.com/ffmpeg[^5].
[^5]: These FFmpeg builds are all compatible with Amazon Linux 2. This is the operating system on which Lambda runs when the Node.js
runtime is used.
Use the recommended ffmpeg-git-amd64-static.tar.xz
master build:
curl -O https://johnvansickle.com/ffmpeg/builds/ffmpeg-git-amd64-static.tar.xz\n
\nExtract the files from the downloaded archive:
\ntar -xvf ffmpeg-git-amd64-static.tar.xz\n
\nRemove the downloaded archive:
\nrm ffmpeg-git-amd64-static.tar.xz\n
\nAnd rename the extracted directory to ffmpeg
, so it matches the configured layer name in the Serverless manifest. For example:
mv ffmpeg-git-20191029-amd64-static ffmpeg\n
\nYou should now have the following files and folder structure:
\nlambda-layers\n ├── layers\n │ └── ffmpeg\n │ ├── GPLv3.txt\n │ ├── ffmpeg\n │ ├── ffprobe\n │ ├── manpages\n │ ├── model\n │ ├── qt-faststart\n │ └── readme.txt\n └── serverless.yml\n
\nPublish the layer by running the following command from the project root:
\nsls deploy --region eu-west-1 --stage prod\n
\nWhen Serverless finishes deploying, navigate to the Lambda service in the AWS web console and click on \"Layers\". Here you should see the published layer. Click on it and take note of the ARN. We'll need it in the next step.
\n\nWe'll now be modifying the manifest file of the audio-transcoder
project.
First change the environment variables, and add the names of your input and output buckets. Then change the IAM permissions so the Lambda function can read from the input bucket and write to the output bucket. And finally, change the Lambda function to use the FFmpeg layer with the ARN from the previous step:
\nservice: audio-transcoder\n\nprovider:\n name: aws\n runtime: nodejs10.x\n environment:\n S3_INPUT_BUCKET_NAME: \"raw.recordings\"\n S3_OUTPUT_BUCKET_NAME: \"transcoded.recordings\"\n iamRoleStatements:\n - Effect: Allow\n Action:\n - s3:GetObject\n Resource: arn:aws:s3:::raw.recordings/*\n - Effect: Allow\n Action:\n - s3:PutObject\n Resource: arn:aws:s3:::transcoded.recordings/*\n\npackage:\n exclude:\n - ./*\n - ./**/*.test.js\n include:\n - node_modules\n - src\n\nfunctions:\n transcodeToMp3:\n handler: src/handler.transcodeToMp3\n description: Transcode an audio file to MP3\n events:\n - s3:\n bucket: \"raw.recordings\"\n event: \"s3:ObjectCreated:*\"\n existing: true\n layers:\n - YOUR_FFMPEG_LAYER_ARN # Replace this with the ARN from step 1\n
\nSince we have to read from the input bucket and write to the output bucket, replace the Elastic Transcoder client with the S3 client. And use the decodedKey
to get the WebM recording from the input bucket:
\"use strict\"\n\nconst S3 = require(\"aws-sdk/clients/s3\")\nconst { S3_INPUT_BUCKET_NAME, S3_OUTPUT_BUCKET_NAME } = process.env\nconst s3Client = new S3()\n\nmodule.exports.transcodeToMp3 = async (event) => {\n try {\n for (const Record of event.Records) {\n const { s3 } = Record\n if (!s3) {\n continue\n }\n\n const { object: s3Object = {} } = s3\n const { key } = s3Object\n if (!key) {\n continue\n }\n\n const decodedKey = decodeURIComponent(key)\n const webmRecording = await s3Client\n .getObject({\n Bucket: S3_INPUT_BUCKET_NAME,\n Key: decodedKey,\n })\n .promise()\n }\n } catch (err) {\n console.log(\"Transcoder Error: \", err)\n }\n}\n
\nThe S3 client returns an object that contains a Body
property. The value of Body
 is a blob (a Buffer in Node.js), which we'll feed to the FFmpeg layer to convert it to MP3.
We'll do this via a helper function that will spawn a synchronous child process which allows us to execute the ffmpeg
\"command\" (provided by the FFmpeg layer):
\"use strict\"\n\nconst { spawnSync } = require(\"child_process\")\n\nmodule.exports = {\n convertWebmToMp3(webmBlob) {\n spawnSync(\n \"/opt/ffmpeg/ffmpeg\", // \"/opt/:LAYER_NAME/:BINARY_NAME\"\n [\n // FFmpeg command arguments go here.\n ],\n { stdio: \"inherit\" }\n )\n\n // Rest of the implementation goes here.\n },\n}\n
\nThe ffmpeg
command requires the file system to do its magic. And we'll use a \"special\" directory called /tmp
[^6] for this.
[^6]: At the time of this writing the /tmp
directory allows you to temporarily store up to 512 MB.
First write the WebM blob to /tmp
so FFmpeg can read it. And then tell it to write the produced MP3 file back to the same directory:
\"use strict\"\n\nconst { spawnSync } = require(\"child_process\")\nconst { writeFileSync } = require(\"fs\")\n\nmodule.exports = {\n convertWebmToMp3(webmBlob) {\n const now = Date.now()\n const input = `/tmp/${now}.webm`\n const output = `/tmp/${now}.mp3`\n\n writeFileSync(input, webmBlob)\n\n spawnSync(\"/opt/ffmpeg/ffmpeg\", [\"-i\", input, output], {\n stdio: \"inherit\",\n })\n\n // TODO: cleanup and return MP3 blob.\n },\n}\n
\nNow read the produced MP3 file from disk, clean /tmp
, and return the MP3 blob:
\"use strict\"\n\nconst { spawnSync } = require(\"child_process\")\nconst { readFileSync, writeFileSync, unlinkSync } = require(\"fs\")\n\nmodule.exports = {\n convertWebmToMp3(webmBlob) {\n const now = Date.now()\n const input = `/tmp/${now}.webm`\n const output = `/tmp/${now}.mp3`\n\n writeFileSync(input, webmBlob)\n\n spawnSync(\"/opt/ffmpeg/ffmpeg\", [\"-i\", input, output], {\n stdio: \"inherit\",\n })\n\n const mp3Blob = readFileSync(output)\n\n unlinkSync(input)\n unlinkSync(output)\n\n return mp3Blob\n },\n}\n
\nFinally, use the MP3 blob in the handler to write it to the output bucket:
\n\"use strict\"\n\nconst S3 = require(\"aws-sdk/clients/s3\")\nconst ffmpeg = require(\"./ffmpeg\")\nconst { S3_INPUT_BUCKET_NAME, S3_OUTPUT_BUCKET_NAME } = process.env\nconst s3Client = new S3()\n\nmodule.exports.transcodeToMp3 = async (event) => {\n try {\n for (const Record of event.Records) {\n const { s3 } = Record\n if (!s3) {\n continue\n }\n\n const { object: s3Object = {} } = s3\n const { key } = s3Object\n if (!key) {\n continue\n }\n\n const decodedKey = decodeURIComponent(key)\n const webmRecording = await s3Client\n .getObject({\n Bucket: S3_INPUT_BUCKET_NAME,\n Key: decodedKey,\n })\n .promise()\n\n const mp3Blob = ffmpeg.convertWebmToMp3(webmRecording.Body)\n await s3Client\n .putObject({\n Bucket: S3_OUTPUT_BUCKET_NAME,\n Key: decodedKey.replace(\"webm\", \"mp3\"),\n ContentType: \"audio/mpeg\",\n Body: mp3Blob,\n })\n .promise()\n }\n } catch (err) {\n console.log(\"Transcoder Error: \", err)\n }\n}\n
\nRun the same command as before from the project root to release the Lambda:
\nsls deploy --region eu-west-1 --stage prod\n
\nWhen Serverless is done deploying, upload another WebM audio file to the input bucket.
\nBut nothing happens... Where's the MP3 file?
\nLet's find out why by checking the Lambda function's logs in the AWS web console. Open the logs of the audio-transcoder-prod-transcodeToMp3 function.
\nHere you should see the log output of the Lambda function.
\n\nThe logs tell us that FFmpeg is executing (hooray!) but that it doesn't complete (boo!).
\nIn the middle of the transcoding process the logs just say END
. And on the last line we see that the Lambda had a duration of 6006.17 ms
.
What's happening? The Lambda function takes \"too long\" to finish executing. By default Lambda has a timeout of 6 seconds[^7]. And after 6 seconds the Lambda function is still not done transcoding, so AWS terminates it.
\n[^7]: At the time of this writing the maximum timeout is 900 seconds.
\nHow do we solve this? By optimizing the Lambda function!
\nFirst let's just set the timeout to a larger value. For example 180 seconds. This way we can see how long it would actually take to complete the transcoding process:
\nfunctions:\n transcodeToMp3:\n timeout: 180\n
\nDeploy again. When Serverless is done, upload another WebM audio file, and check the logs.
\n\nThis time we see FFmpeg completes the transcoding process and that the Lambda had a duration of 7221.95 ms
. If we check the output bucket now, we'll see the MP3 file!
Transcoding the audio file in ~7 seconds isn't bad. Actually, it's very similar to Amazon Elastic Transcoder. But we can do better.
\nSomething that's very important when working with Lambda is to always performance tune your functions. In other words, always make sure that a Lambda function has the optimal memory size configured.
\nThis is important because when you choose a higher memory setting, AWS will also give you an equivalent resource boost (like CPU). And this will usually positively impact the Lambda function's runtime duration. Which means you'll pay less money.
\nBy default a Lambda function has a memory setting of 128 MB. So let's increase it and compare results. A good strategy is usually to keep doubling the memory and measuring the duration. But for the sake of brevity, I'm jumping ahead to 2048 MB:
\nfunctions:\n transcodeToMp3:\n memorySize: 2048\n
\nDeploy again. And when Serverless is done, upload another WebM audio file and check the logs.
\n\nGreat, it's even faster now! Does this mean we can just keep increasing the memory and reap the benefits? Sadly, no. There's a tipping point where increasing the memory wont make it run faster.
\nFor example, increasing the memory to 3008 MB (the maximum memory limit at the time of this writing) will result in a similar runtime duration:

| Test run | Duration | Billed Duration | Cold Start Duration |
| --- | --- | --- | --- |
| 1 | 3775,63 ms | 3800 ms | 392,59 ms |
| 2 | 3604,71 ms | 3700 ms | - |
| 3 | 3682,62 ms | 3700 ms | - |
| 4 | 3677,14 ms | 3700 ms | - |
| 5 | 3725,77 ms | 3800 ms | - |

| Test run | Duration | Billed Duration | Cold Start Duration |
| --- | --- | --- | --- |
| 1 | 4125,12 ms | 4200 ms | 407,92 ms |
| 2 | 3767,79 ms | 3800 ms | - |
| 3 | 3736,06 ms | 3800 ms | - |
| 4 | 3662,68 ms | 3700 ms | - |
| 5 | 3717,01 ms | 3800 ms | - |
When done optimizing, make sure to apply a sensible value for the Lambda timeout. In this case, the default of 6 seconds would be a good one.
\nTo compare costs between both implementations, I did a couple of test runs converting a 3 minute (2,8 MB) WebM audio file to MP3.
\nThe following comparison is by no means very extensive, and your mileage may vary. But I think it's good enough to get a decent impression of the cost range.
\nThe pricing page tells us we pay per minute (with 20 free minutes every month). And when we only transcode audio in region eu-west-1, we'll currently pay $0,00522 per minute of transcoding time.
These are the timing results of the test runs:
\nTest run | \nTranscoding Time | \n
---|---|
1 | \n7638 ms | \n
2 | \n6663 ms | \n
3 | \n7729 ms | \n
4 | \n6595 ms | \n
5 | \n8752 ms | \n
6 | \n7216 ms | \n
7 | \n7167 ms | \n
8 | \n6605 ms | \n
9 | \n6718 ms | \n
10 | \n8700 ms | \n
So the average transcoding time of the audio file would be:
\n7638 + 6663 + 7729 + 6595 + 8752 + 7216 + 7167 + 6605 + 6718 + 8700 = 73 783 ms\n\n73783 / 10 = 7378,3 ms\n\n7378,3 / 1000 = 7,3783 sec\n
\nLet's say we're transcoding 100 000
of these audio files per month, that would amount to a total transcoding time of:
7,3783 * 100 000 = 737 830 sec\n\n737 830 / 60 = 12 297,166 666 667 min\n
\nSince we pay $0,00522
per minute, the costs without free tier would be:
12 297,166 666 667 * 0,00522 = $64,191 21\n
\nAnd with free tier it would cost:
\n(12 297,166 666 667 - 20) * 0,00522 = $64,086 81\n
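\nAs a sanity check, here's a small Node.js sketch (not part of the project code) that reproduces these numbers with the per-minute price and free minutes quoted above:
\nconst PRICE_PER_MINUTE = 0.00522 // eu-west-1 audio transcoding price used in this post\nconst FREE_MINUTES = 20\n\nfunction elasticTranscoderCost(avgSeconds, jobsPerMonth) {\n  const minutes = (avgSeconds * jobsPerMonth) / 60\n  return Math.max(minutes - FREE_MINUTES, 0) * PRICE_PER_MINUTE\n}\n\nconsole.log(elasticTranscoderCost(7.3783, 100000)) // ~64.09 (the 20 free minutes subtracted)\n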
\nWe're using Lambda to schedule Amazon Elastic Transcoder jobs. So we also have to calculate those (minor if not negligible) costs.
\nThe Lambda pricing page tells us we pay for the number of requests and the duration (which depends on memory setting).
\nWe get 1 million requests for free every month, and after that you pay $0,20 per 1 million requests. Since we're only doing 1/10th of that in this example, I'm not including request costs in the calculations. I'm only focusing on duration costs here.
These are the Lambda durations (with 128 MB memory) for the accompanying transcoder test runs:
\nTest run | \nDuration | \nBilled Duration | \nCold Start Duration | \n
---|---|---|---|
1 | \n494,08 ms | \n500 ms | \n401,61 ms | \n
2 | \n185,01 ms | \n200 ms | \n- | \n
3 | \n168,29 ms | \n200 ms | \n- | \n
4 | \n165,29 ms | \n200 ms | \n- | \n
5 | \n184,89 ms | \n200 ms | \n- | \n
6 | \n210,19 ms | \n300 ms | \n- | \n
7 | \n162,64 ms | \n200 ms | \n- | \n
8 | \n178,79 ms | \n200 ms | \n- | \n
9 | \n318,84 ms | \n400 ms | \n- | \n
10 | \n206,18 ms | \n300 ms | \n- | \n
The average billed duration would be:
\n500 + 200 + 200 + 200 + 200 + 300 + 200 + 200 + 400 + 300 = 2700 ms\n\n2700 / 10 = 270 ms\n\n270 / 1000 = 0,27 sec\n
\nIn region eu-west-1, we'll currently pay $0,000 016 6667 for every gigabyte-second (GB-s), i.e. the function's memory in GB multiplied by its billed duration in seconds. That means we first have to calculate "how much" memory the Lambda function uses for its runtime duration.
For 100 000
transcoding jobs per month (with 128 MB memory) that would be:
100 000 * 0,27 = 27000 sec\n\n(128 / 1024) * 27000 = 3375 GB/sec\n
\nCurrently you get 400 000 GB-s for free every month, so depending on your scale you may or may not have to include it in your calculations. But without the free tier it would cost:
3375 * 0,000 016 6667 = $0,056 250 113\n
\nThese are the Lambda durations (with 2048 MB memory) of the test runs:

| Test run | Duration | Billed Duration | Cold Start Duration |
| --- | --- | --- | --- |
| 1 | 4068,56 ms | 4100 ms | 408,17 ms |
| 2 | 3880,55 ms | 3900 ms | - |
| 3 | 3910,52 ms | 4000 ms | - |
| 4 | 3794,20 ms | 3800 ms | - |
| 5 | 3856,73 ms | 3900 ms | - |
| 6 | 3859,06 ms | 3900 ms | - |
| 7 | 3810,93 ms | 3900 ms | - |
| 8 | 3799,19 ms | 3800 ms | - |
| 9 | 3858,49 ms | 3900 ms | - |
| 10 | 3866,53 ms | 3900 ms | - |
The average billed duration would be:
\n4100 + 3900 + 4000 + 3800 + 3900 + 3900 + 3900 + 3800 + 3900 + 3900 = 39100 ms\n\n39100 / 10 = 3910 ms\n\n3910 / 1000 = 3,91 sec\n
\nIn region eu-west-1, we'll currently pay $0,000 016 6667 for every GB-s. For 100 000 transcoding jobs (with 2048 MB memory) that would be:
\n100 000 * 3,91 = 391 000 sec\n\n(2048 / 1024) * 391 000 = 782 000 GB-s\n
\nWithout free tier it would cost:
\n782 000 * 0,000 016 6667 = $13,033 3594\n
\nWith free tier it would cost:
\n(782 000 - 400 000) * 0,000 016 6667 = $6,366 6794\n
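\nAgain as a sanity check, here's a small Node.js sketch (not part of the project code) that reproduces the Lambda duration costs above:
\nconst PRICE_PER_GB_SECOND = 0.0000166667\n\nfunction lambdaDurationCost({ memoryMb, billedSeconds, invocations, freeTierGbSeconds = 0 }) {\n  const gbSeconds = (memoryMb / 1024) * billedSeconds * invocations\n  return Math.max(gbSeconds - freeTierGbSeconds, 0) * PRICE_PER_GB_SECOND\n}\n\n// 128 MB with an average billed duration of 0,27 sec: ~$0,056\nconsole.log(lambdaDurationCost({ memoryMb: 128, billedSeconds: 0.27, invocations: 100000 }))\n\n// 2048 MB with an average billed duration of 3,91 sec and the free tier applied: ~$6,37\nconsole.log(\n  lambdaDurationCost({ memoryMb: 2048, billedSeconds: 3.91, invocations: 100000, freeTierGbSeconds: 400000 })\n)\n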
\n<blockquote>\n<p>Data transferred between S3, Glacier, DynamoDB, SES, SQS, Kinesis, ECR, SNS, or SimpleDB and Lambda functions in the same AWS Region is free.</p>
\n<cite>\n<p>AWS Lambda: Pricing</p>\n</cite>\n</blockquote>
\nOtherwise, data transferred into and out of Lambda functions will be charged at the EC2 data transfer rates as listed under the “Data transfer” section.
\nCosts of transcoding 100 000 3 minute (2,8 MB) WebM audio files to MP3 per month:

| Implementation | Cost without free tier | Cost with free tier |
| --- | --- | --- |
| Amazon Elastic Transcoder | ~ $64 | ~ $64 |
| FFmpeg and Lambda Layers | ~ $13 | ~ $6 |
That's a wrap! The post turned out a bit longer than expected, but hopefully it will prove useful in your transcoding adventures.
\nHappy transcoding!
\n","date_published":"2019-10-27T00:00:00.000Z","date_modified":"2023-02-03T00:00:00.000Z","tags":["audio","aws","elastic-transcoder","ffmpeg","lambda","lambda-layers","nodejs","serverless-framework","transcoding","tutorial"]},{"id":"https://www.danillouz.dev/posts/serverless-auth/","url":"https://www.danillouz.dev/posts/serverless-auth/","title":"Serverless auth","summary":"Protecting AWS API Gateway endpoints with AWS Lambda and Auth0.","content_html":"import { Image } from \"astro:assets\"
\nAuth is complicated. It can be difficult to reason about and can be hard to work with. The terminology can be complex as well, and terms are sometimes used interchangeably or can be ambiguous. Like saying \"auth\" to refer both to authentication (who are you?) and authorization (I know who you are, but what are you allowed to do?).
\nOn top of that it can also be challenging to know when to use what. Depending on what you're building and for whom, different auth protocols and strategies might be more suitable or required.
\nIn this post I won't be exploring these protocols and strategies in depth. Instead, I want to show that implementing something as complex as auth doesn't have to be too difficult. In order to do that I'll focus on a specific (but common) use case, and show a way to implement it.
\nIf you just want to read the code, have a look at github.com/danillouz/serverless-auth.
\nHow can we secure an HTTP API with a token based authentication strategy, so only authenticated and authorized clients can access it?
\nMore specifically:
\nI'll be using Auth0 as a third-party auth provider. This means that I'm choosing not to build (nor operate!) my own \"auth server\". So before we get started, I think it's important to explain the motivation behind this decision.
\nIn order to build an auth server you could use:
\nHeader
, Payload
and Signature
which are Base64 encoded and separated by a period. In effect, a JWT can be used as a bearer token[^1].[^1]: You can see how a JWT looks like by visiting jwt.io.
\nAnd with perhaps the help of some other tools and libraries you might be confident enough to build an auth server yourself. But I think that in most cases you shouldn't go down this route[^2]. Why not? Because it will cost a lot of time, energy and money to build, operate and maintain it.
\n[^2]: However, building an auth service yourself is a great learning experience. I think it's quite fun and challenging. And more importantly, you'll get a deeper understanding of the subject, which will be very helpful when you're navigating the \"documentation jungle\" of your favorite auth provider.
\nIf you do have a valid use case, plus enough resources, time and knowledge to build your own auth server, it might make sense for you. But I think that in most cases you should use a third party auth provider instead. Like AWS Cognito or Auth0.
\nThird-party auth providers give you all the fancy tooling, scalable infrastructure and resources you will need to provide a secure, reliable, performant and usable solution. Sure, you'll have to pay for it. But I think the pricing is typically fair. And it will most likely be a small fraction of what it would cost when you'd roll your own solution.
\nAnother sometimes overlooked benefit of choosing \"buy over build\", is that you'll get access to the domain expert's knowledge. Where they can advise and help you choose the best auth strategy for your use case.
\nAnd last but not least. By having someone else deal with the complexities and challenges of auth, you can focus on building your product!
\nOkay, let's get started.
\nWe'll build an Account API with a single endpoint that returns some profile information for a user.
\nRequirements and constraints are:
\nGET /profile
.name
with value Daniël
.200
.Authorization
request header.Authorization
request header value must have the format: Bearer <TOKEN>
.This API isn't very useful, but gives us something to work with in order to implement auth.
\nGET /profile\nAuthorization: Bearer eyJ...lKw\n
\n200 OK\nContent-Type: application/json\n\n{\n \"name\": \"Daniël\"\n}\n
\nWhen the Account API receives a request with the bearer token, it will have to verify the token with the help of Auth0. In order to do that, we first have to register our API with them:
\nAccount API
and https://api.danillouz.dev/account
[^3].RS256
as the signing algorithm (more on that later).[^3]: The \"Identifier\" doesn't have to be a \"real\" endpoint.
\n\nNow that our API is registered, we need to take note of the following (public) properties, to later on configure our Lambda Authorizer:
\nhttps://TENANT_NAME.REGION.auth0.com
. For example https://danillouz.eu.auth0.com
.https://TENANT_NAME.REGION.auth0.com/.well-known/jwks.json
. For example https://danillouz.eu.auth0.com/.well-known/jwks.json
.https://api.danillouz.dev/account
.You can also find these values under the \"Quick Start\" tab of the API details screen (you were redirected there after registering the API). For example, click on the \"Node.js\" tab and look for these properties:
\nissuer
jwksUri
audience
I haven't explained what a Lambda Authorizer is yet. In short, it's a feature of APIG to control access to an API.
\n<blockquote>\n<p>A Lambda authorizer is useful if you want to implement a custom authorization scheme that uses a bearer token authentication strategy such as OAuth.</p>
\n<cite>\n<p>AWS docs: Use API Gateway Lambda authorizers</p>\n</cite>\n</blockquote>
\nThere are actually two types of Lambda Authorizers: token based authorizers (type TOKEN) and request parameter based authorizers (type REQUEST).
\nWe'll be using the token based authorizer, because that supports bearer tokens.
\nWhen a Lambda Authorizer is configured, and a client makes a request to APIG, AWS will invoke the Lambda Authorizer first (i.e. before the Lambda handler). The Lambda Authorizer must then extract the bearer token from the Authorization
request header and validate it by:
[^4]: We get the JWKS URI, issuer and audience values from the Lambda Authorizer configuration.
\nOnly when the token passes these checks should the Lambda Authorizer return an IAM Policy document with \"Effect\"
set to \"Allow\"
:
{\n \"Version\": \"2012-10-17\",\n \"Statement\": [\n {\n \"Action\": \"execute-api:Invoke\",\n \"Effect\": \"Allow\",\n \"Resource\": \"ARN_OF_LAMBDA_HANDLER\"\n }\n ]\n}\n
\nIt's this policy that tells APIG it's allowed to invoke our downstream Lambda handler. In our case that will be the Lambda handler that returns the profile data.
\nAlternatively, the Lambda authorizer may deny invoking the downstream handler by setting \"Effect\"
to \"Deny\"
:
{\n \"Version\": \"2012-10-17\",\n \"Statement\": [\n {\n \"Action\": \"execute-api:Invoke\",\n \"Effect\": \"Deny\",\n \"Resource\": \"ARN_OF_LAMBDA_HANDLER\"\n }\n ]\n}\n
\nThis will make APIG respond with 403 Forbidden
. To make APIG respond with 401 Unauthorized
, return an Unauthorized
error from the Lambda Authorizer. We'll see this in action when implementing the Lambda Authorizer.
I found it good practice to only authenticate the caller from the Lambda Authorizer and apply authorization logic downstream (i.e. in the Lambda handlers).
\nThis may not be feasible in all use cases, but doing this keeps the Lambda Authorizer simple. So I think that ideally the Lambda Authorizer is only responsible for:
\nThe downstream Lambda handler can then use the authorization information to decide if it should execute its business logic for the specific caller or not.
\nFollowing this design also leads to a nice decoupling between the authentication and authorization logic (i.e. between the Lambda Authorizer and Lambda handlers).
\nWhen using OAuth 2.0, scopes can be used to apply authorization logic. In our case we could have a get:profile
scope. And a Lambda handler can check if the caller has been authorized to perform the action that is represented by the scope. If the scope is not present, the Lambda handler can return a 403 Forbidden
response back to the caller.
You can configure scope in the Auth0 dashboard by adding permissions to the registered API. Navigate to the \"Permissions\" tab of the API details screen and add get:profile
as a scope.
We'll use this scope when implementing the Account API. And you can read more about scopes in the Auth0 docs.
\nYou can propagate authorization information (like scopes) downstream by returning a context
object in the Lambda Authorizer's response:
\"use strict\"\n\nmodule.exports.authorizer = (event) => {\n const authResponse = {\n principalId: \"UNIQUE_ID\",\n policyDocument: {\n Version: \"2012-10-17\",\n Statement: [\n {\n Action: \"execute-api:Invoke\",\n Effect: \"Allow\",\n Resource: event.methodArn,\n },\n ],\n },\n context: {\n scope: \"get:profile\",\n },\n }\n\n return authResponse\n}\n
\nBut there's a caveat here. You can not set a JSON serializable object or array as a valid value of any key in the context
object. It can only be a String
, Number
or Boolean
:
context: {\n a: 'value', // ✅ OK\n b: 1, // ✅ OK\n c: true, // ✅ OK\n d: [9, 8, 7], // ❌ Will NOT be serialized\n e: { x: 'value', y: 99, z: false } // ❌ Will NOT be serialized\n}\n
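\nIf you do need to pass structured data downstream, a common workaround (plain JSON serialization, not a special API feature) is to stringify it in the Lambda Authorizer and parse it again in the handler:
\n// In the Lambda Authorizer response:\ncontext: {\n  scopes: JSON.stringify([\"get:profile\", \"update:profile\"]), // ✅ serialized to a String\n}\n\n// And in the downstream Lambda handler:\nconst scopes = JSON.parse(event.requestContext.authorizer.scopes)\n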
\nAny \"valid\" properties passed to the context
object will be made available to downstream Lambda handlers via the event
object:
\"use strict\"\n\nmodule.exports.handler = (event) => {\n const { authorizer } = event.requestContext\n console.log(authorizer.scope) // \"get:profile\"\n}\n
\nWith that covered, we're ready to build the Lambda Authorizer and the Account API. But before we do, let's take a step back and solidify our mental model first.
\nTo summarize, we need the following components to protect our API:
\nGET /profile
endpoint to return the profile data.curl
as the client to send HTTP requests to the API with a token.We can visualize how these components will interact with each other like this.
\n\ncurl
will send an HTTP request to the GET /profile
endpoint with a token via the Authorization
request header.
When the HTTP request reaches APIG, it will check if a Lambda Authorizer is configured for the called endpoint. If so, APIG will invoke the Lambda Authorizer.
\nThe Lambda Authorizer will then:
\nAuthorization
request header.If the token is verified, the Lambda Authorizer will return an IAM Policy document with Effect
set to Allow
.
APIG will now evaluate the IAM Policy and if the Effect
is set to Allow
, it will invoke the specified Lambda handler.
The Lambda handler will execute and when the get:profile
scope is present, it will return the profile data back to the client.
Now for the easy part, writing the code!
\nWe'll do this by:
\nCreate a new directory for the code:
\nmkdir lambda-authorizers\n
\nMove to this directory and initialize a new npm project with:
\nnpm init -y\n
\nThis creates a package.json
file. Now you can install the following required npm dependencies:
npm i jsonwebtoken jwks-rsa\n
\nThe jsonwebtoken library will help us decode the bearer token (a JWT) and verify its signature, issuer and audience claims. The jwks-rsa library will help us fetch the JWKS from Auth0.
\nWe'll use the Serverless Framework to configure and upload the Lambda to AWS, so install it as a dev dependency:
\nnpm i -D serverless\n
\nCreate a Serverless manifest:
\nservice: lambda-authorizers\n\nprovider:\n name: aws\n runtime: nodejs8.10\n stage: ${opt:stage, 'prod'}\n region: ${opt:region, 'eu-central-1'}\n memorySize: 128\n timeout: 3\n\npackage:\n exclude:\n - ./*\n - ./**/*.test.js\n include:\n - node_modules\n - src\n
\nAdd the properties we got from the Lambda Authorizer configuration as environment variables. For example:
\nservice: lambda-authorizers\n\nprovider:\n name: aws\n runtime: nodejs8.10\n stage: ${opt:stage, 'prod'}\n region: ${opt:region, 'eu-central-1'}\n memorySize: 128\n timeout: 3\n environment:\n JWKS_URI: \"https://danillouz.eu.auth0.com/.well-known/jwks.json\"\n TOKEN_ISSUER: \"https://danillouz.eu.auth0.com/\"\n AUDIENCE: \"https://api.danillouz.dev/account\"\n\npackage:\n exclude:\n - ./*\n - ./**/*.test.js\n include:\n - node_modules\n - src\n
\nAnd add the Lambda function definition:
\nservice: lambda-authorizers\n\nprovider:\n name: aws\n runtime: nodejs8.10\n stage: ${opt:stage, 'prod'}\n region: ${opt:region, 'eu-central-1'}\n memorySize: 128\n timeout: 3\n environment:\n JWKS_URI: \"https://danillouz.eu.auth0.com/.well-known/jwks.json\"\n TOKEN_ISSUER: \"https://danillouz.eu.auth0.com/\"\n AUDIENCE: \"https://api.danillouz.dev/account\"\n\npackage:\n exclude:\n - ./*\n - ./**/*.test.js\n include:\n - node_modules\n - src\n\nfunctions:\n auth0VerifyBearer:\n handler: src/auth0.verifyBearer\n description: Verifies the bearer token with the help of Auth0\n
\nIn order to match the Lambda function definition in the Serverless manifest, create a file named auth0.js
in src
. And in that file export a method named verifyBearer
:
\"use strict\"\n\nmodule.exports.verifyBearer = async () => {\n try {\n // Lambda Authorizer implementation goes here.\n } catch (err) {\n console.log(\"Authorizer Error: \", err)\n throw new Error(\"Unauthorized\")\n }\n}\n
\nIf something goes wrong in the Lambda, we'll log the error and throw a new Unauthorized
error. This will make APIG return a 401 Unauthorized
response back to the caller[^5].
[^5]: The thrown error message must match the string \"Unauthorized\"
exactly for this to work.
The Lambda will first have to get the bearer token from the Authorization
request header. Create a helper function for that in src/get-token.js
. And in that file export a function named getToken
:
\"use strict\"\n\nmodule.exports = function getToken(event) {\n if (event.type !== \"TOKEN\") {\n throw new Error('Authorizer must be of type \"TOKEN\"')\n }\n\n const { authorizationToken: bearer } = event\n if (!bearer) {\n throw new Error('Authorization header with \"Bearer TOKEN\" must be provided')\n }\n\n const [, token] = bearer.match(/^Bearer (.*)$/) || []\n if (!token) {\n throw new Error(\"Invalid bearer token\")\n }\n\n return token\n}\n
\nHere we're only interested in TOKEN
events because we're implementing a token based authorizer. And we can access the value of the Authorization
request header via the event.authorizationToken
property.
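\nFor reference, the input event for a token based authorizer looks roughly like this (the values below are illustrative):
\n// Example \"TOKEN\" authorizer event:\nconst event = {\n  type: \"TOKEN\",\n  authorizationToken: \"Bearer eyJ...lKw\",\n  methodArn: \"arn:aws:execute-api:eu-central-1:123456789012:abcdef1234/prod/GET/profile\",\n}\n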
Then require
and call the helper in the Lambda with the APIG HTTP input event as an argument:
\"use strict\"\n\nconst getToken = require(\"./get-token\")\n\nmodule.exports.verifyBearer = async (event) => {\n try {\n const token = getToken(event)\n } catch (err) {\n console.log(\"Authorizer Error: \", err)\n throw new Error(\"Unauthorized\")\n }\n}\n
\nNow we have the token, we need to verify it by:
\nWe'll use another helper function for this. Create one in src/verify-token.js
, and export a function named verifyToken
:
\"use strict\"\n\nmodule.exports = async function verifyToken(\n token,\n decodeJwt,\n getSigningKey,\n verifyJwt,\n issuer,\n audience\n) {\n // Step 1.\n const decoded = decodeJwt(token, { complete: true })\n\n if (!decoded || !decoded.header || !decoded.header.kid) {\n throw new Error(\"Invalid JWT\")\n }\n\n // Step 2.\n const { publicKey, rsaPublicKey } = await getSigningKey(decoded.header.kid)\n const signingKey = publicKey || rsaPublicKey\n\n // Step 3.\n return verifyJwt(token, signingKey, {\n issuer,\n audience,\n })\n}\n
\nAfter we decode the token with the option { complete: true }
, we can access the JWT header
data. And by using the kid JWT claim, we can find out which key was used to sign the token.
When we registered the API with Auth0 we chose the RS256
signing algorithm. This algorithm generates an asymmetric signature. Which basically means that Auth0 uses a private key to sign a JWT when it issues one. And we can use a public key (fetched via the JWKS URI) to verify the authenticity of the token.
First require the helper in the Lambda and pass the token
as the first argument when calling it:
\"use strict\"\n\nconst getToken = require(\"./get-token\")\nconst verifyToken = require(\"./verify-token\")\n\nmodule.exports.verifyBearer = async (event) => {\n try {\n const token = getToken(event)\n const verifiedData = await verifyToken(token)\n } catch (err) {\n console.log(\"Authorizer Error: \", err)\n throw new Error(\"Unauthorized\")\n }\n}\n
\nTo decode the token in the helper (step 1), we'll use the jsonwebtoken
library. It exposes a decode
method. Pass this method as the second argument when calling the helper:
\"use strict\"\n\nconst jwt = require(\"jsonwebtoken\")\n\nconst getToken = require(\"./get-token\")\nconst verifyToken = require(\"./verify-token\")\n\nmodule.exports.verifyBearer = async (event) => {\n try {\n const token = getToken(event)\n const verifiedData = await verifyToken(token, jwt.decode)\n } catch (err) {\n console.log(\"Authorizer Error: \", err)\n throw new Error(\"Unauthorized\")\n }\n}\n
\nTo fetch the public key from Auth0 (step 2) we'll use the jwks-rsa
library. It exposes a client with getSigningKey
 method to fetch the key. Pass a "promisified" version of this method as the third argument when calling the helper:
\"use strict\"\n\nconst util = require(\"util\")\nconst jwt = require(\"jsonwebtoken\")\nconst jwksRSA = require(\"jwks-rsa\")\n\nconst getToken = require(\"./get-token\")\nconst verifyToken = require(\"./verify-token\")\n\nconst { JWKS_URI } = process.env\n\nconst jwksClient = jwksRSA({\n cache: true,\n rateLimit: true,\n jwksUri: JWKS_URI,\n})\nconst getSigningKey = util.promisify(jwksClient.getSigningKey)\n\nmodule.exports.verifyBearer = async (event) => {\n try {\n const token = getToken(event)\n const verifiedData = await verifyToken(token, jwt.decode, getSigningKey)\n } catch (err) {\n console.log(\"Authorizer Error: \", err)\n throw new Error(\"Unauthorized\")\n }\n}\n
\nFinally, to verify the token signature, issuer and audience claims (step 3) we'll use the jsonwebtoken
library again. It exposes a verify
method. Pass a \"promisified\" version of this method together with the TOKEN_ISSUER
and AUDIENCE
as the final arguments when calling the helper:
\"use strict\"\n\nconst util = require(\"util\")\nconst jwt = require(\"jsonwebtoken\")\nconst jwksRSA = require(\"jwks-rsa\")\n\nconst getToken = require(\"./get-token\")\nconst verifyToken = require(\"./verify-token\")\n\nconst { JWKS_URI, TOKEN_ISSUER, AUDIENCE } = process.env\n\nconst jwksClient = jwksRSA({\n cache: true,\n rateLimit: true,\n jwksUri: JWKS_URI,\n})\nconst getSigningKey = util.promisify(jwksClient.getSigningKey)\nconst verifyJwt = util.promisify(jwt.verify)\n\nmodule.exports.verifyBearer = async (event) => {\n try {\n const token = getToken(event)\n const verifiedData = await verifyToken(\n token,\n jwt.decode,\n getSigningKey,\n verifyJwt,\n TOKEN_ISSUER,\n AUDIENCE\n )\n } catch (err) {\n console.log(\"Authorizer Error: \", err)\n throw new Error(\"Unauthorized\")\n }\n}\n
\nWhen the helper verifies the token, it will return the JWT payload data (with all claims) as verifiedData
. For example:
{\n \"iss\": \"https://danillouz.eu.auth0.com/\",\n \"sub\": \"FHgLVARPk8oXjsP5utP8wYAnZePPAkw1@clients\",\n \"aud\": \"https://api.danillouz.dev/account\",\n \"iat\": 1560762850,\n \"exp\": 1560849250,\n \"azp\": \"FHgLVARPk8oXjsP5utP8wYAnZePPAkw1\",\n \"gty\": \"client-credentials\"\n}\n
\nWe'll use verifiedData
to create the authResponse
:
\"use strict\"\n\nconst util = require(\"util\")\nconst jwt = require(\"jsonwebtoken\")\nconst jwksRSA = require(\"jwks-rsa\")\n\nconst getToken = require(\"./get-token\")\nconst verifyToken = require(\"./verify-token\")\n\nconst { JWKS_URI, TOKEN_ISSUER, AUDIENCE } = process.env\n\nconst jwksClient = jwksRSA({\n cache: true,\n rateLimit: true,\n jwksUri: JWKS_URI,\n})\nconst getSigningKey = util.promisify(jwksClient.getSigningKey)\nconst verifyJwt = util.promisify(jwt.verify)\n\nmodule.exports.verifyBearer = async (event) => {\n try {\n const token = getToken(event)\n const verifiedData = await verifyToken(\n token,\n jwt.decode,\n getSigningKey,\n verifyJwt,\n TOKEN_ISSUER,\n AUDIENCE\n )\n const authResponse = {\n principalId: verifiedData.sub,\n policyDocument: {\n Version: \"2012-10-17\",\n Statement: [\n {\n Action: \"execute-api:Invoke\",\n Effect: \"Allow\",\n Resource: event.methodArn,\n },\n ],\n },\n }\n return authResponse\n } catch (err) {\n console.log(\"Authorizer Error: \", err)\n throw new Error(\"Unauthorized\")\n }\n}\n
\nThe authResponse.principalId
property must represent a unique (user) identifier associated with the token sent by the client. Auth0 provides this via the sub
claim and ours has the value:
{\n \"iss\": \"https://danillouz.eu.auth0.com/\",\n \"sub\": \"FHgLVARPk8oXjsP5utP8wYAnZePPAkw1@clients\", // Principal ID\n \"aud\": \"https://api.danillouz.dev/account\",\n \"iat\": 1560762850,\n \"exp\": 1560849250,\n \"azp\": \"FHgLVARPk8oXjsP5utP8wYAnZePPAkw1\",\n \"gty\": \"client-credentials\"\n}\n
\nNote that if you use an Auth0 test token (like we'll do in a bit), the sub
claim will be postfixed with @clients
. This is because Auth0 automatically created a \"Test Application\" for us when we registered the Account API with them. And it's via this application that we obtain the test token, obtained via the client credentials grant (specified by the gty
claim).
In this case the test application represents a \"machine\" and not a user. But that's okay because the machine has a unique identifier the same way a user would have (by means of a client ID). This means that this implementation will also work when using \"user centric\" auth flows like the implicit grant.
\nYou can find the test application in the Auth0 dashboard by navigating to \"Applications\" and selecting \"Account API (Test Application)\".
\n\nThe ARN of the Lambda handler associated with the called endpoint can be obtained from event.methodArn
. APIG will use this ARN to invoke said Lambda handler. In our case this will be the Lambda handler that gets the profile data.
Like mentioned when discussing scopes, Auth0 can provide scopes as authorization information. In order for Auth0 to do this, we need to \"grant\" our client the get:profile
scope. In our case, the client is the \"Test Application\" that has been created for us.
Navigate to the \"APIs\" tab in the \"Test Application\" details and click on the \"right pointing chevron\" (circled in red) to the right of \"Account API\".
\n\nThen check the get:profile
scope, click \"Update\" and click \"Continue\".
Now the configured scope will be a claim on issued test tokens, and part of the verifiedData
:
{\n \"iss\": \"https://danillouz.eu.auth0.com/\",\n \"sub\": \"FHgLVARPk8oXjsP5utP8wYAnZePPAkw1@clients\",\n \"aud\": \"https://api.danillouz.dev/account\",\n \"iat\": 1560762850,\n \"exp\": 1560849250,\n \"azp\": \"FHgLVARPk8oXjsP5utP8wYAnZePPAkw1\",\n \"scope\": \"get:profile\", // Scope is now a claim\n \"gty\": \"client-credentials\"\n}\n
\nSo we can propagate it to downstream Lambda handlers like this:
\n\"use strict\"\n\nconst util = require(\"util\")\nconst jwt = require(\"jsonwebtoken\")\nconst jwksRSA = require(\"jwks-rsa\")\n\nconst getToken = require(\"./get-token\")\nconst verifyToken = require(\"./verify-token\")\n\nconst { JWKS_URI, TOKEN_ISSUER, AUDIENCE } = process.env\n\nconst jwksClient = jwksRSA({\n cache: true,\n rateLimit: true,\n jwksUri: JWKS_URI,\n})\nconst getSigningKey = util.promisify(jwksClient.getSigningKey)\nconst verifyJwt = util.promisify(jwt.verify)\n\nmodule.exports.verifyBearer = async (event) => {\n try {\n const token = getToken(event)\n const verifiedData = await verifyToken(\n token,\n jwt.decode,\n getSigningKey,\n verifyJwt,\n TOKEN_ISSUER,\n AUDIENCE\n )\n const authResponse = {\n principalId: verifiedData.sub,\n policyDocument: {\n Version: \"2012-10-17\",\n Statement: [\n {\n Action: \"execute-api:Invoke\",\n Effect: \"Allow\",\n Resource: event.methodArn,\n },\n ],\n },\n context: {\n scope: verifiedData.scope, // Propagate scope downstream\n },\n }\n return authResponse\n } catch (err) {\n console.log(\"Authorizer Error: \", err)\n throw new Error(\"Unauthorized\")\n }\n}\n
\nFinally, add a release command to the package.json
:
{\n \"scripts\": {\n \"test\": \"echo \\\"Error: no test specified\\\" && exit 1\",\n \"release\": \"serverless deploy --stage prod\"\n },\n \"dependencies\": {\n \"jsonwebtoken\": \"^8.5.1\",\n \"jwks-rsa\": \"^1.5.1\"\n },\n \"devDependencies\": {\n \"serverless\": \"^1.45.1\"\n }\n}\n
\nAnd to upload the Lambda to AWS, sign up and make sure you have your credentials configured. Then release the Lambda by running npm run release
:
Serverless: Packaging service...\nServerless: Excluding development dependencies...\nServerless: Creating Stack...\nServerless: Checking Stack create progress...\nServerless: Stack create finished...\nServerless: Uploading CloudFormation file to S3...\nServerless: Uploading artifacts...\nServerless: Uploading service lambda-authorizers.zip file to S3...\nServerless: Validating template...\nServerless: Updating Stack...\nServerless: Checking Stack update progress...\nServerless: Stack update finished...\nService Information\n\nservice: lambda-authorizers\nstage: prod\nregion: eu-central-1\nstack: lambda-authorizers-prod\nresources: 5\napi keys:\n None\nendpoints:\n None\nfunctions:\n auth0VerifyBearer: lambda-authorizers-prod-auth0VerifyBearer\nlayers:\n None\n
\nNow go to the AWS Console and visit the \"Lambda\" service. Find lambda-authorizers-prod-auth0VerifyBearer
under \"Functions\" and take note of the ARN in the top right corner.
We'll need this to configure the Account API in the next part.
\nWe'll do this by:
\nSimilar to the Lambda Authorizer, create a new directory for the code:
\nmkdir account-api\n
\nMove to this directory and initialize a new npm project with:
\nnpm init -y\n
\nThis creates a package.json
file. Again, we'll use the Serverless Framework to configure and upload the Lambda to AWS, so install it as a dev dependency:
npm i -D serverless\n
\nCreate a Serverless manifest, and add the Lambda function definition for the GET /profile
endpoint handler:
service: account-api\n\nprovider:\n name: aws\n runtime: nodejs8.10\n stage: ${opt:stage, 'prod'}\n region: ${opt:region, 'eu-central-1'}\n memorySize: 128\n timeout: 3\n\npackage:\n exclude:\n - ./*\n - ./**/*.test.js\n include:\n - node_modules\n - src\n\nfunctions:\n getProfile:\n handler: src/handler.getProfile\n description: Gets the user profile data\n events:\n - http:\n path: /profile\n method: get\n
\nIn order to match the Lambda function definition in the Serverless manifest, create a file named handler.js
in src
. And in that file export a method named getProfile
:
\"use strict\"\n\nmodule.exports.getProfile = async () => {\n try {\n // Lambda handler implementation goes here.\n } catch (err) {\n const statusCode = err.code || 500\n return {\n statusCode,\n body: JSON.stringify({\n message: err.message,\n info: err.info,\n }),\n }\n }\n}\n
\nIf something goes wrong in the Lambda, we'll return an error response as HTTP output back to the caller.
\nOtherwise we'll return the profile data:
\n\"use strict\"\n\nmodule.exports.getProfile = async () => {\n try {\n const profileData = {\n name: \"Daniël\",\n }\n return {\n statusCode: 200,\n body: JSON.stringify(profileData),\n }\n } catch (err) {\n const statusCode = err.code || 500\n return {\n statusCode,\n body: JSON.stringify({\n message: err.message,\n info: err.info,\n }),\n }\n }\n}\n
\nBefore we enable auth, let's first release the API to see if we can call the endpoint.
\nAdd a release command to the package.json
:
{\n \"scripts\": {\n \"test\": \"echo \\\"Error: no test specified\\\" && exit 1\",\n \"release\": \"serverless deploy --stage prod\"\n },\n \"devDependencies\": {\n \"serverless\": \"^1.45.1\"\n }\n}\n
\nThen release the Lambda by running npm run release
:
Serverless: Packaging service...\nServerless: Excluding development dependencies...\nServerless: Creating Stack...\nServerless: Checking Stack create progress...\nServerless: Stack create finished...\nServerless: Uploading CloudFormation file to S3...\nServerless: Uploading artifacts...\nServerless: Uploading service account-api.zip file to S3...\nServerless: Validating template...\nServerless: Updating Stack...\nServerless: Checking Stack update progress...\nServerless: Stack update finished...\nService Information\n\nservice: account-api\nstage: prod\nregion: eu-central-1\nstack: account-api-prod\nresources: 10\napi keys:\n None\nendpoints:\n GET - https://9jwh.execute-api.eu-central-1.amazonaws.com/prod/profile\nfunctions:\n getProfile: account-api-prod-getProfile\nlayers:\n None\n
\nNow try to call the endpoint that has been created for you. For example:
\ncurl https://9jwh.execute-api.eu-central-1.amazonaws.com/prod/profile\n
\nIt should return:
\n200 OK\nContent-Type: application/json\n\n{\n \"name\": \"Daniël\"\n}\n
\nNow we know the endpoint is working, we'll protect it by adding a custom authorizer
property in the serverless.yaml
manifest:
service: account-api\n\ncustom:\n authorizer:\n arn: LAMBDA_AUTHORIZER_ARN\n resultTtlInSeconds: 0\n identitySource: method.request.header.Authorization\n identityValidationExpression: '^Bearer [-0-9a-zA-z\\.]*$'\n type: token\n\nprovider:\n name: aws\n runtime: nodejs8.10\n stage: ${opt:stage, 'prod'}\n region: ${opt:region, 'eu-central-1'}\n memorySize: 128\n timeout: 3\n profile: danillouz\n\npackage:\n exclude:\n - ./*\n - ./**/*.test.js\n include:\n - node_modules\n - src\n\nfunctions:\n getProfile:\n handler: src/handler.getProfile\n description: Gets the user profile\n events:\n - http:\n path: /profile\n method: get\n authorizer: ${self:custom.authorizer}\n
\nLet's go over the authorizer
properties:
arn
: must be the value of the Lambda Authorizer ARN we released before.resultTtlInSeconds
: used to cache the IAM Policy document returned from the Lambda Authorizer[^6].identitySource
: where APIG should \"look\" for the bearer token.identityValidationExpression
: the expression used to extract the token from the identitySource
.[^6]: Caching is disabled when set to 0
. When caching is enabled and a policy document has been cached, the Lambda Authorizer will not be executed. According to the AWS docs the default value is 300
seconds and the max value is 3600
seconds.
Now the Lambda Authorizer is configured and we also propagate the get:profile
scope from the Lambda Authorizer, we can check if a caller has been granted the required scope. If not, we'll return a 403 Forbidden
response back to the caller:
\"use strict\"\n\nconst REQUIRED_SCOPE = \"get:profile\"\n\nmodule.exports.getProfile = async (event) => {\n try {\n const { authorizer = {} } = event.requestContext\n const { scope = \"\" } = authorizer\n const hasScope = scope.split(\" \").includes(REQUIRED_SCOPE)\n if (!hasScope) {\n const err = new Error(\"Forbidden\")\n err.code = 403\n err.info = 'scope \"get:profile\" is required'\n throw err\n }\n\n const profileData = {\n name: \"Daniël\",\n }\n return {\n statusCode: 200,\n body: JSON.stringify(profileData),\n }\n } catch (err) {\n const statusCode = err.code || 500\n return {\n statusCode,\n body: JSON.stringify({\n message: err.message,\n info: err.info,\n }),\n }\n }\n}\n
\nNote that the authorizer.scope
is a string and that it may contain more than one scope value. When multiple scopes are configured, they will be space separated like this:
\"get:profile update:profile\"\n
\nDo another release by running npm run release
. And after Serverless finishes, go to the AWS Console and visit the \"API Gateway\" service. Navigate to \"prod-account-api\" and click on the \"GET\" resource under \"/profile\". You should now see that the \"Method Request\" tile has a property \"Auth\" set to auth0VerifyBearer
.
This means our GET /profile
endpoint is properly configured with a Lambda Authorizer. And we now require a bearer token to get the profile data. Let's verify this by making the same curl
request as before (without a token):
curl https://9jwh.execute-api.eu-central-1.amazonaws.com/prod/profile\n
\nIt should return:
\n401 Unauthorized\nContent-Type: application/json\n\n{\n \"message\": \"Unauthorized\"\n}\n
\nWe can get a test token from the Auth0 dashboard by navigating to the \"Test\" tab in the API details screen.
\n\nIf you scroll to the bottom, you'll see a curl
command displayed with a ready-to-use test token:
curl --request GET \\\n --url http://path_to_your_api/ \\\n --header 'authorization: Bearer eyJ...lKw'\n
\nPretty cool, right? Use this, but set the URL to your profile endpoint. For example:
\ncurl --request GET \\\n --url https://9jwh.execute-api.eu-central-1.amazonaws.com/prod/profile \\\n --header 'authorization: Bearer eyJ...lKw'\n
\nThis should return the profile data again:
\n200 OK\nContent-Type: application/json\n\n{\n \"name\": \"Daniël\"\n}\n
\nAlso, sending a token without the required scope should return a 403
:
403 Forbidden\nContent-Type: application/json\n\n{\n \"message\": \"Error: Forbidden\",\n \"info\": \"scope \\\"get:profile\\\" is required\"\n}\n
\nAwesome! We successfully secured our API with a token-based authentication strategy. So only authenticated and authorized clients can access it now!
\nOn a final note, when your API needs to return CORS headers, make sure to add a custom APIG Response as well:
\nservice: account-api\n\ncustom:\n authorizer:\n arn: LAMBDA_AUTHORIZER_ARN\n resultTtlInSeconds: 0\n identitySource: method.request.header.Authorization\n identityValidationExpression: '^Bearer [-0-9a-zA-z\\.]*$'\n type: token\n\nprovider:\n name: aws\n runtime: nodejs8.10\n stage: ${opt:stage, 'prod'}\n region: ${opt:region, 'eu-central-1'}\n memorySize: 128\n timeout: 3\n\npackage:\n exclude:\n - ./*\n - ./**/*.test.js\n include:\n - node_modules\n - src\n\nfunctions:\n getProfile:\n handler: src/handler.getProfile\n description: Gets the user profile\n events:\n - http:\n path: /profile\n method: get\n authorizer: ${self:custom.authorizer}\n\nresources:\n Resources:\n GatewayResponseDefault4XX:\n Type: \"AWS::ApiGateway::GatewayResponse\"\n Properties:\n ResponseParameters:\n gatewayresponse.header.Access-Control-Allow-Origin: \"'*'\"\n gatewayresponse.header.Access-Control-Allow-Headers: \"'*'\"\n ResponseType: DEFAULT_4XX\n RestApiId:\n Ref: \"ApiGatewayRestApi\"\n GatewayResponseDefault5XX:\n Type: \"AWS::ApiGateway::GatewayResponse\"\n Properties:\n ResponseParameters:\n gatewayresponse.header.Access-Control-Allow-Origin: \"'*'\"\n gatewayresponse.header.Access-Control-Allow-Headers: \"'*'\"\n ResponseType: DEFAULT_5XX\n RestApiId:\n Ref: \"ApiGatewayRestApi\"\n
\nWhen the Lambda Authorizer throws an error or returns a \"Deny\" policy, APIG will not execute any Lambda handlers. This means that the CORS settings you added to the Lambda handler won't be applied. That's why we must define additional APIG response resources, to make sure we always return the proper CORS headers.
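\nFor completeness, \"CORS settings in the Lambda handler\" here means returning the headers from your own handler code. A minimal sketch of what that looks like (the wildcard origin is just an example):
\"use strict\"\n\nmodule.exports.getProfile = async () => {\n  return {\n    statusCode: 200,\n    // only returned when APIG actually invokes the handler,\n    // i.e. when the Lambda Authorizer allows the request\n    headers: {\n      \"Access-Control-Allow-Origin\": \"*\",\n    },\n    body: JSON.stringify({ name: \"Daniël\" }),\n  }\n}\n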
\nIn this post I showed a way to implement \"serverless auth\" using a machine client. But you could also use something like Auth0 Lock and implement a user-centric auth flow. This would allow users to sign up and log in to (for example) a web app, and get a token from Auth0. The web app can then use the token to send requests (on behalf of a user) to a protected API.
\nYou can find all code at github.com/danillouz/serverless-auth.
\n","date_published":"2019-06-19T00:00:00.000Z","date_modified":"2023-02-03T00:00:00.000Z","tags":["auth","auth0","api-gateway","aws","jwk","jwt","lambda","nodejs","serverless-framework","tutorial"]},{"id":"https://www.danillouz.dev/posts/lambda-nodejs-event-loop/","url":"https://www.danillouz.dev/posts/lambda-nodejs-event-loop/","title":"AWS Lambda and the Node.js event loop","summary":"Lambda can freeze and thaw its execution context, which can impact Node.js event loop behavior.","content_html":"import { Image } from \"astro:assets\"
\nOne of the more surprising things I learned recently while working with AWS Lambda is how it interacts with the Node.js event loop.
\nLambda is powered by a virtualization technology. And to optimize performance it can freeze and thaw the execution context of your code so it can be reused.
\nThis will make your code run faster, but can impact the \"expected\" event loop behavior. We'll explore this in detail, but before we dive in, let's quickly refresh the Node.js concurrency model.
\nIf you're already familiar with the event loop, you can jump straight to the AWS Lambda section.
\nNode.js is single threaded and the event loop is the concurrency model that allows non-blocking I/O operations to be performed[^1].
\n[^1]: The event loop is what allows Node.js to perform non-blocking I/O operations (despite the fact that JavaScript is single-threaded) by offloading operations to the system kernel whenever possible.
\nHow? Well, we'll have to discuss the call stack and the task queue first.
\nFunction calls form a stack of frames, where each frame represents a single function call.
\nEvery time a function is called, it's pushed onto the stack (i.e. added to the stack). And when the function is done executing, it's popped off the stack (i.e. removed from the stack).
\nThe frames in a stack are popped off in <abbr title=\"Last In First Out\">LIFO</abbr> order.
\n\nEach frame stores information about the invoked function, like the arguments the function was called with and any variables defined inside the called function's body.
\nWhen we execute the following code:
\n\"use strict\"\n\nfunction work() {\n console.log(\"do work\")\n}\n\nfunction main() {\n console.log(\"main start\")\n work()\n console.log(\"main end\")\n}\n\nmain()\n
\nWe can visualize the call stack over time like this.
\n\nWhen the script starts executing, the call stack is empty.
\nmain()
is called, and pushed onto the call stack:
\"use strict\"\n\nfunction work() {\n console.log(\"do work\")\n}\n\nfunction main() {\n console.log(\"main start\")\n work()\n console.log(\"main end\")\n}\n\nmain()\n
\nmain
, console.log(\"main start\")
is called, and pushed onto the call stack:\"use strict\"\n\nfunction work() {\n console.log(\"do work\")\n}\n\nfunction main() {\n console.log(\"main start\")\n work()\n console.log(\"main end\")\n}\n\nmain()\n
\n\nconsole.log
executes, prints main start
, and is popped off the call stack.
main
continues executing, calls work()
, and is pushed onto the call stack:
\"use strict\"\n\nfunction work() {\n console.log(\"do work\")\n}\n\nfunction main() {\n console.log(\"main start\")\n work()\n console.log(\"main end\")\n}\n\nmain()\n
\nwork
, console.log(\"do work\")
is called, and pushed onto the call stack:\"use strict\"\n\nfunction work() {\n console.log(\"do work\")\n}\n\nfunction main() {\n console.log(\"main start\")\n work()\n console.log(\"main end\")\n}\n\nmain()\n
\n\nconsole.log
executes, prints do work
, and is popped off the call stack.
work
finishes executing, and is popped off the call stack.
main
continues executing, calls console.log(\"main end\")
and is pushed onto the call stack:
\"use strict\"\n\nfunction work() {\n console.log(\"do work\")\n}\n\nfunction main() {\n console.log(\"main start\")\n work()\n console.log(\"main end\")\n}\n\nmain()\n
\n\nconsole.log
executes, prints main end
, and is popped off the call stack.
main
finishes executing, and is popped off the call stack. The call stack is empty again and the script finishes executing.
This code didn't interact with any asynchronous (internal) APIs. But when it does (like when calling setTimeout(callback)
) it makes use of the task queue.
Any asynchronous work in the runtime is represented as a task in a queue, or in other words, a message queue.
\nEach message can be thought of as a function that will be called in <abbr title=\"First In First Out\">FIFO</abbr> order to handle said work. For example, the callback provided to the setTimeout
or Promise
API.
Additionally, each message is processed completely before any other message is processed. This means that whenever a function runs it can't be interrupted. This behavior is called run-to-completion and makes it easier to reason about our JavaScript programs.
\nMessages get enqueued (i.e. added to the queue) and at some point messages will be dequeued (i.e. removed from the queue).
\nWhen? How? This is handled by the Event Loop.
\nThe event loop can literally be thought of as a loop that runs forever, where every cycle is referred to as a tick.
\nOn every tick the event loop will check if there's any work in the task queue. If there is, it will execute the task (i.e. call a function), but only if the call stack is empty.
\nThe event loop can be described with the following pseudo code[^2]:
\n[^2]: Taken from MDN.
\nwhile (queue.waitForMessage()) {\n queue.processNextMessage()\n}\n
\nTo summarize:
\nsetTimeout
or Promise
) the corresponding callbacks are eventually added to the task queue.
\nWith that covered, we can explore how the AWS Lambda execution environment interacts with the Node.js event loop.
\nAWS Lambda invokes a Lambda function via an exported handler function, e.g. exports.handler
. When Lambda invokes this handler it calls it with 3 arguments:
handler(event, context, callback)\n
\nThe callback
argument may be used to return information to the caller and to signal that the handler function has completed, so Lambda may end it. For that reason you don't have to call it explicitly. Meaning, if you don't call it Lambda will call it for you[^3].
[^3]: When using Node.js version 8.10
or above, you may also return a Promise
instead of using the callback function. In that case you can also make your handler async
, because async
functions return a Promise
.
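\nIn other words, these two handlers are equivalent ways to complete an invocation (the handler names are just for illustration):
\"use strict\"\n\n// callback style: signal completion explicitly\nexports.handlerWithCallback = (event, context, callback) => {\n  callback(null, { statusCode: 200, body: \"ok\" })\n}\n\n// async style (Node.js 8.10 or above): the returned Promise signals completion\nexports.handlerWithPromise = async (event) => {\n  return { statusCode: 200, body: \"ok\" }\n}\n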
From here on we'll use a simple script as a \"baseline\" to reason about the event loop behavior. Create a file called timeout.js
with the following contents:
\"use strict\"\n\nfunction timeout(ms) {\n console.log(\"timeout start\")\n\n return new Promise((resolve) => {\n setTimeout(() => {\n console.log(`timeout cb fired after ${ms} ms`)\n resolve()\n }, ms)\n })\n}\n\nasync function main() {\n console.log(\"main start\")\n timeout(5e3)\n console.log(\"main end\")\n}\n\nmain()\n
\nWhen we execute this script locally (not via Lambda) with node timeout.js
, the following will print:
main start\ntimeout start\nmain end\ntimeout cb fired after 5000 ms\n
\nThe last message takes 5 seconds to print, but the script does not stop executing before it does.
\nNow let's modify the code from timeout.js
so it's compatible with Lambda:
\"use strict\"\n\nfunction timeout(ms) {\n console.log(\"timeout start\")\n\n return new Promise((resolve) => {\n setTimeout(() => {\n console.log(`timeout cb fired after ${ms} ms`)\n resolve()\n }, ms)\n })\n}\n\nasync function main() {\n console.log(\"main start\")\n timeout(5e3)\n console.log(\"main end\")\n}\n\nexports.handler = main\n
\nYou can create a new function in the AWS Lambda console and paste in the code from above. Run it, sit back and enjoy.
\n\nWait, what? Lambda just ended the handler function without printing the last message timeout cb fired after 5000 ms
. Let's run it again.
It now prints timeout cb fired after 5000 ms
first and then the other ones! So what's going on here?
AWS Lambda takes care of provisioning and managing resources needed to run your functions. When a Lambda function is invoked, an execution context is created for you based on the configuration you provide. The execution context is a temporary runtime environment that initializes any external dependencies of your Lambda function.
\nAfter a Lambda function is called, Lambda maintains the execution context for some time in anticipation of another invocation of the Lambda function (for performance benefits). It freezes the execution context after a Lambda function completes and may choose to reuse (thaw) the same execution context when the Lambda function is called again (but it doesn't have to).
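\nOne way to observe this reuse yourself is with a variable in module scope (a small sketch, separate from the timeout example below):
\"use strict\"\n\n// module scope: initialized once per execution context, not on every invocation\nlet invocationCount = 0\n\nexports.handler = async () => {\n  invocationCount += 1\n  // on a reused (thawed) execution context this number keeps increasing\n  return { statusCode: 200, body: JSON.stringify({ invocationCount }) }\n}\n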
\nIn the AWS docs we can find the following regarding this subject:
\n<blockquote>\n<p> Background processes or callbacks initiated by your Lambda function that did not complete when the function ended resume if AWS Lambda chooses to reuse the Execution Context.</p>
\n<cite>\n<p>AWS docs: Lambda execution environment</p>\n</cite>\n</blockquote>
\nAs well as this somewhat hidden message:
\n<blockquote>\n<p>When the callback is called (explicitly or implicitly), AWS Lambda continues the Lambda function invocation until the event loop is empty.</p>
\n<cite>\n<p>AWS docs: Lambda function handler in Node.js</p>\n</cite>\n</blockquote>
\nLooking further, there's some documentation about the context object. Specifically about a property called callbackWaitsForEmptyEventLoop
. This is what it does:
<blockquote>\n<p>The default value is true
. This property is useful only to modify the default behavior of the callback. By default, the callback will wait until the event loop is empty before freezing the process and returning the results to the caller.</p>
<cite>\n<p>AWS docs: Lambda context object in Node.js</p>\n</cite>\n</blockquote>
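\nSo if you explicitly don't want Lambda to wait for the event loop to drain, you can flip this property in your handler. A minimal sketch:
\"use strict\"\n\nexports.handler = (event, context, callback) => {\n  // return to the caller as soon as the callback is called,\n  // instead of waiting for the event loop to be empty\n  context.callbackWaitsForEmptyEventLoop = false\n\n  setTimeout(() => {\n    // this may only run later, if the execution context is reused\n    console.log(\"background work\")\n  }, 5e3)\n\n  callback(null, \"done\")\n}\n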
\nOkay, so with this information we can make sense of what happened when we executed the code in timeout.js
before. Let's break it down and go over it step by step.
timeout.js
. The call stack is empty.main
is called, and pushed onto to the call stack:\"use strict\"\n\nfunction timeout(ms) {\n console.log(\"timeout start\")\n\n return new Promise((resolve) => {\n setTimeout(() => {\n console.log(`timeout cb fired after ${ms} ms`)\n resolve()\n }, ms)\n })\n}\n\nasync function main() {\n console.log(\"main start\")\n timeout(5e3)\n console.log(\"main end\")\n}\n\nexports.handler = main\n
\n\nmain
, console.log(\"main start\")
is called, and pushed onto the call stack:\"use strict\"\n\nfunction timeout(ms) {\n console.log(\"timeout start\")\n\n return new Promise((resolve) => {\n setTimeout(() => {\n console.log(`timeout cb fired after ${ms} ms`)\n resolve()\n }, ms)\n })\n}\n\nasync function main() {\n console.log(\"main start\")\n timeout(5e3)\n console.log(\"main end\")\n}\n\nexports.handler = main\n
\n\nconsole.log
executes, prints main start
, and is popped off the call stack.main
continues executing, calls timeout(5e3)
, and is pushed onto the call stack:\"use strict\"\n\nfunction timeout(ms) {\n console.log(\"timeout start\")\n\n return new Promise((resolve) => {\n setTimeout(() => {\n console.log(`timeout cb fired after ${ms} ms`)\n resolve()\n }, ms)\n })\n}\n\nasync function main() {\n console.log(\"main start\")\n timeout(5e3)\n console.log(\"main end\")\n}\n\nexports.handler = main\n
\n\ntimeout
, console.log(\"timeout start\")
is called, and pushed onto the call stack:\"use strict\"\n\nfunction timeout(ms) {\n console.log(\"timeout start\")\n\n return new Promise((resolve) => {\n setTimeout(() => {\n console.log(`timeout cb fired after ${ms} ms`)\n resolve()\n }, ms)\n })\n}\n\nasync function main() {\n console.log(\"main start\")\n timeout(5e3)\n console.log(\"main end\")\n}\n\nexports.handler = main\n
\n\nconsole.log
executes, prints timeout start
, and is popped off the call stack.timeout
continues executing, calls new Promise(callback)
on line 6, and is pushed onto the call stack:\"use strict\"\n\nfunction timeout(ms) {\n console.log(\"timeout start\")\n\n return new Promise((resolve) => {\n setTimeout(() => {\n console.log(`timeout cb fired after ${ms} ms`)\n resolve()\n }, ms)\n })\n}\n\nasync function main() {\n console.log(\"main start\")\n timeout(5e3)\n console.log(\"main end\")\n}\n\nexports.handler = main\n
\n\nnew Promise(callback)
executes, it interacts with the Promise
API and passes the provided callback to it. The Promise
API sends the callback to the task queue and now must wait until the call stack is empty before it can execute.new Promise
finishes executing, and is popped of the call stack.timeout
finishes executing, and is popped off the call stack.main
continues executing, calls console.log(\"main end\")
, and is pushed onto the call stack:\"use strict\"\n\nfunction timeout(ms) {\n console.log(\"timeout start\")\n\n return new Promise((resolve) => {\n setTimeout(() => {\n console.log(`timeout cb fired after ${ms} ms`)\n resolve()\n }, ms)\n })\n}\n\nasync function main() {\n console.log(\"main start\")\n timeout(5e3)\n console.log(\"main end\")\n}\n\nexports.handler = main\n
\n\nconsole.log
executes, prints main end
, and is popped off the call stack.main
finishes executing, and is popped off the call stack. The call stack is empty.Promise
callback (step 9) can now be scheduled by the event loop, and is pushed onto the call stack.Promise
callback executes, calls setTimeout(callback, timeout)
on line 7, and is pushed onto the call stack:\"use strict\"\n\nfunction timeout(ms) {\n console.log(\"timeout start\")\n\n return new Promise((resolve) => {\n setTimeout(() => {\n console.log(`timeout cb fired after ${ms} ms`)\n resolve()\n }, ms)\n })\n}\n\nasync function main() {\n console.log(\"main start\")\n timeout(5e3)\n console.log(\"main end\")\n}\n\nexports.handler = main\n
\n\nsetTimeout(callback, timeout)
executes, it interacts with the setTimeout
API and passes the corresponding callback and timeout to it.setTimeout(callback, timeout)
finishes executing and is popped of the call stack. At the same time the setTimeout
API starts counting down the timeout, to schedule the callback function in the future.At this point the call stack and task queue are both empty. At the same time a timeout is counting down (5 seconds), but the corresponding timeout callback has not been scheduled yet. As far as Lambda is concerned, the event loop is empty. So it will freeze the process and return results to the caller!
\nThe interesting part here is that Lambda doesn't immediately destroy its execution context. If we wait more than 5 seconds and run the Lambda again (like in the second run), we see the console message printed from the setTimeout
callback first.
This happens because after the Lambda stopped executing, the execution context was still around. And after waiting for +5 seconds, the setTimeout
API sent the corresponding callback to the task queue:
When we execute the Lambda again (second run), the call stack is empty with a message in the task queue, which can immediately be scheduled by the event loop:
\n\nThis results in timeout cb fired after 5000 ms
being printed first, because it executed before any of the code in our Lambda function:
Obviously this is undesired behavior and you should not write your code in the same way we wrote the code in timeout.js
.
Like stated in the AWS docs, we need to make sure to complete processing all callbacks before our handler exits:
\n<blockquote>\n<p>You should make sure any background processes or callbacks (in case of Node.js) in your code are complete before the code exits.</p>
\n<cite>\n<p>AWS docs: Lambda execution environment</p>\n</cite>\n</blockquote>
\nTherefore we'll make the following change to the code in timeout.js
:
- timeout(5e3)\n+ await timeout(5e3)\n
\nThis change makes sure the handler function does not stop executing until the timeout
function finishes:
\"use strict\"\n\nfunction timeout(ms) {\n console.log(\"timeout start\")\n\n return new Promise((resolve) => {\n setTimeout(() => {\n console.log(`timeout cb fired after ${ms} ms`)\n resolve()\n }, ms)\n })\n}\n\nasync function main() {\n console.log(\"main start\")\n await timeout(5e3)\n console.log(\"main end\")\n}\n\nexports.handler = main\n
\nWhen we run our code with this change, all is well now.
\n\nI intentionally left out some details about the the task queue. There are actually two task queues! One for macrotasks (e.g. setTimeout
) and one for microtasks (e.g. Promise
).
According to the spec, one macrotask should get processed per tick. And after it finishes, all microtasks will be processed within the same tick. While these microtasks are processed they can enqueue more microtasks, which will all be executed in the same tick.
\nFor more information see this article from RisingStack where they go more into detail.
\nThis post was originally published on Medium.
\n","date_published":"2019-05-30T00:00:00.000Z","date_modified":"2023-02-03T00:00:00.000Z","tags":["aws","event-loop","lambda","nodejs"]}]}}