MicrosoftDocs · captainbrosset · May 29, 2026 · May 29, 2026 · May 29, 2026 · May 29, 2026
diff --git a/microsoft-edge/toc.yml b/microsoft-edge/toc.yml
@@ -105,18 +105,22 @@
           href: ./web-platform/site-impacting-changes.md
 # /Release notes
 # =============================================================================
+# Experimental APIs for Microsoft Edge
+    - name: Experimental APIs for Microsoft Edge
+      items:
+        - name: Test experimental APIs and features by using origin trials
+          href: ./origin-trials/index.md
+          displayName: Experimental web platform features  # standard term eg in edge://flags
+
+        - name: Provide user-relevant ads with the Ad Selection API
+          href: ./web-platform/ad-selection-api.md
+# /Experimental APIs for Microsoft Edge
+# =============================================================================
 # AI for the web
     - name: AI for the web
       items:
         - name: Developer APIs
           items:
-            - name: Test experimental APIs and features by using origin trials
-              href: ./origin-trials/index.md
-              displayName: Experimental web platform features  # standard term eg in edge://flags
-
-            - name: Provide user-relevant ads with the Ad Selection API
-              href: ./web-platform/ad-selection-api.md
-
             - name: Prompt a built-in language model with the Prompt API
               href: ./web-platform/prompt-api.md
 
@@ -131,6 +135,9 @@
 
             - name: Detect languages with the Language Detector API
               href: ./web-platform/languagedetector-api.md
+
+            - name: Convert speech to text with the SpeechRecognition API
+              href: ./web-platform/speech-recognition-api.md
 # /AI for the web
 # =============================================================================
 # -----------------------------------------------------------------------------

diff --git a/microsoft-edge/web-platform/speech-recognition-api.md b/microsoft-edge/web-platform/speech-recognition-api.md
@@ -0,0 +1,256 @@
+---
+title: Convert speech to text with the SpeechRecognition API
+description: Convert speech to text with the SpeechRecognition API.
+author: MSEdgeTeam
+ms.author: msedgedevrel
+ms.topic: article
+ms.service: microsoft-edge
+ms.date: 05/29/2026
+---
+# Convert speech to text with the SpeechRecognition API
+
+The SpeechRecognition API is a standard web API that allows you to convert speech from a media file or the device microphone, into text, from your website's JavaScript code. This article focuses on using the SpeechRecognition API with an on-device (or local) speech recognition model that's built into Microsoft Edge.
+
+For more information about the API itself, see [Web Speech API](https://developer.mozilla.org/docs/Web/API/Web_Speech_API), at MDN.
+
+**Detailed contents:**
+* [Availability of the local speech recognition model](#availability-of-the-local-speech-recognition-model)
+* [Alternatives to and benefits of the local speech recognition model](#alternatives-to-and-benefits-of-the-local-speech-recognition-model)
+   * [Model availability](#model-availability)
+* [Enable local speech recognition in Microsoft Edge](#enable-local-speech-recognition-in-microsoft-edge)
+* [See a working example](#see-a-working-example)
+* [Use the SpeechRecognition API with local recognition in your website](#use-the-speechrecognition-api-with-local-recognition-in-your-website)
+   * [Check if the API is supported and instantiate a SpeechRecognition object](#check-if-the-api-is-supported-and-instantiate-a-speechrecognition-object)
+   * [Choose an input language and opt-in to local recognition](#choose-an-input-language-and-opt-in-to-local-recognition)
+   * [Check whether the local model is already installed](#check-whether-the-local-model-is-already-installed)
+   * [Start speech recognition](#start-speech-recognition)
+   * [Stop recognition explicitly and on media end](#stop-recognition-explicitly-and-on-media-end)
+* [Send feedback](#send-feedback)
+* [See also](#see-also)
+
+
+<!-- ====================================================================== -->
+## Availability of the local speech recognition model
+
+The local speech recognition model is available in Microsoft Edge Canary or Dev, starting with version 150.x.y.z <!-- todo add correct version and potential flag and hardware requirements -->
+
+
+<!-- ====================================================================== -->
+## Alternatives to and benefits of the local speech recognition model
+
+<!-- todo -->
+
+
+<!-- ------------------------------ -->
+#### Model availability
+
+An initial download of the model will be required the first time a website uses the local speech recognition model with the SpeechRecognition API.
+
+You can monitor the model download by using the promise that's returned by the SpeechRecognition API `install()` method.  To learn more, see [Check whether the local model is already installed](#check-whether-the-local-model-is-already-installed), below.
+
+
+<!-- ====================================================================== -->
+## Enable local speech recognition in Microsoft Edge
+
+To use the local speech recognition model with the SpeechRecognition API, you need to enable the feature in Microsoft Edge Canary or Dev.  To enable the feature:
+
+1. Make sure you're using the latest version of Microsoft Edge Canary or Dev (version XYZ or newer).  See [Become a Microsoft Edge Insider](https://www.microsoft.com/edge/download/insider).
+
+1. In Microsoft Edge Canary or Dev, open a new tab or window and go to `edge://flags/`.
+
+1. In the search box, at the top of the page, enter **On-Device Speech Recognition**.
+
+   The page is filtered to show the matching flag.
+
+1. Select **Enabled** next to the flag for the API you want to enable:
+
+   ![Flags page of browser](./speech-recognition-apis-images/flag.png)
+
+1. Restart Microsoft Edge Canary or Dev.
+
+
+<!-- ====================================================================== -->
+## See a working example
+
+To see the SpeechRecognition API in action, and review existing code:
+
+1. [Enable local speech recognition in Microsoft Edge](#enable-local-speech-recognition-in-microsoft-edge), as described above.
+
+1. In Microsoft Edge Canary or Dev browser, open a tab or window and go to [SpeechRecognition API](https://microsoftedge.github.io/Demos/built-in-ai/playgrounds/speechrecognition-api/).
+
+1. In the information banner at the top, check the status: it initially reads **...TODO...**:
+
+   ![...TODO...](./speech-recognition-apis-images/....png)
+
+TODO: describe the steps to download the local model.
+
+1. Under **Input language**, choose the language you want to use for speech recognition.
+
+1. Under **Audio source**, use the dropdown menu to select the audi to use for speech recognition:
+
+   * Choose **Microphone** to use your device microphone as the audio source.
+   * Choose **File** to use an audio or video file from your device as the audio source.
+
+1. If you chose **File** as the audio source, click **Choose file** under the **Media file** section, and select an audio or video file from your device.
+
+1. Click **Start** to start transcribing the input audio into text.
+
+   The text transcription is displayed in the page:
+
+   ... TODO screenshot with red boxes around the relevant sections ...
+
+1. To stop converting speech to text, at any time, click the **Stop** button.
+
+   The transcription might also stop automatically after a long period of silence in the input audio.
+
+See also:
+* [/built-in-ai/playgrounds/speechrecognition-api/](https://github.com/MicrosoftEdge/Demos/tree/main/built-in-ai/playgrounds/speechrecognition-api) - Source code for the SpeechRecognition API playground demo.
+
+
+<!-- ====================================================================== -->
+## Use the SpeechRecognition API with local recognition in your website
+
+The following sections describe how to use the SpeechRecognition API with local speech recognition in your website's code.  For more details about the API itself, see [Web Speech API](https://developer.mozilla.org/docs/Web/API/Web_Speech_API), at MDN.
+
+
+<!-- ------------------------------ -->
+#### Check if the API is supported and instantiate a SpeechRecognition object
+
+To ensure that the SpeechRecognition API is supported in the browser, test whether the `SpeechRecognition` object is available:
+
+```js
+if (!window.SpeechRecognition) {
+  console.log("The SpeechRecognition API is not available in this browser.");
+} else {
+  console.log("The SpeechRecognition API is available.");
+
+  const recognition = new SpeechRecognition();
+}
+```
+
+See also:
+* [SpeechRecognition](https://developer.mozilla.org/docs/Web/API/SpeechRecognition), at MDN.
+
+
+<!-- ------------------------------ -->
+#### Choose an input language and opt-in to local recognition
+
+To configure speech recognition by using a local model, choose an input language and set the `processLocally` option: 
+
+```js
+recognition.lang = "en-US";
+recognition.processLocally = true;
+```
+
+As of Microsoft Edge 150.x.y.z, the following input languages are supported for local speech recognition:
+* <!-- todo -->
+
+Language support is expected to expand in future versions.
+
+Also set the `continuous` and `interimResults` options to `true` to receive interim results and transcribe long audio sessions without stopping:
+
+```js
+recognition.continuous = true;
+recognition.interimResults = true;
+```
+
+See also:
+* [SpeechRecognition: lang property](https://developer.mozilla.org/docs/Web/API/SpeechRecognition/lang), at MDN.
+* [SpeechRecognition: processLocally property](https://developer.mozilla.org/docs/Web/API/SpeechRecognition/processLocally), at MDN.
+* [SpeechRecognition: continuous property](https://developer.mozilla.org/docs/Web/API/SpeechRecognition/continuous), at MDN.
+* [SpeechRecognition: interimResults property](https://developer.mozilla.org/docs/Web/API/SpeechRecognition/interimResults), at MDN.
+
+
+<!-- ------------------------------ -->
+#### Check whether the local model is already installed
+
+Before starting recognition, check model availability for your selected language. If the model is not yet installed, trigger the installation and wait for it to complete before starting recognition:
+
+```js
+async function ensureModelReady(lang) {
+  const availability = await SpeechRecognition.available({
+    langs: [lang],
+    processLocally: true,
+  });
+
+  if (availability === "available") {
+    return true;
+  }
+
+  if (availability === "downloadable" || availability === "downloading") {
+    const installed = await SpeechRecognition.install({
+      langs: [lang],
+      processLocally: true,
+    });
+
+    if (!installed) {
+      throw new Error(`Failed to install local model for ${lang}.`);
+    }
+
+    return true;
+  }
+
+   return false;
+}
+```
+
+The promise returned by `SpeechRecognition.install()` resolves when installation succeeds or fails.
+
+See also:
+* [SpeechRecognition: available() static method](https://developer.mozilla.org/docs/Web/API/SpeechRecognition/available_static), at MDN.
+* [SpeechRecognition: install() static method](https://developer.mozilla.org/docs/Web/API/SpeechRecognition/install_static), at MDN.
+
+
+<!-- ------------------------------ -->
+#### Start speech recognition
+
+After you've made sure that the API and model are both ready, to start recognition, use the `start()` method.
+
+When called without a parameter, the `start()` method recognizes audio from the user's microphone:
+
+```js
+recognition.start();
+```
+
+To recognize audio from a media file instead, pass a `MediaStreamTrack` instance as an argument to the `start()` method.  For example, you can create a `MediaStreamTrack` instance by creating a `MediaStreamDestinationNode` instance by using the WebAudio API:
+
+```js
+const audioContext = new AudioContext();
+const mediaStreamDestination = audioContext.createMediaStreamDestination();
+recognition.start(mediaStreamDestination.stream.getAudioTracks()[0]);
+```
+
+See also:
+* [SpeechRecognition: start() method](https://developer.mozilla.org/docs/Web/API/SpeechRecognition/start), at MDN.
+* [Web Audio API](https://developer.mozilla.org/docs/Web/API/Web_Audio_API), at MDN.
+
+
+<!-- ------------------------------ -->
+#### Stop recognition explicitly and on media end
+
+To stop recognition, use the `stop()` method:
+
+```js
+recognition.stop();
+```
+
+You can also choose to stop recognition when the media input ends, by using the `onended` event handler of the media element that you're using as input.  For example, if you're using a `HTMLAudioElement` or `HTMLVideoElement` as the audio source, you can set up the event handler like this:
+
+```js
+mediaElement.onended = () => recognition.stop();
+```
+
+See also:
+* [SpeechRecognition: stop() method](https://developer.mozilla.org/docs/Web/API/SpeechRecognition/stop), at MDN.
+
+
+<!-- ====================================================================== -->
+## Send feedback
+
+We're very interested in hearing your feedback about the local speech recognition model, its performance, or any other improvements you'd like to see for your use-cases.  Please send feedback by adding a comment to [the SpeechRecognition API feedback issue](https://github.com/MicrosoftEdge/MSEdgeExplainers/issues/xyz). <!-- todo -->
+
+
+<!-- ====================================================================== -->
+## See also
+
+<!-- todo -->