podcast-namespace/transcripts/transcripts.md

## Transcript File Format Details

This is the initial spec for the podcast transcript format. There are four possible formats detailed below.

Some transcript implementations are done in-browser. [CORS headers](https://developer.mozilla.org/en-US/docs/Web/HTTP/CORS) 
are required to make these files available from other websites. [A CORS tester is available here](https://cors-test.codehappy.dev/), 
to ensure that transcripts are available within browser-based players.

The examples given below are just for convenience.  In production you should ensure you are conforming to the actual 
spec for each format as defined in it's own documentation.

<br><br>


## HTML

The HTML transcript format provides a solution when a transcript is available but no or limited timecode data is available. HTML transcript files are considered low-fidelity and are
designed to serve as an accessibility aid and provide searchable episode content. The HTML format used for podcast transcripts should adhere to the following specifications.

#### HTML tags used:
- `<cite>`: Name of the speaker (if available)
- `<time>`: Start time of monologue (if available)
- `<p>`: Content of monologue

#### Snippet:
```html
<cite>Kevin:</cite>
<time>0:00</time>
<p>We have an update planned where we would like to give the ability to upload an artwork file for these videos</p>
<cite>Alban :</cite>
<time>0:09</time>
<p>You're triggering Tom right now with a hey, here's a cool feature.</p>
```

Example file:  [example.html](example.html)

<br><br>


## JSON

The JSON representation is a flexible format that accomodates various degrees of fidelity in a concise way. This format for podcast transcripts should adhere to the following specifications.

#### Elements included in this representation:
- `<version>`: The version of JSON transcript specification
- `<segments>`: An array of dialogue elements (segments)
- `<speaker>`: Speaker
- `<startTime>`: Start time for the segment
- `<endTime>`: End time for the segment (if available)
- `<body>`: Dialogue content

#### Snippet:
```json
{
  "version": "1.0.0",
  "segments": [
    {
      "speaker": "Darth Vader",
      "startTime": 0.5,
      "endTime": 0.75,
      "body": "I"
    },
    {
      "speaker": "Darth Vader",
      "startTime": 1,
      "endTime": 1.25,
      "body": "am"
    },
    {
      "speaker": "Darth Vader",
      "startTime": 1.5,
      "endTime": 2.0,
      "body": "your"
    },
    {
      "speaker": "Darth Vader",
      "startTime": 2.25,
      "endTime": 2.50,
      "body": "father.\n"
    },
    {
      "speaker": "Luke",
      "startTime": 2.75,
      "endTime": 3.0,
      "body": "Nooooo"
    }
  ]
}
```

Example file:  [example.json](example.json)

<br><br>


## SRT

The SRT format was designed for video captions but provides a suitable solution for podcast transcripts. The SRT format contains medium-fidelity timestamps and are a
popular export option from transcription services. SRT transcripts used for podcasts should adhere to the following specifications.

#### Properties:
- Max number of lines: 2
- Max characters per line: 32
- Speaker names (optional): Start a new card when the speaker changes. Include the speaker's name, followed by a colon.

#### Snippet:
```
1
00:00:00,000 --> 00:00:02,760
Sarah: In today's episode,
you'll learn whether or not you

2
00:00:02,760 --> 00:00:06,090
should have a podcast trailer.
And if so, what should you

3
00:00:06,090 --> 00:00:11,610
include in one? Welcome to
Podcasting Q&A, where you learn

4
00:00:11,610 --> 00:00:15,750
the best tips and strategies to
launch, grow and monetize your

5
00:00:15,750 --> 00:00:18,630
podcast. This week's question
comes from Gillian.

6
00:00:19,080 --> 00:00:21,450
Gillian: Hi Buzzsprout, Gillian
here from breaking through

7
00:00:21,450 --> 00:00:25,350
careers podcast. My question is,
do we need a podcast trailer?
```

Example file:  [example.srt](example.srt)


## WebVTT

Web Video Text Tracks Format (WebVTT) are an alternative to SRT primarily designed for the use in HTML on the web.  It is supported in all major web browsers and is similar enough to SRT to be converted.

### Differences from SRT taken from [Wikipedia](https://en.wikipedia.org/wiki/WebVTT):
- WebVTT's first line starts with WEBVTT after the optional UTF-8 byte order mark
- There is space for optional header data between the first line and the first cue
- Timecode fractional values are separated by a full stop instead of a comma
- Timecode hours are optional
- The frame numbering/identification preceding the timecode is optional
- Comments identified by the word NOTE can be added
- Metadata information can be added in a JSON-style format
- Chapter information can be optionally specified
- Only supports extended characters as UTF-8
- CSS in a separate file defined in the companion HTML document for C tags is used instead of the FONT tag
- Cue settings allow the customization of cue positioning on the video

#### Properties:
- Speaker names (optional): Speakers can be included in a voice span tag `<v>` at the beginning of each caption.

#### Snippet:
```
WEBVTT

00:00:00.000 --> 00:00:05.000
<v John>Podcasting 2.0 is really changing the game.

00:00:05.000 --> 00:00:10.000
<v Tom>Yeah, absolutely. The new features are incredible.

00:00:10.000 --> 00:00:15.000
<v Tom>It's amazing how it's empowering creators like never before.

00:00:15.000 --> 00:00:20.000
<v Tom>And the enhanced monetization options are a game-changer.

00:00:20.000 --> 00:00:25.000
<v John>Exactly, Tom. It's revolutionizing the industry.

00:00:25.000 --> 00:00:30.000
<v Tom>No doubt about it. Podcasting 2.0 is the future.

00:00:30.000 --> 00:00:35.000
<v John>Couldn't agree more, Tom. The future looks bright.
```

Example file:  [example.vtt](example.vtt)
links 2020-10-21 17:07:19 +00:00			`## Transcript File Format Details`

Add CORS header requirement 2023-01-10 07:26:32 +00:00			`This is the initial spec for the podcast transcript format. There are four possible formats detailed below.`

Correct WebVTT example - addresses #594 2024-01-26 18:05:08 +00:00			`Some transcript implementations are done in-browser. [CORS headers](https://developer.mozilla.org/en-US/docs/Web/HTTP/CORS)`
			`are required to make these files available from other websites. [A CORS tester is available here](https://cors-test.codehappy.dev/),`
			`to ensure that transcripts are available within browser-based players.`
transcript format detail 2020-10-21 16:58:54 +00:00
Correct WebVTT example - addresses #594 2024-01-26 18:05:08 +00:00			`The examples given below are just for convenience. In production you should ensure you are conforming to the actual`
			`spec for each format as defined in it's own documentation.`
transcript format detail 2020-10-21 16:58:54 +00:00
links 2020-10-21 17:07:19 +00:00			`<br><br>`


transcript format detail 2020-10-21 16:58:54 +00:00			`## HTML`

			`The HTML transcript format provides a solution when a transcript is available but no or limited timecode data is available. HTML transcript files are considered low-fidelity and are`
			`designed to serve as an accessibility aid and provide searchable episode content. The HTML format used for podcast transcripts should adhere to the following specifications.`

formatting 2020-10-21 17:02:44 +00:00			`#### HTML tags used:`
			- `<cite>`: Name of the speaker (if available)
			- `<time>`: Start time of monologue (if available)
			- `<p>`: Content of monologue

			`#### Snippet:`
Add highlighting hint to code blocks 2021-04-19 03:35:20 +00:00			```html
formatting 2020-10-21 17:02:44 +00:00			`<cite>Kevin:</cite>`
			`<time>0:00</time>`
			`<p>We have an update planned where we would like to give the ability to upload an artwork file for these videos</p>`
			`<cite>Alban :</cite>`
			`<time>0:09</time>`
			`<p>You're triggering Tom right now with a hey, here's a cool feature.</p>`
			```
transcript format detail 2020-10-21 16:58:54 +00:00
links 2020-10-21 17:07:19 +00:00			`Example file: [example.html](example.html)`

transcript format detail 2020-10-21 16:58:54 +00:00			`<br><br>`


			`## JSON`

			`The JSON representation is a flexible format that accomodates various degrees of fidelity in a concise way. This format for podcast transcripts should adhere to the following specifications.`

formatting 2020-10-21 17:02:44 +00:00			`#### Elements included in this representation:`
			- `<version>`: The version of JSON transcript specification
			- `<segments>`: An array of dialogue elements (segments)
			- `<speaker>`: Speaker
Replaced underscored with camel case (typo) 2020-10-30 15:59:51 +00:00			- `<startTime>`: Start time for the segment
			- `<endTime>`: End time for the segment (if available)
formatting 2020-10-21 17:02:44 +00:00			- `<body>`: Dialogue content

			`#### Snippet:`
Add highlighting hint to code blocks 2021-04-19 03:35:20 +00:00			```json
formatting 2020-10-21 17:02:44 +00:00			`{`
			`"version": "1.0.0",`
			`"segments": [`
			`{`
			`"speaker": "Darth Vader",`
			`"startTime": 0.5,`
			`"endTime": 0.75,`
			`"body": "I"`
			`},`
			`{`
			`"speaker": "Darth Vader",`
			`"startTime": 1,`
			`"endTime": 1.25,`
			`"body": "am"`
			`},`
			`{`
			`"speaker": "Darth Vader",`
			`"startTime": 1.5,`
			`"endTime": 2.0,`
			`"body": "your"`
			`},`
			`{`
			`"speaker": "Darth Vader",`
			`"startTime": 2.25,`
			`"endTime": 2.50,`
Update transcripts.md Adding a line break to the sample JSON. 2024-04-25 20:26:49 +00:00			`"body": "father.\n"`
formatting 2020-10-21 17:02:44 +00:00			`},`
transcript format detail 2020-10-21 16:58:54 +00:00			`{`
formatting 2020-10-21 17:02:44 +00:00			`"speaker": "Luke",`
			`"startTime": 2.75,`
			`"endTime": 3.0,`
			`"body": "Nooooo"`
transcript format detail 2020-10-21 16:58:54 +00:00			`}`
formatting 2020-10-21 17:02:44 +00:00			`]`
			`}`
			```
transcript format detail 2020-10-21 16:58:54 +00:00
links 2020-10-21 17:07:19 +00:00			`Example file: [example.json](example.json)`

transcript format detail 2020-10-21 16:58:54 +00:00			`<br><br>`


			`## SRT`

			`The SRT format was designed for video captions but provides a suitable solution for podcast transcripts. The SRT format contains medium-fidelity timestamps and are a`
			`popular export option from transcription services. SRT transcripts used for podcasts should adhere to the following specifications.`

formatting 2020-10-21 17:02:44 +00:00			`#### Properties:`
			`- Max number of lines: 2`
			`- Max characters per line: 32`
			`- Speaker names (optional): Start a new card when the speaker changes. Include the speaker's name, followed by a colon.`

			`#### Snippet:`
			```
			`1`
			`00:00:00,000 --> 00:00:02,760`
			`Sarah: In today's episode,`
			`you'll learn whether or not you`

			`2`
			`00:00:02,760 --> 00:00:06,090`
			`should have a podcast trailer.`
			`And if so, what should you`

			`3`
			`00:00:06,090 --> 00:00:11,610`
			`include in one? Welcome to`
			`Podcasting Q&A, where you learn`

			`4`
			`00:00:11,610 --> 00:00:15,750`
			`the best tips and strategies to`
			`launch, grow and monetize your`

			`5`
			`00:00:15,750 --> 00:00:18,630`
			`podcast. This week's question`
			`comes from Gillian.`

			`6`
			`00:00:19,080 --> 00:00:21,450`
			`Gillian: Hi Buzzsprout, Gillian`
			`here from breaking through`

			`7`
			`00:00:21,450 --> 00:00:25,350`
			`careers podcast. My question is,`
			`do we need a podcast trailer?`
			```
links 2020-10-21 17:07:19 +00:00
			`Example file: [example.srt](example.srt)`
Initial WebVTT documentation 2021-03-31 01:44:51 +00:00

			`## WebVTT`

			`Web Video Text Tracks Format (WebVTT) are an alternative to SRT primarily designed for the use in HTML on the web. It is supported in all major web browsers and is similar enough to SRT to be converted.`

			`### Differences from SRT taken from [Wikipedia](https://en.wikipedia.org/wiki/WebVTT):`
			`- WebVTT's first line starts with WEBVTT after the optional UTF-8 byte order mark`
			`- There is space for optional header data between the first line and the first cue`
			`- Timecode fractional values are separated by a full stop instead of a comma`
			`- Timecode hours are optional`
			`- The frame numbering/identification preceding the timecode is optional`
			`- Comments identified by the word NOTE can be added`
			`- Metadata information can be added in a JSON-style format`
			`- Chapter information can be optionally specified`
			`- Only supports extended characters as UTF-8`
			`- CSS in a separate file defined in the companion HTML document for C tags is used instead of the FONT tag`
			`- Cue settings allow the customization of cue positioning on the video`

			`#### Properties:`
Updating the specs for WebVTT 2024-04-03 19:25:10 +00:00			- Speaker names (optional): Speakers can be included in a voice span tag `<v>` at the beginning of each caption.
Initial WebVTT documentation 2021-03-31 01:44:51 +00:00
			`#### Snippet:`
			```
			`WEBVTT`

Updating the specs for WebVTT 2024-04-03 19:25:10 +00:00			`00:00:00.000 --> 00:00:05.000`
			`<v John>Podcasting 2.0 is really changing the game.`
Initial WebVTT documentation 2021-03-31 01:44:51 +00:00
Updating the specs for WebVTT 2024-04-03 19:25:10 +00:00			`00:00:05.000 --> 00:00:10.000`
			`<v Tom>Yeah, absolutely. The new features are incredible.`
Initial WebVTT documentation 2021-03-31 01:44:51 +00:00
Updating the specs for WebVTT 2024-04-03 19:25:10 +00:00			`00:00:10.000 --> 00:00:15.000`
			`<v Tom>It's amazing how it's empowering creators like never before.`
Initial WebVTT documentation 2021-03-31 01:44:51 +00:00
Updating the specs for WebVTT 2024-04-03 19:25:10 +00:00			`00:00:15.000 --> 00:00:20.000`
			`<v Tom>And the enhanced monetization options are a game-changer.`
Initial WebVTT documentation 2021-03-31 01:44:51 +00:00
Updating the specs for WebVTT 2024-04-03 19:25:10 +00:00			`00:00:20.000 --> 00:00:25.000`
			`<v John>Exactly, Tom. It's revolutionizing the industry.`
Initial WebVTT documentation 2021-03-31 01:44:51 +00:00
Updating the specs for WebVTT 2024-04-03 19:25:10 +00:00			`00:00:25.000 --> 00:00:30.000`
			`<v Tom>No doubt about it. Podcasting 2.0 is the future.`
Initial WebVTT documentation 2021-03-31 01:44:51 +00:00
Updating the specs for WebVTT 2024-04-03 19:25:10 +00:00			`00:00:30.000 --> 00:00:35.000`
			`<v John>Couldn't agree more, Tom. The future looks bright.`
Initial WebVTT documentation 2021-03-31 01:44:51 +00:00			```

			`Example file: [example.vtt](example.vtt)`