pull/93/head
Dave Jones 2020-10-21 12:02:44 -05:00
parent 3693cbd15e
commit 794ce4ee5c
1 changed file with 91 additions and 91 deletions

@@ -6,12 +6,12 @@ This is the initial spec for the podcast transcript format. There are three pos
The HTML transcript format provides a solution when a transcript is available but no or limited timecode data is available. HTML transcript files are considered low-fidelity and are The HTML transcript format provides a solution when a transcript is available but no or limited timecode data is available. HTML transcript files are considered low-fidelity and are
designed to serve as an accessibility aid and provide searchable episode content. The HTML format used for podcast transcripts should adhere to the following specifications. designed to serve as an accessibility aid and provide searchable episode content. The HTML format used for podcast transcripts should adhere to the following specifications.
### HTML tags used: #### HTML tags used:
- `<cite>`: Name of the speaker (if available) - `<cite>`: Name of the speaker (if available)
- `<time>`: Start time of monologue (if available) - `<time>`: Start time of monologue (if available)
- `<p>`: Content of monologue - `<p>`: Content of monologue
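
The three tags above can be illustrated with a minimal Python sketch that renders one monologue. The function name `render_monologue` and its parameters are hypothetical and not part of the spec; the sketch assumes that `<cite>` and `<time>` are simply omitted when the speaker or start time is unavailable, and that the speaker name is followed by a colon as in the snippet.

```python
from html import escape


def render_monologue(body, speaker=None, start=None):
    """Render one monologue using the <cite>, <time> and <p> tags.

    Hypothetical helper: <cite> and <time> are skipped when the
    corresponding value is not available (an assumption, see lead-in).
    """
    parts = []
    if speaker is not None:
        parts.append(f"<cite>{escape(speaker)}:</cite>")
    if start is not None:
        parts.append(f"<time>{escape(start)}</time>")
    # The monologue text is always present and is HTML-escaped.
    parts.append(f"<p>{escape(body)}</p>")
    return "\n".join(parts)


html_fragment = render_monologue("Hi & welcome", speaker="Kevin", start="0:00")
print(html_fragment)
```

Escaping the text with `html.escape` keeps characters such as `&` or `<` in the dialogue from breaking the markup.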
### Snippet: #### Snippet:
``` ```
<cite>Kevin:</cite> <cite>Kevin:</cite>
<time>0:00</time> <time>0:00</time>
@@ -28,7 +28,7 @@ designed to serve as an accessibility aid and provide searchable episode content
The JSON representation is a flexible format that accommodates various degrees of fidelity in a concise way. This format for podcast transcripts should adhere to the following specifications. The JSON representation is a flexible format that accommodates various degrees of fidelity in a concise way. This format for podcast transcripts should adhere to the following specifications.
### Elements included in this representation: #### Elements included in this representation:
- `<version>`: The version of JSON transcript specification - `<version>`: The version of JSON transcript specification
- `<segments>`: An array of dialogue elements (segments) - `<segments>`: An array of dialogue elements (segments)
- `<speaker>`: Speaker - `<speaker>`: Speaker
@@ -36,7 +36,7 @@ The JSON representation is a flexible format that accommodates various degrees of
- `<end_time>`: End time for the segment (if available) - `<end_time>`: End time for the segment (if available)
- `<body>`: Dialogue content - `<body>`: Dialogue content
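
As a rough illustration of the elements listed above, the following Python sketch assembles a transcript object and serializes it. The helper `build_transcript` is hypothetical. The key `start_time` is an assumption made by analogy with `end_time`, and representing times as seconds in floating point is also an assumption; the spec text shown here confirms neither.

```python
import json


def build_transcript(segments, version="1.0.0"):
    """Assemble a transcript dict from a list of segment dicts.

    Hypothetical helper: optional time fields are dropped when they
    are missing, so lower-fidelity transcripts stay concise.
    """
    out = []
    for seg in segments:
        item = {"speaker": seg["speaker"], "body": seg["body"]}
        # "start_time" is an assumed key name; "end_time" is from the list above.
        for key in ("start_time", "end_time"):
            if seg.get(key) is not None:
                item[key] = seg[key]
        out.append(item)
    return {"version": version, "segments": out}


transcript = build_transcript([
    {"speaker": "Kevin", "start_time": 0.0, "end_time": 2.76,
     "body": "Hello and welcome."},
    {"speaker": "Dave", "body": "Thanks for having me."},
])
print(json.dumps(transcript, indent=2))
```

The second segment carries no timing data, which shows how one structure can hold both timed and untimed dialogue.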
### Snippet: #### Snippet:
``` ```
{ {
"version": "1.0.0", "version": "1.0.0",
@@ -83,12 +83,12 @@ The JSON representation is a flexible format that accommodates various degrees of
The SRT format was designed for video captions but provides a suitable solution for podcast transcripts. The SRT format contains medium-fidelity timestamps and is a The SRT format was designed for video captions but provides a suitable solution for podcast transcripts. The SRT format contains medium-fidelity timestamps and is a
popular export option from transcription services. SRT transcripts used for podcasts should adhere to the following specifications. popular export option from transcription services. SRT transcripts used for podcasts should adhere to the following specifications.
### Properties: #### Properties:
- Max number of lines: 2 - Max number of lines: 2
- Max characters per line: 32 - Max characters per line: 32
- Speaker names (optional): Start a new card when the speaker changes. Include the speaker's name, followed by a colon. - Speaker names (optional): Start a new card when the speaker changes. Include the speaker's name, followed by a colon.
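
The line and character limits above lend themselves to a simple automated check. The sketch below is one possible approach, not part of the spec: `card_problems` is a hypothetical function that takes the text lines of a single card (without the index and timestamp lines) and reports any violation of the two limits. It assumes that a speaker prefix such as `Kevin:` counts toward the 32 characters.

```python
MAX_LINES = 2    # "Max number of lines" from the properties above
MAX_CHARS = 32   # "Max characters per line" from the properties above


def card_problems(text_lines):
    """Return a list of human-readable problems for one card's text lines."""
    problems = []
    if len(text_lines) > MAX_LINES:
        problems.append(f"{len(text_lines)} lines (max {MAX_LINES})")
    for number, line in enumerate(text_lines, start=1):
        if len(line) > MAX_CHARS:
            problems.append(
                f"line {number} has {len(line)} characters (max {MAX_CHARS})"
            )
    return problems


ok_card = ["Kevin: Hello and welcome", "to the show."]
bad_card = ["x" * 33, "second line", "third line"]
print(card_problems(ok_card))
print(card_problems(bad_card))
```

An empty list means the card satisfies both limits; the second call reports one problem for the extra line and one for the overlong line.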
### Snippet: #### Snippet:
``` ```
1 1
00:00:00,000 --> 00:00:02,760 00:00:00,000 --> 00:00:02,760