pull/93/head
Dave Jones 2020-10-21 12:02:44 -05:00
parent 3693cbd15e
commit 794ce4ee5c
1 changed file with 91 additions and 91 deletions

@@ -6,12 +6,12 @@ This is the initial spec for the podcast transcript format. There are three pos
The HTML transcript format provides a solution when a transcript is available but timecode data is limited or missing. HTML transcript files are considered low-fidelity and are
designed to serve as an accessibility aid and provide searchable episode content. The HTML format used for podcast transcripts should adhere to the following specifications.
-### HTML tags used:
+#### HTML tags used:
- `<cite>`: Name of the speaker (if available)
- `<time>`: Start time of monologue (if available)
- `<p>`: Content of monologue
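As a rough illustration of how a client might read these three tags, here is a minimal sketch using Python's standard `html.parser`. The class name `TranscriptParser`, the helper `parse_html_transcript`, and the output keys are hypothetical and not part of the spec. The sketch also assumes each monologue carries its own `<cite>` and `<time>`, so pending values are cleared after every `<p>`.

```python
from html.parser import HTMLParser


class TranscriptParser(HTMLParser):
    """Collect one record per <p>, paired with the preceding <cite>/<time>."""

    def __init__(self):
        super().__init__()
        self.segments = []
        self._tag = None      # tag currently being read, if any
        self._buf = []        # text collected inside that tag
        self._speaker = None  # pending speaker name (optional in the spec)
        self._start = None    # pending start time (optional in the spec)

    def handle_starttag(self, tag, attrs):
        if tag in ("cite", "time", "p"):
            self._tag = tag
            self._buf = []

    def handle_data(self, data):
        if self._tag is not None:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag != self._tag:
            return
        text = "".join(self._buf).strip()
        if tag == "cite":
            # Drop the trailing colon shown in the spec's snippet ("Kevin:").
            self._speaker = text.rstrip(":").strip() or None
        elif tag == "time":
            self._start = text or None
        else:
            self.segments.append(
                {"speaker": self._speaker, "start_time": self._start, "body": text}
            )
            # Assumption: speaker and time apply to a single monologue only.
            self._speaker = None
            self._start = None
        self._tag = None


def parse_html_transcript(html):
    """Return a list of dicts with speaker, start_time, and body."""
    parser = TranscriptParser()
    parser.feed(html)
    parser.close()
    return parser.segments
```

Because `<cite>` and `<time>` are optional, a paragraph without them comes back with `None` for the missing fields rather than raising an error.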
-### Snippet:
+#### Snippet:
```
<cite>Kevin:</cite>
<time>0:00</time>
@@ -28,7 +28,7 @@ designed to serve as an accessibility aid and provide searchable episode content
The JSON representation is a flexible format that accommodates various degrees of fidelity in a concise way. This format for podcast transcripts should adhere to the following specifications.
-### Elements included in this representation:
+#### Elements included in this representation:
- `<version>`: The version of JSON transcript specification
- `<segments>`: An array of dialogue elements (segments)
- `<speaker>`: Speaker
@@ -36,7 +36,7 @@ The JSON representation is a flexible format that accommodates various degrees of
- `<end_time>`: End time for the segment (if available)
- `<body>`: Dialogue content
-### Snippet:
+#### Snippet:
```
{
"version": "1.0.0",
@@ -83,12 +83,12 @@ The JSON representation is a flexible format that accommodates various degrees of
The SRT format was designed for video captions but provides a suitable solution for podcast transcripts. SRT files contain medium-fidelity timestamps and are a
popular export option from transcription services. SRT transcripts used for podcasts should adhere to the following specifications.
-### Properties:
+#### Properties:
- Max number of lines: 2
- Max characters per line: 32
- Speaker names (optional): Start a new card when the speaker changes. Include the speaker's name, followed by a colon.
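The line and character limits above are easy to check mechanically. The following is a minimal sketch of such a check in Python, not a reference validator. The function name `check_srt` and the message wording are hypothetical, and the sketch assumes cards are separated by blank lines and that the timing line carries nothing after the end timestamp.

```python
import re

MAX_LINES = 2   # "Max number of lines" from the properties list
MAX_CHARS = 32  # "Max characters per line" from the properties list

# Assumed timing layout: HH:MM:SS,mmm --> HH:MM:SS,mmm
TIMING = re.compile(r"^\d{2}:\d{2}:\d{2},\d{3} --> \d{2}:\d{2}:\d{2},\d{3}$")


def check_srt(text):
    """Return a list of (card_position, message) problems; empty means OK."""
    problems = []
    # Cards are assumed to be separated by one or more blank lines.
    cards = re.split(r"\n\s*\n", text.strip())
    for position, card in enumerate(cards, start=1):
        lines = card.splitlines()
        if len(lines) < 2 or not TIMING.match(lines[1].strip()):
            problems.append((position, "missing or malformed timing line"))
            continue
        body = lines[2:]  # everything after the index and timing lines
        if len(body) > MAX_LINES:
            problems.append((position, f"{len(body)} text lines (max {MAX_LINES})"))
        for line in body:
            if len(line) > MAX_CHARS:
                problems.append(
                    (position, f"line has {len(line)} characters (max {MAX_CHARS})")
                )
    return problems
```

The sketch counts a speaker prefix such as `Kevin:` toward the 32-character limit, since the properties list does not say otherwise.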
-### Snippet:
+#### Snippet:
```
1
00:00:00,000 --> 00:00:02,760