kopia lustrzana https://github.com/Podcastindex-org/podcast-namespace
formatting
rodzic
3693cbd15e
commit
794ce4ee5c
|
@ -6,12 +6,12 @@ This is the initial spec for the podcast transcript format. There are three pos
|
||||||
The HTML transcript format provides a solution when a transcript is available but no or limited timecode data is available. HTML transcript files are considered low-fidelity and are
|
The HTML transcript format provides a solution when a transcript is available but no or limited timecode data is available. HTML transcript files are considered low-fidelity and are
|
||||||
designed to serve as an accessibility aid and provide searchable episode content. The HTML format used for podcast transcripts should adhere to the following specifications.
|
designed to serve as an accessibility aid and provide searchable episode content. The HTML format used for podcast transcripts should adhere to the following specifications.
|
||||||
|
|
||||||
### HTML tags used:
|
#### HTML tags used:
|
||||||
- `<cite>`: Name of the speaker (if available)
|
- `<cite>`: Name of the speaker (if available)
|
||||||
- `<time>`: Start time of monologue (if available)
|
- `<time>`: Start time of monologue (if available)
|
||||||
- `<p>`: Content of monologue
|
- `<p>`: Content of monologue
|
||||||
|
|
||||||
### Snippet:
|
#### Snippet:
|
||||||
```
|
```
|
||||||
<cite>Kevin:</cite>
|
<cite>Kevin:</cite>
|
||||||
<time>0:00</time>
|
<time>0:00</time>
|
||||||
|
@ -28,7 +28,7 @@ designed to serve as an accessibility aid and provide searchable episode content
|
||||||
|
|
||||||
The JSON representation is a flexible format that accomodates various degrees of fidelity in a concise way. This format for podcast transcripts should adhere to the following specifications.
|
The JSON representation is a flexible format that accomodates various degrees of fidelity in a concise way. This format for podcast transcripts should adhere to the following specifications.
|
||||||
|
|
||||||
### Elements included in this representation:
|
#### Elements included in this representation:
|
||||||
- `<version>`: The version of JSON transcript specification
|
- `<version>`: The version of JSON transcript specification
|
||||||
- `<segments>`: An array of dialogue elements (segments)
|
- `<segments>`: An array of dialogue elements (segments)
|
||||||
- `<speaker>`: Speaker
|
- `<speaker>`: Speaker
|
||||||
|
@ -36,7 +36,7 @@ The JSON representation is a flexible format that accomodates various degrees of
|
||||||
- `<end_time>`: End time for the segment (if available)
|
- `<end_time>`: End time for the segment (if available)
|
||||||
- `<body>`: Dialogue content
|
- `<body>`: Dialogue content
|
||||||
|
|
||||||
### Snippet:
|
#### Snippet:
|
||||||
```
|
```
|
||||||
{
|
{
|
||||||
"version": "1.0.0",
|
"version": "1.0.0",
|
||||||
|
@ -83,12 +83,12 @@ The JSON representation is a flexible format that accomodates various degrees of
|
||||||
The SRT format was designed for video captions but provides a suitable solution for podcast transcripts. The SRT format contains medium-fidelity timestamps and are a
|
The SRT format was designed for video captions but provides a suitable solution for podcast transcripts. The SRT format contains medium-fidelity timestamps and are a
|
||||||
popular export option from transcription services. SRT transcripts used for podcasts should adhere to the following specifications.
|
popular export option from transcription services. SRT transcripts used for podcasts should adhere to the following specifications.
|
||||||
|
|
||||||
### Properties:
|
#### Properties:
|
||||||
- Max number of lines: 2
|
- Max number of lines: 2
|
||||||
- Max characters per line: 32
|
- Max characters per line: 32
|
||||||
- Speaker names (optional): Start a new card when the speaker changes. Include the speaker's name, followed by a colon.
|
- Speaker names (optional): Start a new card when the speaker changes. Include the speaker's name, followed by a colon.
|
||||||
|
|
||||||
### Snippet:
|
#### Snippet:
|
||||||
```
|
```
|
||||||
1
|
1
|
||||||
00:00:00,000 --> 00:00:02,760
|
00:00:00,000 --> 00:00:02,760
|
||||||
|
|
Ładowanie…
Reference in New Issue