R17 Normalise reference transcript into internal extensible format #8

EyalLavi · 2018-09-19T13:19:17Z

We want clients to be able to benchmark using different criteria: text only, text and punctuation, timings, speaker identification. The tool needs to identify the elements in the reference transcript for benchmarking. We also want to avoid forcing users to make complicated reference files. So we need a way to convert different formats into an internal format. The internal format could be a list of tokens (words) with metadata (TBC).
Proposed input formats:

Plain text
CTM
MLF

EyalLavi added the Requirement label Sep 19, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

R17 Normalise reference transcript into internal extensible format #8

R17 Normalise reference transcript into internal extensible format #8

EyalLavi commented Sep 19, 2018 •

edited

Loading

R17 Normalise reference transcript into internal extensible format #8

R17 Normalise reference transcript into internal extensible format #8

Comments

EyalLavi commented Sep 19, 2018 • edited Loading

EyalLavi commented Sep 19, 2018 •

edited

Loading