[TIKA-94] Speech-to-text transcription - ASF JIRA

XML

Word

Printable

JSON

Details

Type: New Feature
Status: Resolved
Priority: Minor
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 1.27
Component/s: parser
Labels:
- new-parser
- tika-transcription

Description

Like OCR for image files (~~TIKA-93~~), we could try using speech recognition to extract text content (where available) from audio (and video!) files.

The CMU Sphinx engine (http://cmusphinx.sourceforge.net/) looks promising and comes with a friendly license.

Attachments

Issue Links

relates to

TIKA-3384 Convert new transcribe package to a Parser

Open

links to

GitHub Pull Request #406

Activity

People

Assignee:: Lewis John McGibbney

Reporter:: Jukka Zitting

Votes:: 1 Vote for this issue

Watchers:: 12 Start watching this issue

Dates

Created:: 12/Nov/07 02:14

Updated:: 12/May/21 18:02

Resolved:: 03/May/21 23:31