JavaGT/ETV-University-of-Auckland-Transcripts

Transcripts and Metadata from ETV

This repository collects transcripts generated with insanely-fast-whisper in order to build a search index and app.

Original metadata from ETV is saved in './metadata', and the JSON transcript files produced by insanely-fast-whisper with distil-whisper large v3 are saved in './transcripts/'.
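
As a rough illustration of how one of these transcript files could be read, here is a minimal Python sketch. It assumes each JSON file has a top-level "chunks" list whose items carry "timestamp" ([start, end]) and "text"; that layout is an assumption about the insanely-fast-whisper output, not something confirmed by this repository, and `load_chunks` is a hypothetical helper name.

```python
import json
from pathlib import Path


def load_chunks(path):
    """Return (start, end, text) tuples from one transcript JSON file.

    Assumption: the file has a top-level "chunks" list whose items carry
    "timestamp": [start, end] and "text". Adjust if the layout differs.
    """
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    chunks = []
    for chunk in data.get("chunks", []):
        # A missing timestamp is treated as unknown start and end.
        start, end = chunk.get("timestamp") or (None, None)
        chunks.append((start, end, chunk.get("text", "").strip()))
    return chunks
```

Calling `load_chunks` on a file under './transcripts/' would give a flat list of timed text segments, which is a convenient shape for later pre-processing.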

Plan

  • Minimum viable code for collecting metadata
  • Minimum viable code for collecting transcripts
  • Pre-process metadata for searching/filtering transcripts
  • Pre-process transcripts for searching
  • Minimum viable code for searching transcripts
  • Implement a download_metadata script and run it daily with GitHub Actions to collect metadata (using known IDs to find sequences with new additions)
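
The "minimum viable code for searching transcripts" step could start from something as small as an inverted index. The sketch below is one possible approach, not the repository's implementation: the function names and the in-memory shape `{video_id: [chunk_text, ...]}` are hypothetical, and the tokenizer is deliberately naive.

```python
import re
from collections import defaultdict

# Naive tokenizer: lowercase runs of letters, digits, and apostrophes.
TOKEN = re.compile(r"[a-z0-9']+")


def tokenize(text):
    return TOKEN.findall(text.lower())


def build_index(transcripts):
    """Map each token to the set of (video_id, chunk_number) where it occurs.

    `transcripts` is {video_id: [chunk_text, ...]}, a hypothetical shape.
    """
    index = defaultdict(set)
    for video_id, chunks in transcripts.items():
        for n, text in enumerate(chunks):
            for token in tokenize(text):
                index[token].add((video_id, n))
    return index


def search(index, query):
    """Return sorted (video_id, chunk_number) hits containing every query token."""
    tokens = tokenize(query)
    if not tokens:
        return []
    hits = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        hits &= index.get(token, set())
    return sorted(hits)
```

For example, `search(build_index(transcripts), "weather auckland")` returns only the chunks that contain both words. Ranking, stemming, and phrase matching are left out to keep the sketch minimal.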

About

Transcripts from ETV-UOA
