|
|
Subscribe / Log in / New account

Providing context

Providing context

Posted Sep 25, 2024 7:53 UTC (Wed) by mwood (guest, #55622)
In reply to: … until it's not by ringerc
Parent article: Transcribing audio with AI using Speech Note

> I'd love to be able to seed one with a context.

You actually can! Whisper has an `--initial_prompt` option.

e.g. I tried the example from the article and gave it some context, which allowed it to correctly transcribe "shan't" and "bosh" and the repetition, but for some reason it got horribly confused in the middle.

I gave it the following context:

'This is a reading of a two stanza poem. It contains some old/unusual exclamations and the last line of each stanza contains some repetition like "and WORD, and WORD, and WORD"'

[00:00.000 --> 00:13.720] the cat this is a LibriVox recording all LibriVox recordings are in the public domain for more information or to volunteer please visit LibriVox.org
[00:13.720 --> 00:26.560] the cat advice to the young by harry graham from ruthless rhymes for heartless homes LibriVox coffee break collection number eight
[00:26.560 --> 00:41.280] my children you should imitate the harmless necessary cat who eats whatever's on his plate and doesn't even leave the fat who never stays in bed too late or does immoral things like that
[00:41.280 --> 00:55.080] instead of saying shan't or bosh he'll sit and wash and wash and wash when shadows fall and lights grow dim he sits beneath the kitchen stair
[00:55.080 --> 00:55.880] basta
[00:55.880 --> 00:56.460] ba
[00:56.460 --> 01:03.260] and limb a simple couch he chooses there and if you tumble over him he simply loves to hear you
[01:03.260 --> 01:19.500] swear and while bad language you prefer he'll sit and purr and purr and purr end of the cat by harry graham read by patrick wallace


to post comments


Copyright © 2025, Eklektix, Inc.
Comments and public postings are copyrighted by their creators.
Linux is a registered trademark of Linus Torvalds