I’m testing out Camtasia. I used to use it for creating training videos years ago, but haven’t used it in a long while, so the new version is just a little bit different… Anyhow, one of the features of Camtasia 2021 is that you can convert your audio narration to caption text, using their Speech to Text function (open the Captions option and click the cog wheel settings icon for it). But there was a problem: the Speech to Text option was grayed out and unavailable. Off to the internet…
Something someone said on a forum about Windows language settings led me to find the solution; however, I tried several things, so I’m not sure exactly which one was the critical one. My Windows language settings were set to both English Australian and English US, but when I looked a bit further (click on the language name and select Options), I didn’t have the language pack for Australian English installed and the speech ‘pack’ for US English wasn’t installed either. I downloaded and installed both, then checked the Speech to Text settings in Camtasia. No change. So I shut down Camtasia and restarted it, and the Speech to Text option was now available. Hopefully this will help someone else.
(Aside: Despite training my voice for speech recognition as Camtasia suggests, my experiments using their Speech to Text function were abysmal. The words in the captions that Camtasia created bore NO resemblance to anything I’d said!! I’ll try again tomorrow with a different microphone [I was using earbuds with a built-in mic, so next I’ll try a plug-in headset microphone to see if that makes any difference.] Meantime, I wondered if it was Windows speech recognition that was at fault or Camtasia, and in my minimal testing I can say that it was Camtasia. I exported the narration to an MP3 file, then used the online version of Word to transcribe it to text and it was perfect, whereas the Camtasia version was unintelligible.)
Further to the audio issue: Yes, my headset microphone made the world of difference to the audio, but despite that, the speech to text conversion in Camtasia for the captioning was still awful. Here’s what the transcription function in online Word had, followed by what Camtasia thought I’d said. The Word one needed a little editing; the Camtasia one needed to be rewritten from scratch. And when I imported the Word transcription as an SRT file (using the same time stamps as the Camtasia SRT file), it left out all but about 5 seconds of audio, so that wasn’t a good option.
Here’s what Word transcribed it as:
00:00:02
In this simple find and replace, we’re going to find a particular word, change it to another word, but we’re going to add a twist to that, and we’re going to make the second word in italics.
00:00:13
So what we’re going to find is the company name in this case here XYZ, and we’re going to replace it with another name and make that name.
00:00:21
Italics first thing we’ll do is we open the find replace window. We do need extra functions, so we need the one that’s under control H, so I press control H, just move it up a little out of the way here. Going to search for XYZ.
And here’s the SAME audio converted from speech to text within Camtasia:
00:00:00,100 --> 00:00:10,433
If in the civil fine to replace dead to find a particular word tragic to another live within 11 twisted left with kind Q of the 82nd would be
00:00:10,433 --> 00:00:20,766
the targets so if we could find if the company name and face. It’s like C wood and replaced with another nine can make it nine attempts to
00:00:20,766 --> 00:00:31,132
sink as we often find replacement that we do need if function so we need the one that under control page to press control H
00:00:31,133 --> 00:00:37,999
move it up and-white hit did a search for its YC
Just amazingly BAD.
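If you'd rather build captions from the Word transcript than retype Camtasia's attempt, the start times Word gives you can be turned into SRT cues with a small script. This is only a rough sketch in Python, not something from Camtasia or Word: the function names are made up for illustration, it assumes each cue ends when the next one starts, and it guesses a length for the final cue (the `last_cue_seconds` value) because Word doesn't supply end times. I can't say whether Camtasia would import the result any more completely than my manual attempt.

```python
import re

# Word's online transcript puts a bare HH:MM:SS start time on its own line.
TIMESTAMP = re.compile(r"^(\d{2}):(\d{2}):(\d{2})$")


def to_seconds(stamp):
    """Convert 'HH:MM:SS' to a number of seconds."""
    hours, minutes, seconds = (int(p) for p in TIMESTAMP.match(stamp).groups())
    return hours * 3600 + minutes * 60 + seconds


def srt_time(seconds):
    """Format whole seconds as an SRT timestamp, 'HH:MM:SS,mmm'."""
    hours, rem = divmod(int(seconds), 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},000"


def transcript_to_srt(text, last_cue_seconds=10):
    """Turn a Word-style transcript into SRT text.

    Assumption: each cue ends where the next begins; the last cue is
    given an arbitrary length of last_cue_seconds.
    """
    cues = []  # each entry is [start_seconds, caption_text]
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if TIMESTAMP.match(line):
            cues.append([to_seconds(line), ""])
        elif cues:
            # Text before the first timestamp is ignored; later text is
            # appended to the most recent cue.
            cues[-1][1] = (cues[-1][1] + " " + line).strip()

    blocks = []
    for i, (start, body) in enumerate(cues):
        end = cues[i + 1][0] if i + 1 < len(cues) else start + last_cue_seconds
        blocks.append(f"{i + 1}\n{srt_time(start)} --> {srt_time(end)}\n{body}\n")
    return "\n".join(blocks)
```

To use it, paste the Word transcript into a string (or read it from a text file), pass it to `transcript_to_srt`, and save the returned text with an `.srt` extension. You would still need to adjust the final cue's end time to match your audio.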