Add Collection Audio using HyperTTS

This article explains how to add Collection Audio using HyperTTS. Collection Audio means we are going to pre-generate audio files which will be added to your Anki collection. Using this method, your text to speech audio will be available whether you are reviewing on the Windows/Mac Anki app, or iOS / Android. This method doesn't work with Cloze notes.

Let's add some Audio

We will generate audio for a Chinese / Mandarin deck but the procedure is the same for any language. In the Anki browser, select a few notes, and select menu HyperTTS -> Add Audio (Collection)...

Configure Source

As a first step, we'll configure the source, in other words which field the audio should be generated based off. In most cases, all you need to do here is select Source Field. I chose Chinese as I want to generate Chinese audio.

If you have special needs, you can also try the Template and Advanced Template Source Modes. These allow you to combine multiple fields together or do something very advanced (for expert users).

Move on to the Target tab.

Configure Target

Since we are generating Collection Audio, one of your fields must contain the Sound Tag. In this tab, you can configure which field will hold the tag. Some people like to dedicate a Field to the sound tag (you could name it simply Sound). Other people prefer to keep the sound tag and the original text together, which you can do by checking Text and Sound Tag in the Text and Sound Tag Handling section below. If you'd like to see how to create a dedicated Sound field for your notes, see our Note Types Tutorial.

Move on to the Voice Selection tab.

Select Voice

If you include all services, HyperTTS offers over 1,200 voices. You have to choose one or more to generate the audio. To restrict the voices displayed, you can select various filters such as Language. Select a voice in the correct language, select one of your notes in the table on the right, then click Preview Sound. You should hear the sound pronounced for the note you've selected.

If you're happy with a single voice, you can use the default Single Selection Mode. If you'd like to randomly pick from a group of voices, select Random, and make sure to add two voices or more. Priority mode is useful for Dictionary services such as Forvo, Collins. You can first look for a word in Forvo, if not found, then look at Collins, and finally as a fallback, use a TTS service.

Next, let's take a look at the Text Processing tab.

Text Processing Options

You don't need to set anything in this tab unless you have a special requirement. You can configure various text replacement rules, and test them. In HyperTTS, text replacement rules are associated with a preset, so that you can have different rules by language / by deck.

The most frequent use case will be to add Text Replacement Rules to expand abbreviations (such as kg, cm3, etc). Advanced users can also use Regex Rules. A number of people may also want to remove text in brackets such as (), [], and there is a dedicated checkbox for this. Any text replacement change that you make will be reflected in the table on the right, in the Processed Text column.

Apply Audio to Notes

Once you're satisfied with all the settings, you can click Apply To Notes. You will see a progress bar informing you of the progress, and you can click Close when generation has finished. If you'd like to undo this whole operation, you can click the Anki Edit menu, and select Undo.

Check your Card Template

To ensure you hear your audio while reviewing, make sure that your card template contains the field which has the Sound tag. Select a note in the Anki browser, and click Cards.... In my example below, the sound tag is in the Sound field, which is present in the back template, and you can see the play button in the template preview on the right.

Want to add one by one ?

If you like to enter new notes one by one, you can generate collection audio with a single click from the Anki editor. Just select your HyperTTS preset and click Add Audio. Your Sound field will get updated, and you'll hear the audio.

The End !

This is the end of the tutorial. If you have any questions, please email contact@mail.vocab.ai. Are you interested in trying AwesomeTTS , HyperTTS or Language Tools, to easily add Audio / Translation / Transliteration to your flashcards ? Signup for AwesomeTTS Plus.