This article explains how to add Collection Audio using HyperTTS. Collection Audio means we are going to pre-generate audio files which will be added to your Anki collection. Using this method, your text to speech audio will be available whether you are reviewing on the Windows/Mac Anki app, or iOS / Android. This method doesn't work with Cloze notes.
We will generate audio for a Chinese / Mandarin deck but the procedure is the same for any language. In the Anki browser, select a few notes, and select menu
As a first step, we'll configure the source, in other words which field the audio should be generated based off. In most cases, all you need to do here is select. I chose Chinese as I want to generate Chinese audio.
If you have special needs, you can also try theand . These allow you to combine multiple fields together or do something very advanced (for expert users).
Move on to thetab.
Since we are generating Collection Audio, one of your fields must contain the Sound Tag. In this tab, you can configure which field will hold the tag. Some people like to dedicate a Field to the sound tag (you could name it simply Sound). Other people prefer to keep the sound tag and the original text together, which you can do by checking Note Types Tutorial.in the section below. If you'd like to see how to create a dedicated Sound field for your notes, see our
Move on to thetab.
If you include all services, HyperTTS offers over 1,200 voices. You have to choose one or more to generate the audio. To restrict the voices displayed, you can select various filters such as. Select a voice in the correct language, select one of your notes in the table on the right, then click . You should hear the sound pronounced for the note you've selected.
If you're happy with a single voice, you can use the default. If you'd like to randomly pick from a group of voices, select , and make sure to add two voices or more. mode is useful for Dictionary services such as Forvo, Collins. You can first look for a word in Forvo, if not found, then look at Collins, and finally as a fallback, use a TTS service.
Next, let's take a look at thetab.
You don't need to set anything in this tab unless you have a special requirement. You can configure various text replacement rules, and test them. In HyperTTS, text replacement rules are associated with a preset, so that you can have different rules by language / by deck.
The most frequent use case will be to addto expand abbreviations (such as kg, cm3, etc). Advanced users can also use . A number of people may also want to remove text in brackets such as (), , and there is a dedicated checkbox for this. Any text replacement change that you make will be reflected in the table on the right, in the column.
Once you're satisfied with all the settings, you can click. You will see a progress bar informing you of the progress, and you can click when generation has finished. If you'd like to undo this whole operation, you can click the Anki menu, and select .
To ensure you hear your audio while reviewing, make sure that your card template contains the field which has the Sound tag. Select a note in the Anki browser, and click. In my example below, the sound tag is in the Sound field, which is present in the back template, and you can see the play button in the template preview on the right.
If you like to enter new notes one by one, you can generate collection audio with a single click from the Anki editor. Just select your HyperTTS preset and click. Your Sound field will get updated, and you'll hear the audio.