Upload a short clip of the person speaking. A background job will compute a voice embedding after save.