Deepfake Voice Clone
An AI-generated audio impersonation of a real person's voice, used in fraud to authorise transactions, deceive employees, or manipulate family members into sending money.
Also known as: AI voice clone, voice deepfake, synthetic voice fraud
Last reviewed: 1 June 2026
A deepfake voice clone is a synthetic audio recording generated by artificial intelligence that replicates the tone, cadence, accent, and distinctive vocal characteristics of a specific real person. Modern voice cloning systems require only a few seconds to minutes of sample audio — easily obtained from public videos, podcasts, or social media — to produce clones that can be indistinguishable from the original to the human ear.
In a fraud context, voice clones are used in several ways. Criminals clone the voice of a company executive and use it in a phone call to an employee in the finance department, instructing an urgent wire transfer. They clone the voice of a family member and call an elderly relative claiming to be in trouble and needing immediate money. They create voice clips impersonating public figures to lend credibility to investment scams. They bypass voice biometric authentication systems used by banks and call centres.
Voice clone fraud is a rapidly growing threat because the technology is inexpensive and accessible, the audio quality improves continually, and verification of a voice alone is psychologically difficult. Defences include establishing safe-word protocols for financial transactions, requiring alternative verification channels for any instruction to move money, voice liveness detection in biometric authentication systems, and educating individuals never to trust a voice alone for high-stakes decisions.
Examples
- An employee receives a call from what sounds like the CEO's voice requesting an urgent overseas wire transfer for a confidential acquisition; the voice is an AI clone generated from publicly available conference videos, and the funds are sent to a criminal account.