Whitepaper 2
Whitepaper 2
Abstract
DeepFakeAI aims to grant anyone access to the power of deepfake technology, utilizing the
blockchain to create a robust, cloud-based service that can quickly generate clips according to
it’s users desires. These clips are generated through text submitted by the user and a selected
”Character” on the site, which can be one of the pre-selected characters curated by the service or
a character generated through a user’s own video and audio data. Available through a web app,
web API or telegram bot, the service allows the generation of as many clips as users require.
Table of Contents
Table of Contents 1
Background 2
Payment Structure 5
Using Minutes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
User vs Developer Minutes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
Custom Characters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Holding $FakeAI . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Tier Structure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
Road-map 7
Summary 7
1
Background
AI generated video clips created through deep learning, or deepfakes as they’re commonly known,
have become widely recognized as a formidable technology in our current world. Anyone can generate
clips so long as they have a powerful computer, several hours to train and refine a model, several
more hours to render generated clips and the money to foot the power bill. These requirements create
a high barrier to entry, and although some services exist to create or alter images, generate audio,
and even synthesize new clips, the process is often unpredictable, expensive, time intensive or a some
combination of these three issues.
If a person wanted to create a clip of a person saying their chosen phrase, they would need to use a
combination of the above services, then bring these individual elements together using video editing
software and a deft hand. In an age where deepfakes are so present in the public consciousness, it is
still considerably difficult to create a convincing video clip.
Entertainment
DeepFakeAI’s most common use-case is the creation of clips purely for entertainment’s sake. These
clips can be generated as jokes, used to spread a message in a humorous way, or even as announcements
on platforms such as Discord servers or social media sites. So long as the use-case follows existing laws
and regulations on computer generated content, the clips are free to be used in any capacity the user
may desire.
2
Figure 1: The Character creation and deepfaking process for curated characters
Character Selection
First, the user must select a character. These can be the pre-generated Characters that the DeepFakeAI
team has created, or can be a user-generated character that the user uploads video and audio data to
create.
Curated Characters
The curated Character list created by the DeepFakeAI team contains a wide variety of characters,
the ever-expanding list of which can be found proudly displayed on the DeepFakeAI website2 . These
Characters are generated by the team using the process shown in Figure 1, and the deep learning is
referred to therein as the ”Deepfake Audio Training Algorithm” and the ”Deepfake Video Training
Algorithm”. Both visual and auditory data is extracted from publicly available footage, but each are
carefully edited as two separate inputs to reduce unwanted bad data. The key to the quality of these
curated characters is in the Review, Refinement and Retraining process, wherein the DeepFakeAI team
tweak variables and monitor the deep learning progress throughout it’s development. By removing all
unwanted noise and other speakers, the end product is clean and clear for use in the clip generation
process.
User-Generated Characters
User-Generated Characters are created in a similar fashion, but go through a faster and more auto-
mated training process to ensure quick product turnaround for users. They are then used to create
clips for that user, and allow more individualized results for those who may want it. The quality is
still high, but depends much more on user input data and is limited by the quality of video and audio
2 https://fakeai.io
3
provided. The data can be given by uploading 1 minute or less video footage of the desired Character
and some audio , which is then processed in the same manner described above. There is then a training
period, and you can submit your email to be notified when the training is complete3 .
Clip Generation
Once a Character is selected, the clip generation process can begin. The user inputs a message to their
chosen interface, whether it be the web app, web API, or Telegram bot (See ”Using the Service” for
more details). The input message is then fed into the trained Character model , which processes the
message into a video clip using generative AI processing based on the Character model. The output is
a realistic video clip including the Character moving and speaking in front of a background, and audio
that is synchronized with the visuals which match up with the input message. This deepfaked clip is
the main output from the deep learning model.
HD Lip-syncing
While the initial output from the deep learning model is more than sufficient for many purposes, the
DeepFakeAI team has developed a more advanced HD Lipsyncing Algorithm that uses further pro-
cessing to greatly improve the quality of mouth movements, expressions, and synchronization with the
generated audio. By implementing this optional algorithm, the clips generated appear very realistic,
and can be viewed in higher quality settings without suffering reduced believably. For an example, see
DeepFakeAI’s HD Lipsync Model Demo4 . Users can choose whether to use this model or not in their
clip creation menu.
By utilizing the above processes, DeepFakeAI produces high-quality clips in an efficient manner re-
gardless of the message or Character chosen.
4
once the training process is complete, which may take a few hours. Once completed, your character is
ready for use in clip creation.
Payment Structure
As a service with no outside advertisements available or sponsors to listen to, DeepFakeAI is able to
deliver clips without any outside interference or data sharing whatsoever. Because of this, the service
requires funding for further development, maintenance costs, and the base cost of hosting the service
into perpetuity. As such, the service is paid, and can be accessed through either holding the corollary
token $FakeAI, or by purchasing minutes directly on the web portal.
Using Minutes
Purchasing minutes is the most common way to use DeepFakeAI, which can be done through the web
portal. Users must be signed up and logged into the website before using the service.
If the use of fiat currency is desired, users can sign in using their email, the Google API, or the
X API. Once logged in, users must go to the ”Buy” tab of the dashboard, which can be found at
app.fakeai.io. Then, the user can select either User or Developer minutes, and choose how many
minutes they would like to purchase (More information on the difference between User and Developer
minutes can be found below). Once the desired type and amount of minutes have been selected, the
user select the ”Buy” option, which will redirect them to a Stripe checkout page. Here, payment
information can be filled out, and once the payment has processed the user will have the purchased
minutes available for use.
Alternatively, if the user wishes to use cryptocurrency, they can log in using their MetaMask wallet.
Once this is done, users can follow the same process described above, however instead of being redi-
rected to a Stripe checkout page, the currency will be pulled directly from the selected wallet upon
purchase.
5
User Minutes Developer Minutes
$2 per minute $3 per minute
Web App Access ✓ Web App Access ✓
Telegram Bot Access ✓ Telegram Bot Access ✓
Add Telegram Bot to Server ✗ Add Telegram Bot to Server ✓
Web API Access ✗ Web API Access ✓
Custom Characters
Custom characters can be created for a fee, and can be paid for using either cryptocurrency or fiat
currency using the same process as the minutes purchasing above. Simply select the dollar sign by the
Custom Character’s name in the Custom tab to access the checkout page. Once paid for, the character
can be trained. Once training is complete you will be notified via email. Once trained, the character
can be used just like any other, and can be viewed and managed under the List portion of the Custom
tab.
Holding $FakeAI
Cryptocurrency users can also access the service by holding a certain number of $FakeAI tokens. This
gains access to certain tiers of use, all of which give the user access to the same options as the Developer
Minutes. The uses include the following for each tier:
Tier Structure
• Tier 1: 50,000 - 99,999 tokens
– 1 video per day
All tiered options require locking the tokens for at least 24 hours to be used, after which the benefits
are available so long as the tokens continue to be held.
6
Road-map
The current road-map for DeepFakeAI is the following as of the writing of this whitepaper:
• Improved deepfake quality through continued training and refinement of the deep learning algo-
rithms and processes
• Further optimizations to ease of use and efficiency of the web app, web API and Telegram bot
• Introduction of the DeepFakeAI bot to new platforms, to expand use-cases across multiple social
media sites
• Continued expansion of the Character pool with input from the DeepFakeAI community
• Improvement of the custom Character option through further refinement of the automated Char-
acter creation process and custom refinement for video and audio
• A new ”Zero Shot” model, which will reduce cost, amount of audio data required and training
time significantly for custom characters alike using a new audio training process. These savings
will be passed on to the users as lower cost and time to use the service.
These objectives continue to drive the DeepFakeAI team to improve the service, and there is no limit
to the possibilities in the future.
Summary
DeepFakeAI as a service has revolutionized the easy creation of deepfaked clips, allowing users to
generate any desired clip with minimal effort at low cost. The technology is sound, the team is
working hard on improvements, and the output is clean, high definition video available for whatever
use-case can be imagined. It lowers the high barrier of entry to this powerful technology, and the
customization options are unrivaled by other competing services. All that is required is a computer,
an internet connection, and an idea to start creating.