Skip to content

Addon for Blender that lets you generate lipsync animation automatically from audio files.

License

Notifications You must be signed in to change notification settings

blackears/parrotLipsync

Repository files navigation

Parrot Automatic Lipsync for Blender

Parrot Lipsync is an extension for Blender that lets you automatically generate lip-synced data from audio tracks. It can be used for both 2D and 3D animations and supports many common languages. It uses the allosaurus library to turn spoken audio tracks into phonemes which are then used to key frame your mouth positions.

Video - overview

More videos

Installation

Older versions

For installing Parrot version 1.0.5 and earlier, please use the instructions on this page.

Latest Version

Parrot Lipsync relies on you having ffmpeg already installed on your computer.

Install Ffmpeg

Ffmpeg is a popular library that lets you read and write many audio/video files. It needs to be accessible from the command line.

The ffmpeg homepage offers installers for a variety of systems. Alternately, you can use the below commands to install it from a terminal:

ffmpeg -version

If ffmpeg has not been found, you will need to install it.

# on Ubuntu or Debian
sudo apt update && sudo apt install ffmpeg

# on Arch Linux
sudo pacman -S ffmpeg

# on MacOS using Homebrew (https://brew.sh/)
brew install ffmpeg

# on Windows using Chocolatey (https://chocolatey.org/)
choco install ffmpeg

# on Windows using Scoop (https://scoop.sh/)
scoop install ffmpeg

Installing the Parrot Lipsync extension

Now that ffmpeg is installed, you can install Parrot by opening the Edit > Preferences > Addons tab, clicking the downward pointing arrow button in the top right corner and selecting Install from Disk.... Browse to the Parrot Lipsync zip file you downloaded and select it. Finally, make sure the check mark next to View 3D: Parrot Lipsync is checked so that the extension is activated.

Usage

Once installed, click on the Parrot Lipsync panel to access Parrot's controls.

Parrot running in Blender 4.1

You can then configure various parameters:

General Parameters

  • Phoneme Table - Table used to map phonemes to mouth poses
  • Key interpolation - Way to interpolate between mouth poses. It is recommended that you use bezier for 3D and constant for keyframe animation.
  • Rest Gap - If two phonemes are separated by a more than this many seconds, add rest poses to indicate a closed mouth.
  • Limit phonemes per second - If selected, will skip phonemes if they are more frequent than the phonemes per second rate.
  • Phonemes per second - When activated, will drop phonemes if they exceed this speed limit. This can smooth out mouth movements.

Lipsync Action Paramaters

This generates an action based on audio data. This action is added to the project's actions and is not assigned to any particular object. You must have added an audio track to the Video Sequencer and selected it as the active object.

  • Lipsync Action - Action object that will have it's data replaced with the data generated for this audio track. If blank, a new action will be created.

Press the Render Lipsync to Action button to generate the action.

NLA Action Paramaters

This uses Blender's NLA editor to generate the animation action and then add it to the target object's NLA tracks. NLA tracks will be generatd for all selected tracks in the Video Sequencer.

  • Target object - Object action will be created for
  • Action name suffix - This string will be appended to the names of any actions generated to help you keep track of them

Press the Render Lipsync to Action button to generate the NLA tracks and actions.

Phoneme Groups

This is a list generated by reading the phoneme table. It lists every Phoneme group deacribed in the table and has a Pose Action field for each group that lets you define the mouth position for each group. You'll need to provide poses for each entery in the table for Parrot to generate lipsync tracks.

If the Phoneme Groups are not showing, or if you want to load a different table, click the Reload Phoneme Table to rebuild the table.

Running in headless mode

A script has been included in /examples/headless_example/run_headless.py that demonstrates running Parrot from the command line without having to launch Blender first.

Run blender headless from the command line with the following command:

blender -b headless_demo.blend -P run_headless.py

run_headless.py:

import bpy

# Set Parrot operator properties
bpy.context.scene.props.target_object = bpy.data.objects['rig']

# Generate the lipsync
bpy.ops.parrot.render_lipsync_to_object_nla()

# Save the result
bpy.ops.wm.save_as_mainfile(filepath="//headless_result.blend")

Editing the Phoneme Table

Parrot Lipsync uses a separate JSON file which defines what mouth positions to use and which phonemes are used for which mouth positions. By default it will use the file named phoneme_table_en.json which comes with the addon. This file can be edited to change which mouth position groups are used and which phonemes should be used for each mouth position.

If you wish to define your own phoneme table, it is recommended that you make a copy of phoneme_table_en.json, add your customizations to it and then set the Phoneme table file field of the Parrot UI to point to your custom file.

Default phoneme table

Description of the Phoneme table

The phoneme table has two main sections:

The groups section defines each mouth position that your animation will use. The groups are what you will assign mouth poses to in the addon UI. The name field defines the name of the group and the description field lets you add some descriptive text to explain to the user what sounds this mouth pose is meant to make (and which is displayed as a hint). The groups will also always include a special group called rest, even if you do not explicitly define it in the JSOn file. The rest group is meant to represent the mouth when it is closed between words.

The phonemes section is where you let Parrot know which phoneme symbols belong to which group. The code field is the International Phonetic Alphabetic code for the syllable and the group field indicates which mouth pose group the syllable belongs to. The class and example fields are not currently used by Parrot but meant to provide usage hints to the user and might be used in a future version of Parrot.

Building

A make.py file is included in the root directory. You must have downloaded the necessary wheels and put them in ./source/wheels first. (The wheels are not included in this repo because they are several gigabytes in size all together and some are platform specific.) You must also have Blender 4.2 or higher installed and in your path.

The Parrot Lipsync extension can be built by being in the root directory and running the command

python ./make.py -e

This will place the built zipfiles in the ./extensions directory.

Getting the wheels

A wheel is an archive of a python module that can be imported into a Python program. Since Blender 4.2, all necessary wheels must be bundled with the extension. Since Parrot uses artificial intelligence, it requires several big wheels. If you run the make file with the -w flag, it will download the wheels necessary for the platform you are running on.

python ./make.py -w

There is currently no way (that I know of) to download the wheels for a different platform, but if you want the wheels for platforms other than the one you are building on, you can run the script on those other platforms and then copy the downloaded wheels into your ./source/wheels directory.

Acknowledgements

allosaurus https://github.com/xinjli/allosaurus

Blender Base Meshes: https://studio.blender.org/training/stylized-character-workflow/base-meshes/

Support

If you found this software useful, please consider buying me a coffee on Kofi. Every contribution helps me to make more software:

ko-fi

About

Addon for Blender that lets you generate lipsync animation automatically from audio files.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages