meshtalk

This repository contains code to run MeshTalk for face animation from audio. If you use MeshTalk, please cite

@inproceedings{richard2021meshtalk,
    author    = {Richard, Alexander and Zollh\"ofer, Michael and Wen, Yandong and de la Torre, Fernando and Sheikh, Yaser},
    title     = {MeshTalk: 3D Face Animation From Speech Using Cross-Modality Disentanglement},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {1173-1182}
}

Supplemental Material

Running MeshTalk

Dependencies

ffmpeg
numpy
torch         (tested with v1.10.0)
pytorch3d     (tested with v0.4.0)
torchaudio    (tested with v0.10.0)

Animating a Face Mesh from Audio

Download the pretrained models and unzip them. Make sure your python path contains the root directory (export PYTHONPATH=<your_meshtalk_root_directory>).

Then, run

python animate_face.py --model_dir <your_pretrained_model_dir> --audio_file <your_speech_snippet.wav> --output <your_output_file.mp4>

See a description of command line arguments via python animate_face.py --help. We provide a neutral face template mesh in assets/face_template.obj. Note that the rendered results look slightly different than in the paper and supplemental video because we use a differnt (open source) rendering engine in this repository.

Training your own MeshTalk version

A subset of the data (13 subjects) has been released as our Multiface dataset. The dataset includes tracked meshes and audio files.

Note that the geometries in multiface have a slightly different topology than in meshtalk. To convert geometries from multiface to meshtalk, run

python utils/multiface2meshtalk.py <multiface_mesh.bin> <output.bin>

on the .bin files containing the vertex positions of the multiface meshes.

Once converted, you can load the new meshes with

geom = np.fromfile("my_converted_geometry.bin", dtype=np.float32).reshape(6172, 3)

and use the vertex positions with the topology defined in assets/face_template.obj.

License

The code and dataset are released under CC-NC 4.0 International license.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
assets		assets
models		models
utils		utils
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
animate_face.py		animate_face.py
supplemental_video.png		supplemental_video.png
supplementary_material.pdf		supplementary_material.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

meshtalk

Supplemental Material

Running MeshTalk

Dependencies

Animating a Face Mesh from Audio

Training your own MeshTalk version

License

About

Releases

Packages

Languages

License

ThornbirdZhang/meshtalk

Folders and files

Latest commit

History

Repository files navigation

meshtalk

Supplemental Material

Running MeshTalk

Dependencies

Animating a Face Mesh from Audio

Training your own MeshTalk version

License

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages