One-Shot Free-View Neural Talking Head Synthesis

Unofficial PyTorch implementation of the paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing".

Python 3.6 and PyTorch 1.7 are used.

Updates:

2021.11.05 :

~~Replace Jacobian with the rotation matrix (Assuming J = R) to avoid estimating Jacobian.~~
Correct the rotation matrix.

2021.11.17 :

Improved generator with better performance (models and checkpoints have been released).

Driving | Beta Version | FOMM | New Version:

driving-beta-fomm-new.mp4

Driving | FOMM | Ours:

Free-View:

Train:

python run.py --config config/vox-256.yaml --device_ids 0,1,2,3,4,5,6,7
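The `--device_ids` value in the command above is a comma-separated list of GPU indices. As a minimal sketch of how such a flag can be parsed with `argparse` (the helper `parse_device_ids` and the parser below are hypothetical and only mirror the two flags shown; the actual `run.py` may differ):

```python
import argparse


def parse_device_ids(text):
    """Turn a comma-separated string such as "0,1,2" into [0, 1, 2]."""
    return [int(part) for part in text.split(",") if part.strip()]


def build_parser():
    # Hypothetical parser: covers only the flags that appear in the README command.
    parser = argparse.ArgumentParser(description="Sketch of the training CLI")
    parser.add_argument("--config", required=True, help="path to a YAML config")
    parser.add_argument("--device_ids", default="0", type=parse_device_ids,
                        help="comma-separated GPU indices")
    return parser


args = build_parser().parse_args(
    ["--config", "config/vox-256.yaml", "--device_ids", "0,1,2,3"]
)
print(args.device_ids)  # [0, 1, 2, 3]
```

With a single GPU, passing `--device_ids 0` would yield a one-element list under this sketch.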

Demo:

python demo.py --config config/vox-256.yaml --checkpoint path/to/checkpoint --source_image path/to/source --driving_video path/to/driving --relative --adapt_scale --find_best_frame
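The `--relative` and `--adapt_scale` flags are not explained here. One plausible reading, borrowed from the FOMM demo that this project acknowledges, is that motion is transferred relative to the first driving frame and optionally rescaled. The sketch below illustrates that idea only; the function name, the tuple-based keypoint format and the `scale` argument are assumptions, not the repository's API.

```python
def relative_keypoints(kp_source, kp_driving, kp_driving_initial, scale=1.0):
    """Shift each source keypoint by the driving keypoint's displacement
    from its position in the first driving frame, times `scale`."""
    moved = []
    for src, drv, drv0 in zip(kp_source, kp_driving, kp_driving_initial):
        # Per-coordinate update: source + scale * (driving - driving_initial)
        moved.append(tuple(s + scale * (d - d0) for s, d, d0 in zip(src, drv, drv0)))
    return moved


# One 3D keypoint: the driving point moved 0.25 along x since the first frame.
source = [(0.0, 0.0, 0.0)]
driving_first = [(0.25, 0.0, 0.0)]
driving_now = [(0.5, 0.0, 0.0)]
print(relative_keypoints(source, driving_now, driving_first, scale=2.0))
```

Under this reading, the source identity keeps its own keypoint layout and only the driving motion is carried over.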

Free-view (e.g. yaw=20, pitch=roll=0):

python demo.py --config config/vox-256.yaml --checkpoint path/to/checkpoint --source_image path/to/source --driving_video path/to/driving --relative --adapt_scale --find_best_frame --free_view --yaw 20 --pitch 0 --roll 0

Note: first run `python crop-video.py --inp driving_video.mp4` to get a cropping suggestion, then crop the raw video accordingly before using it as the driving video.

Pretrained Model:

| Model | Train Set | Baidu Netdisk | MediaFire |
| --- | --- | --- | --- |
| Vox-256-Beta | VoxCeleb-v1 | Baidu (PW: c0tc) | MF |
| Vox-256-New | VoxCeleb-v1 | - | MF |
| Vox-512 | VoxCeleb-v2 | soon | soon |

Note:

~~For now, the Beta Version is not well tuned.~~
For free-view synthesis, it is recommended that Yaw, Pitch and Roll are within ±45°, ±20° and ±20° respectively.
Face restoration algorithms (e.g. GPEN) can be used as post-processing to significantly improve the output resolution.
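The recommended free-view ranges in the note above can be checked before launching the demo. The helper below is a hypothetical convenience, not part of the repository; it only encodes the ±45°, ±20°, ±20° limits stated above.

```python
# Limits taken from the note above (degrees, symmetric around zero).
RECOMMENDED_LIMITS = {"yaw": 45.0, "pitch": 20.0, "roll": 20.0}


def out_of_range(yaw, pitch, roll):
    """Return the names of the angles that exceed the recommended limits."""
    angles = {"yaw": yaw, "pitch": pitch, "roll": roll}
    return [name for name, value in angles.items()
            if abs(value) > RECOMMENDED_LIMITS[name]]


print(out_of_range(20, 0, 0))     # []
print(out_of_range(60, 0, -25))   # ['yaw', 'roll']
```

Angles outside these ranges are not forbidden; the note only suggests that results are more reliable within them.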

Acknowledgement:

Thanks to NV, AliaksandrSiarohin and DeepHeadPose.

zhanglonghao1992/One-Shot_Free-View_Neural_Talking_Head_Synthesis
