Gralx: Multimodal Context-Preserving Data Masking

The Multimodal Context-Preserving Data Masking API is an open-source privacy layer designed for companies building products on top of large public models or their own proprietary models. This API enables the seamless integration of advanced data masking techniques into existing applications, ensuring the protection of sensitive information while preserving the vital context necessary for accurate model performance. Features

Key Features

Seamless Integration: The API provides a simple and intuitive interface for incorporating data masking capabilities into your existing pipelines, making it easy to protect sensitive data without extensive modifications to your codebase.
Multimodal Support: The API supports various data modalities, including text, images, audio, and video, allowing you to protect sensitive information across different types of data utilized by your models.
Context-Preserving Masking: Advanced masking techniques are employed to maintain the contextual integrity of the masked data, ensuring that the relationships and semantics within the dataset are preserved. This enables your models to perform accurately while safeguarding sensitive information.
Flexible Masking Strategies: The API offers a range of masking strategies, such as substitution, encryption, and generalization, giving you the flexibility to choose the most appropriate technique for your specific use case and compliance requirements.
Customizable Masking Rules: You can define custom masking rules based on your domain expertise and data privacy policies, ensuring that the masking process aligns with your organization's specific needs and regulations.
Performance Optimization: The API is designed with performance in mind, leveraging efficient algorithms and parallel processing to minimize the impact on your application's runtime while delivering robust data protection. Comprehensive Logging and Monitoring: Detailed logs and monitoring capabilities are provided to track the masking process, identify potential issues, and ensure the integrity of the masked data.

Python Libraries

Flask Security Too - authentification
PySceneDetect - split scenes

External Frameworks

BMF (Babit Multimedia Framework)developed by ByteDance: Used to process the videos.
Video LLaMa*: Used to transcribe the video scene scene by scene.
Eleven Labs API: Used to generated masked audio with swapped PII data variables.
Presidio by Mircrosoft: Used to extract, label, and swap textual PII data.
Story Diffusion: Used to generate frames that mask PII data but retain important context.
EMOby Alibaba: used to re-animate facial expressions and lip-sync audio in the generated portraits
Animate Anyone by Alibaba: used to re-animate body movements in the generated images

Name	Name	Last commit message	Last commit date
Latest commit tesims Update README.md Sep 22, 2024 5a0383c · Sep 22, 2024 History 18 Commits
MeloTTS	MeloTTS	Update audio processing and related files	Sep 3, 2024
OpenVoice_server	OpenVoice_server	Update audio processing and related files	Sep 3, 2024
__pycache__	__pycache__	Initial commit	Aug 17, 2024
app	app	Update audio processing and related files	Sep 3, 2024
checkpoints_v2	checkpoints_v2	Update audio processing and related files	Sep 3, 2024
migrations	migrations	Add new files and updates	Aug 18, 2024
test	test	testing text flow	Aug 26, 2024
venv_py39	venv_py39	Update audio processing and related files	Sep 3, 2024
.DS_Store	.DS_Store	Update audio processing and related files	Sep 3, 2024
.flaskenv	.flaskenv	Initial commit	Aug 17, 2024
.gitattributes	.gitattributes	Initial commit	Aug 17, 2024
.gitignore	.gitignore	Remove large file from tracking and add to .gitignore	Sep 3, 2024
LICENSE	LICENSE	Initial commit	Aug 17, 2024
README.md	README.md	Update README.md	Sep 22, 2024
bfg.jar	bfg.jar	Initial commit	Aug 17, 2024
config.py	config.py	Update audio processing and related files	Sep 3, 2024
graxl-2.code-workspace	graxl-2.code-workspace	Update audio processing and related files	Sep 3, 2024
main.py	main.py	Initial commit	Aug 17, 2024
requirements.txt	requirements.txt	Update audio processing and related files	Sep 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Gralx: Multimodal Context-Preserving Data Masking

Key Features

Python Libraries

External Frameworks

About

Releases

Packages

Languages

License

tesims/Gralx

Folders and files

Latest commit

History

Repository files navigation

Gralx: Multimodal Context-Preserving Data Masking

Key Features

Python Libraries

External Frameworks

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages