Open Generative Fill

A repository for hacking Generative Fill with Open Source Tools

Try out the HF Sapace

Try out the scripts

Text Based Image Editing

Creating an Open Source alternative for GenerativeFill and other editing tools.

Through this notebook we aim to:

Provide an Edit Prompt
Provide an Image
Edit the Image based on the initial Prompt

How do we propose to do this?

Accept an edit_prompt and an image as input
Use a Vision Model to caption the image
Pass the edit_prompt through a language model to extract the source entity
Create a replacement_caption where the source entity of the original image is swapped with the target entity in the edit_prompt
Use the source entity to create a segmentation mask using OWL-VIT and SAM
Use the mask and the replacement_caption for image inpainting

The pipeline is shown in the figure below:

TO-DOs

This repository is still in its early stages and will require additional work.

Better Captioning from Vision Model
Prompt upsampling using the Language Model
More complex editing tasks than replacement
Optimization of the models and an end-to-end pipeline

Credits

sayakpaul for the amazing advice and ideas
pedrogengo for the replacement caption idea, as illustrated here
rishiraj for patiently teaching us about Qwen and small LLMs

Citing Generative Fill

If Generative Fill helps your research, we appreciate your citations. Here is the BibTeX entry:

@misc{raha2024opengenerativefill,
  title={Open Source Generative Fill},
  author={Raha, Ritwik and Roy Gosthipaty, Aritra},
  year={2024},
  howpublished={\url{https://github.com/ritwikraha/GenerativeFill-with-Keras-and-Diffusers}},
}

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.devcontainer		.devcontainer
notebooks		notebooks
open_generative_fill		open_generative_fill
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
inference.py		inference.py
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Open Generative Fill

Text Based Image Editing

Through this notebook we aim to:

How do we propose to do this?

TO-DOs

Credits

Citing Generative Fill

About

Releases

Packages

Languages

License

AashiDutt/Open-Generative-Fill

Folders and files

Latest commit

History

Repository files navigation

Open Generative Fill

Text Based Image Editing

Through this notebook we aim to:

How do we propose to do this?

TO-DOs

Credits

Citing Generative Fill

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages