A powerful framework designed to handle sensitive medical data with advanced privacy techniques.
This project extends LLaVA-Med with state-of-the-art privacy mechanisms for both text and images, ensuring the protection of sensitive information in real-time.
- Clone this repository and navigate to the LLaVA-Med folder

```shell
git clone https://github.com/microsoft/LLaVA-Med.git
cd LLaVA-Med
```
- Install the package: create the conda environment

```shell
conda create -n llava-med python=3.10 -y
conda activate llava-med
pip install --upgrade pip  # enable PEP 660 support
pip install -e .
```
- Install the `openai` package

If you are running a Python script and want to access the API key:

```python
import os

import openai

api_key = os.getenv("OPENAI_API_KEY")
```
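For the snippet above to find the key, it must be present in the environment first. One common way to set it for the current shell session (the key value here is just a placeholder):

```shell
export OPENAI_API_KEY="your-api-key-here"
```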
Launch a controller

```shell
python -m llava.serve.controller --host 0.0.0.0 --port 10000
```
Launch a model worker

```shell
python -m llava.serve.model_worker --host 0.0.0.0 --controller http://localhost:10000 --port 40000 --worker http://localhost:40000 --model-path microsoft/llava-med-v1.5-mistral-7b --multi-modal
```
Wait until the process finishes loading the model and you see "Uvicorn running on ...".
Launch a model worker (multiple GPUs, when GPU VRAM <= 24GB)

If the VRAM of your GPU is 24GB or less (e.g., RTX 3090, RTX 4090), you can try running the model across multiple GPUs:

```shell
python -m llava.serve.model_worker --host 0.0.0.0 --controller http://localhost:10000 --port 40000 --worker http://localhost:40000 --model-path microsoft/llava-med-v1.5-mistral-7b --multi-modal --num-gpus 2
```
Wait until the process finishes loading the model and you see "Uvicorn running on ...".
Send a test message

```shell
python -m llava.serve.test_message --model-name llava-med-v1.5-mistral-7b --controller http://localhost:10000
```
Launch a Gradio server

```shell
python -m llava.serve.gradio_web_server --controller http://localhost:10000
```
You can open your browser and chat with a model now.
The Privacy-Preserving LLaVA-Med model enhances the original LLaVA-Med biomedical assistant with additional privacy protections for text and image data. Using natural language processing and computer vision techniques, it preprocesses user data in real time to obscure personally identifiable information and adds controlled noise to sensitive image regions. Because it is implemented as a preprocessing module, it does not alter the core architecture of LLaVA-Med, so user privacy is maintained without compromising the accuracy of biomedical assistance.
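To illustrate the kind of preprocessing described above, here is a minimal sketch: the function names and the exact masking patterns are hypothetical, not the project's actual implementation. It masks simple textual PII patterns and adds Laplace noise to a rectangular region of an image array:

```python
import re

import numpy as np


def mask_text_pii(text):
    """Replace simple PII patterns (dates, phone numbers, emails) with tags."""
    text = re.sub(r"\b\d{1,2}/\d{1,2}/\d{2,4}\b", "[DATE]", text)
    text = re.sub(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b", "[PHONE]", text)
    text = re.sub(r"\b[\w.+-]+@[\w-]+\.[\w.]+\b", "[EMAIL]", text)
    return text


def add_laplace_noise(image, region, scale=5.0, seed=None):
    """Add Laplace noise to a rectangular (y0, y1, x0, x1) region of an image."""
    rng = np.random.default_rng(seed)
    y0, y1, x0, x1 = region
    noisy = image.astype(np.float64).copy()
    noisy[y0:y1, x0:x1] += rng.laplace(0.0, scale, noisy[y0:y1, x0:x1].shape)
    # Clip back to valid pixel range and restore the original dtype.
    return np.clip(noisy, 0, 255).astype(image.dtype)
```

The noise `scale` trades privacy against fidelity: a larger scale hides more detail in the protected region but also degrades the diagnostic quality of that region.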
Limitations
While the Privacy-Preserving LLaVA-Med model enhances privacy for both text and image data, it has certain limitations. The added noise in images and the obfuscation of text can slightly degrade the model's performance, particularly where fine-grained details are essential for accurate medical interpretation. Furthermore, relying on the OpenAI API for text obfuscation limits the number of simultaneous conversations and slows down response times.
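A common way to soften the API rate limits mentioned above (a generic sketch, not code from this repository) is to retry failed calls with exponential backoff:

```python
import random
import time


def call_with_backoff(fn, max_retries=5, base_delay=1.0):
    """Call fn(); on exception, retry with exponentially growing delay plus jitter."""
    for attempt in range(max_retries):
        try:
            return fn()
        except Exception:
            if attempt == max_retries - 1:
                raise  # give up after the final attempt
            time.sleep(base_delay * (2 ** attempt) + random.uniform(0, 0.1))
```

This helps with transient rate-limit errors but does not remove the underlying throughput ceiling of the API.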