jonathanrandall/esp32_cam_2_llm

esp32_cam_2_llm

connect an esp32 cam to llm and back again

The project connects an esp32 cam to an LLM. The LLM takes as input the prompt "describe this picture?" together with the image, and returns a description of the image. The returned description is then converted to speech. The LLM runs locally on my machine. I'm using the ollama library with the LLaVA model, an open source model that is good at image analysis. I'm using an MSI laptop with an NVIDIA 4070 GPU, which has 8GB of VRAM. The LLaVA model can be installed directly with ollama.
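As a rough illustration of the image-plus-prompt step described above, here is a minimal sketch. It is not the code in llm_2_esp32_cam.py: it talks to Ollama's local HTTP endpoint (`/api/generate` on the default port 11434) using only the Python standard library, rather than the ollama Python package the project uses. The function names `build_payload` and `describe_image` are hypothetical, and the sketch assumes the llava model has already been pulled and that the Ollama server is running on the same machine.

```python
import base64
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(image_bytes, prompt="describe this picture?", model="llava"):
    """Build the JSON body for a single, non-streaming generate request.

    Ollama expects images as base64-encoded strings in the "images" list.
    """
    return {
        "model": model,
        "prompt": prompt,
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,  # ask for one complete JSON reply instead of chunks
    }


def describe_image(image_bytes, url=OLLAMA_URL, timeout=120):
    """Send the image and prompt to the local model and return its description."""
    data = json.dumps(build_payload(image_bytes)).encode("utf-8")
    request = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        # With "stream": False the generated text is in the "response" field.
        return json.loads(response.read())["response"]
```

To try it, read a JPEG captured by the camera into bytes (for example `open("capture.jpg", "rb").read()`, where the file name is a placeholder) and pass it to `describe_image`. The returned string is the text that would then be handed to the text-to-speech step.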

Thanks to pcbway for the pcbs. https://www.pcbway.com

explanatory video:

https://youtu.be/hMA0HNYTjm0

repository files:

  • llm_2_esp32_cam.py: this file runs on my local machine.
  • esp32_2_llm: this contains the test internet radio file from the audio esp32-i2s library. Get this working first to troubleshoot any i2s difficulties.
  • esp32_2_llmv4: this contains the code running on the esp32 cam that communicates with the LLM.
  • fritzing: fritzing file and gerbers for pcb.

connections I used

connection list

equipment list
