Vision Browser

Notes

This is a tool which allows you to navigate the web from a chat window, by taking screenshots and sending them to GPT4Vision using the OpenAI API.

The idea was to use this as an additional search tool when webscraping isn't successful. I added a websocket client to connect it to another LLM/bot.

This is a fork from https://github.com/unconv/gpt4v-browsing Props to unconventional-coding for this cool project!

What I've added: -improved chat navigation/link selection.
-added a websocket chat feature to use with other projects.
-removed the python implementations.
-works best w open browser/non-headless mode.
-chat history saved in chatlog, to be used for context later

$ npm install
$ node vision_browse.js

$ npm install
$ pip install -r requirements.txt
$ python3 vision_crawl.py

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
package.json		package.json
requirements.txt		requirements.txt
vision_browse.js		vision_browse.js
websocket_client.js		websocket_client.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Vision Browser

Notes

Examples

About

Uh oh!

Releases

Packages

Uh oh!

Languages

mrdavtan/Vision_Browse

Folders and files

Latest commit

History

Repository files navigation

Vision Browser

Notes

Examples

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages