This Chrome extension allows users to capture screenshots of web pages and analyze them using OpenAI's GPT-4 Vision API. Users can ask questions about the captured screenshot and receive AI-generated responses.
- Screenshot capture of the current tab
- Integration with OpenAI's GPT-4 Vision API
- User-friendly interface for entering prompts
- Display of AI-generated responses within the extension
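
The screenshot capture feature listed above could be sketched roughly as follows. This is a minimal illustration, not the extension's actual source: it assumes the capture step yields a base64 image data URL (as Chrome's `chrome.tabs.captureVisibleTab` does), and the names `CaptureFn`, `Screenshot`, `parseDataUrl`, and `captureScreenshot` are hypothetical. The capture function is passed in as a parameter so the parsing logic can be exercised outside the browser.

```typescript
// Hypothetical type: any function that resolves to an image data URL.
// Inside the extension this might be: () => chrome.tabs.captureVisibleTab()
type CaptureFn = () => Promise<string>;

// Hypothetical shape for a parsed screenshot.
interface Screenshot {
  mimeType: string; // e.g. "image/png"
  base64: string;   // image bytes, base64-encoded
}

// Split a "data:<mime>;base64,<payload>" URL into its parts,
// rejecting anything that is not a base64-encoded image.
function parseDataUrl(dataUrl: string): Screenshot {
  const prefix = "data:";
  const marker = ";base64,";
  const markerIndex = dataUrl.indexOf(marker);
  if (!dataUrl.startsWith(prefix) || markerIndex === -1) {
    throw new Error("Expected a base64 data URL");
  }
  const mimeType = dataUrl.slice(prefix.length, markerIndex);
  const base64 = dataUrl.slice(markerIndex + marker.length);
  if (!mimeType.startsWith("image/") || base64.length === 0) {
    throw new Error("Expected a non-empty image payload");
  }
  return { mimeType, base64 };
}

// Run the injected capture function and validate its result.
async function captureScreenshot(capture: CaptureFn): Promise<Screenshot> {
  return parseDataUrl(await capture());
}
```

Validating the data URL before sending it anywhere keeps a failed or empty capture from turning into a confusing API error later.
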
1. Clone this repository: `git clone https://github.com/rodneyg/ScreenHawk.git`
2. Navigate to the project directory: `cd ScreenHawk`
3. Install dependencies: `npm install`
4. Build the extension: `npm run build`
- Open Chrome and navigate to `chrome://extensions`
- Enable "Developer mode" in the top right corner
- Click "Load unpacked" and select the `dist` folder in your project directory
- Click the extension icon in your Chrome toolbar
- Click "Capture Screenshot" to capture the current tab
- Enter a prompt about the screenshot in the dialog that appears
- View the AI-generated response in a new dialog
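
Behind the steps above, the prompt and the screenshot have to be combined into a single request. The sketch below shows one way that could look with OpenAI's Chat Completions endpoint, which accepts a text part and an `image_url` part in the same user message. It is an illustration under assumptions, not the extension's actual code: the model name `gpt-4-vision-preview`, the default of 300 tokens, and the names `buildVisionRequest` and `askAboutScreenshot` are all assumed here, and the model in particular may need to be swapped for whichever vision-capable model your account can access.

```typescript
// Assumed values; adjust to your own account and needs.
const ASSUMED_MODEL = "gpt-4-vision-preview";
const CHAT_COMPLETIONS_URL = "https://api.openai.com/v1/chat/completions";

type ContentPart =
  | { type: "text"; text: string }
  | { type: "image_url"; image_url: { url: string } };

interface VisionRequest {
  model: string;
  max_tokens: number;
  messages: { role: "user"; content: ContentPart[] }[];
}

// Build the JSON body: one user message holding the prompt and the screenshot.
function buildVisionRequest(
  prompt: string,
  screenshotDataUrl: string,
  maxTokens = 300
): VisionRequest {
  const trimmed = prompt.trim();
  if (trimmed.length === 0) {
    throw new Error("Prompt must not be empty");
  }
  return {
    model: ASSUMED_MODEL,
    max_tokens: maxTokens,
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: trimmed },
          { type: "image_url", image_url: { url: screenshotDataUrl } },
        ],
      },
    ],
  };
}

// Send the request and return the first choice's text (empty string if absent).
async function askAboutScreenshot(
  apiKey: string,
  prompt: string,
  screenshotDataUrl: string
): Promise<string> {
  const response = await fetch(CHAT_COMPLETIONS_URL, {
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${apiKey}`,
    },
    body: JSON.stringify(buildVisionRequest(prompt, screenshotDataUrl)),
  });
  if (!response.ok) {
    throw new Error(`OpenAI request failed with status ${response.status}`);
  }
  const data = (await response.json()) as {
    choices?: { message?: { content?: string } }[];
  };
  return data.choices?.[0]?.message?.content ?? "";
}
```

Keeping the payload construction in its own function separates it from the network call, so the request shape can be checked without an API key.
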
- `npm run build`: Build the extension
- `npm run watch`: Watch for changes and rebuild
- `npm run lint`: Run ESLint
- `npm test`: Run tests (if implemented)
- `src/`: Source TypeScript files
- `public/`: Static files (HTML, manifest)
- `dist/`: Build output (generated)
Contributions are welcome! Please feel free to submit a Pull Request.
This project is licensed under the MIT License - see the LICENSE file for details.
- OpenAI for providing the GPT-4 Vision API
- The Chrome Extensions development community