Give Cursor Agent an AI team and advanced skills

The AI Team

Perplexity to search the web and perform deep research
Gemini 2.0 for huge whole-codebase context window, search grounding and reasoning
Stagehand for browser operation to test and debug web apps (uses Anthropic or OpenAI models)

New Skills for your existing Agent

Work with GitHub Issues and Pull Requests
Generate local agent-accessible documentation for external dependencies

cursor-tools is optimized for Cursor Composer Agent but it can be used by any coding agent that can execute commands

How do I use it?

After installation, to see AI teamwork in action just ask Cursor Composer to use Perplexity or Gemini. Here are two examples:

Asking Perplexity to carry out web research

see what happens next...

see the spec composer and perplexity produced together: pac-man-spec.md (link out to the example repo)

Asking Gemini for a plan

see what happens next...

see the spec composer and perplexity produced together: pac-man-plan.md (link out to the example repo)

What is cursor-tools

cursor-tools provides a CLI that your AI agent can use to expand its capabilities. cursor-tools works with with Cursor (and is compatible with other agents), When you run cursor-tools install we automatically add a prompt section to your Cursor project rules (.cursor/rules/cursor-tools.mdc or legacy .cursorrules file) so that it works out of the box with Cursor, there's no need for additional prompts.

cursor-tools requires a Perplexity API key and a Google AI API key.

cursor-tools is an node package. You can install it globally, at a node project level or run without installation using npx.

Installation

Run the interactive setup:

npx cursor-tools@latest install .

This command will:

Add cursor-tools as a dev dependency in your package.json
Guide you through API key configuration
Update your Cursor project rules for Cursor integration (using .cursor/rules/cursor-tools.mdc or existing .cursorrules)

Requirements

Node.js 18 or later
Perplexity API key
Google Gemini API key
For browser commands:
- Playwright (npm install --global playwright)
- OpenAI API key or Anthropic API key (for act, extract, and observe commands)

cursor-tools uses Gemini-2.0 because it is the only good LLM with a context window that goes up to 2 million tokens - enough to handle and entire codebase in one shot. Gemini 2.0 experimental models that we use by default are currently free to use on Google and you need a Google Cloud project to create an API key.

cursor-tools uses Perplexity because Perplexity has the best web search api and indexes and it does not hallucinate. Perplexity Pro users can get an API key with their pro account and recieve $5/month of free credits (at time of writing). Support for Google search grounding is coming soon but so far testing has shown it still frequently hallucinates things like APIs and libraries that don't exist.

Additional Examples

To see cursor-tools GitHub and Perplexity skills: Check out this example issue that was solved using Cursor agent and cursor-tools

Tips:

Ask Cursor Agent to have Gemini review its work
Ask Cursor Agent to generate documentation for external dependencies and write it to a local-docs/ folder

If you do something cool with cursor-tools please let me know on twitter or make a PR to add to this section!

Detailed Cursor Usage

Use Cursor Composer in agent mode with command execution (not sure what this means, see section below on Cursor Agent configuration). If you have installed the cursor-tools prompt to your .cursorrules (or equivalent) just ask your AI coding agent/assistant to use "cursor-tools" to do things.

Examples usages:

Use web search

"Please implement country specific stripe payment pages for the USA, UK, France and Germany. Use cursor-tools web to check the available stripe payment methods in each country."

Note: in most cases you can say "ask Perplexity" instead of "use cursor-tools web" and it will work the same.

Use repo search

"Let's refactor our User class to allow multiple email aliases per user. Use cursor-tools repo to ask for a plan including a list of all files that need to be changed."

Note: in most cases you can say "ask Gemini" instead of "use cursor-tools repo" and it will work the same.

Use doc generation

"Use cursor-tools to generate documentation for the Github repo https://github.com/kait-http/kaito" and write it to docs/kaito.md"

Note: in most cases you can say "generate documentation" instead of "use cursor-tools doc" and it will work the same.

Use github integration

"Use cursor-tools github to fetch issue 123 and suggest a solution to the user's problem"

"Use cursor-tools github to fetch PR 321 and see if you can fix Andy's latest comment"

Note: in most cases you can say "fetch issue 123" or "fetch PR 321" instead of "use cursor-tools github" and it will work the same.

Use browser automation

"Use cursor-tools to open the users page and check the error in the console logs, fix it"

"Use cursor-tools to test the form field validation logic. Take screenshots of each state"

"Use cursor-tools to open https://example.com/foo the and check the error in the network logs, what could be causing it?"

Note: in most cases you can say "Use Stagehand" instead of "use cursor-tools" and it will work the same.

Authentication and API Keys

cursor-tools requires API keys for both Perplexity AI and Google Gemini. These can be configured in two ways:

Interactive Setup: Run cursor-tools install and follow the prompts
Manual Setup: Create ~/.cursor-tools/.env in your home directory or .cursor-tools.env in your project root:
```
PERPLEXITY_API_KEY="your-perplexity-api-key"
GEMINI_API_KEY="your-gemini-api-key"
```

AI Team Features

Perplexity: Web Search & Research

Use Perplexity AI to get up-to-date information directly within Cursor:

cursor-tools web "What's new in TypeScript 5.7?"

Gemini 2.0: Repository Context

Leverage Google Gemini 2.0 models with 1M+ token context windows for codebase-aware assistance:

cursor-tools repo "Explain the authentication flow in this project, which files are involved?"

Repository context is created using Repomix. See repomix configuration section below for details on how to change repomix behaviour.

Above 1M tokens cursor-tools will always send requests to Gemini 2.0 Pro as it is the only model that supports 1M+ tokens.

The Gemini 2.0 Pro context limit is 2M tokens, you can add filters to .repomixignore if your repomix context is above this limit.

Stagehand: Browser Automation

Automate browser interactions for web scraping, testing, and debugging:

Important: The browser command requires the Playwright package to be installed separately in your project:

npm install playwright
# or
yarn add playwright
# or
pnpm add playwright

open - Open a URL and capture page content:

# Open and capture HTML content, console logs and network activity (enabled by default)
cursor-tools browser open "https://example.com" --html

# Take a screenshot
cursor-tools browser open "https://example.com" --screenshot=page.png

# Debug in an interactive browser session
cursor-tools browser open "https://example.com" --connect-to=9222

act - Execute actions using natural language - Agent tells the browser-use agent what to do:

# Single action
cursor-tools browser act "Login as '[email protected]'" --url "https://example.com/login"

# Multi-step workflow using pipe separator
cursor-tools browser act "Click Login | Type '[email protected]' into email | Click Submit" --url "https://example.com"

# Record interaction video
cursor-tools browser act "Fill out registration form" --url "https://example.com/signup" --video="./recordings"

observe - Analyze interactive elements:

# Get overview of interactive elements
cursor-tools browser observe "What can I interact with?" --url "https://example.com"

# Find specific elements
cursor-tools browser observe "Find the login form" --url "https://example.com"

extract - Extract data using natural language:

# Extract specific content
cursor-tools browser extract "Get all product prices" --url "https://example.com/products"

# Save extracted content
cursor-tools browser extract "Get article text" --url "https://example.com/blog" --html > article.html

# Extract with network monitoring
cursor-tools browser extract "Get API responses" --url "https://example.com/api-test" --network

Browser Command Options

All browser commands (open, act, observe, extract) support these options:

--console: Capture browser console logs (enabled by default, use --no-console to disable)
--html: Capture page HTML content (disabled by default)
--network: Capture network activity (enabled by default, use --no-network to disable)
--screenshot=<file path>: Save a screenshot of the page
--timeout=<milliseconds>: Set navigation timeout (default: 120000ms for Stagehand operations, 30000ms for navigation)
--viewport=<width>x<height>: Set viewport size (e.g., 1280x720).
--headless: Run browser in headless mode (default: true)
--no-headless: Show browser UI (non-headless mode) for debugging
--connect-to=<port>: Connect to existing Chrome instance. Special values: 'current' (use existing page), 'reload-current' (refresh existing page)
--wait=<time:duration or selector:css-selector>: Wait after page load (e.g., 'time:5s', 'selector:#element-id')
--video=<directory>: Save a video recording (1280x720 resolution, timestamped subdirectory). Not available when using --connect-to
--url=<url>: Required for act, observe, and extract commands

Notes on Connecting to an existing browser session with --connect-to

DO NOT ask browser act to "wait" for anything, the wait command is currently disabled in Stagehand.
When using --connect-to, viewport is only changed if --viewport is explicitly provided
Video recording is not available when using --connect-to
Special --connect-to values:
- current: Use the existing page without reloading
- reload-current: Use the existing page and refresh it (useful in development)

Video Recording

All browser commands support video recording of the browser interaction in headless mode (not supported with --connect-to):

Use --video=<directory> to enable recording
Videos are saved at 1280x720 resolution in timestamped subdirectories
Recording starts when the browser opens and ends when it closes
Videos are saved as .webm files

Example:

# Record a video of filling out a form
cursor-tools browser act "Fill out registration form with name John Doe" --url "http://localhost:3000/signup" --video="./recordings"

Console and Network Logging

Console logs and network activity are captured by default:

Use --no-console to disable console logging
Use --no-network to disable network logging
Logs are displayed in the command output

Complex Actions

The act command supports chaining multiple actions using the pipe (|) separator:

# Login sequence with console/network logging (enabled by default)
cursor-tools browser act "Click Login | Type '[email protected]' into email | Click Submit" --url "http://localhost:3000/login"

# Form filling with multiple fields
cursor-tools browser act "Select 'Mr' from title | Type 'John' into first name | Type 'Doe' into last name | Click Next" --url "http://localhost:3000/register"

# Record complex interaction
cursor-tools browser act "Fill form | Submit | Verify success" --url "http://localhost:3000/signup" --video="./recordings"

Troubleshooting Browser Commands

Common issues and solutions:

Element Not Found Errors
- Use --no-headless to visually debug the page
- Use browser observe to see what elements Stagehand can identify
- Check if the element is in an iframe or shadow DOM
- Ensure the page has fully loaded (try increasing --timeout)
Stagehand API Errors
- Verify your OpenAI or Anthropic API key is set correctly
- Check if you have sufficient API credits
- Try switching models using --model
Network Errors
- Check your internet connection
- Verify the target website is accessible
- Try increasing the timeout with --timeout
- Check if the site blocks automated access
Video Recording Issues
- Ensure the target directory exists and is writable
- Check disk space
- Video recording is not available with --connect-to
Performance Issues
- Use --headless mode for better performance (default)
- Reduce the viewport size with --viewport
- Consider using --connect-to for development

Skills

GitHub Integration

Access GitHub issues and pull requests directly from the command line with rich formatting and full context:

# List recent PRs or issues
cursor-tools github pr
cursor-tools github issue

# View specific PR or issue with full discussion
cursor-tools github pr 123
cursor-tools github issue 456

The GitHub commands provide:

View of 10 most recent open PRs or issues when no number specified
Detailed view of specific PR/issue including:
- PR/Issue description and metadata
- Code review comments grouped by file (PRs only)
- Full discussion thread
- Labels, assignees, milestones and reviewers
Support for both local repositories and remote GitHub repositories
Markdown-formatted output for readability

Authentication Methods: The commands support multiple authentication methods:

GitHub token via environment variable: GITHUB_TOKEN=your_token_here
GitHub CLI integration (if gh is installed and logged in)
Git credentials (stored tokens or Basic Auth)

Without authentication:

Public repositories: Limited to 60 requests per hour
Private repositories: Not accessible

With authentication:

Public repositories: 5,000 requests per hour
Private repositories: Full access (with appropriate token scopes)

Documentation Generation (uses Gemini 2.0)

Generate comprehensive documentation for your repository or any GitHub repository:

# Document local repository
cursor-tools doc --output=docs.md

# Document remote GitHub repository (both formats supported)
cursor-tools doc --from-github=username/repo-name@branch
cursor-tools doc --from-github=https://github.com/username/repo-name@branch

# Save documentation to file (with and without a hint)
# This is really useful to generate local documentation for libraries and dependencies
cursor-tools doc --from-github=eastlondoner/cursor-tools --save-to=docs/CURSOR-TOOLS.md
cursor-tools doc --from-github=eastlondoner/cursor-tools --save-to=docs/CURSOR-TOOLS.md --hint="only information about the doc command"

Configuration

Default Settings

Customize cursor-tools behavior by creating a cursor-tools.config.json file:

{
  "perplexity": {
    "model": "sonar-pro",
    "maxTokens": 8000
  },
  "gemini": {
    "model": "gemini-2.0-pro-exp-02-05",
    "maxTokens": 10000
  },
  "tokenCount": {
    "encoding": "o200k_base"
  },
  "browser": {
    "defaultViewport": "1280x720",
    "timeout": 30000,
    "stagehand": {
      "env": "LOCAL",
      "headless": true,
      "verbose": 1,
      "debugDom": false,
      "enableCaching": false,
      "model": "claude-3-5-sonnet-latest", // For Anthropic provider
      "provider": "anthropic", // or "openai"
      "timeout": 30000
    }
  }
}

The configuration supports:

perplexity.model: Perplexity AI model to use
perplexity.maxTokens: Maximum tokens for Perplexity responses
gemini.model: Google Gemini model to use
gemini.maxTokens: Maximum tokens for Gemini responses
tokenCount.encoding: Tokenizer to use for counting tokens (defaults to o200k_base which is optimized for Gemini)
browser.defaultViewport: Default viewport size for browser commands
browser.timeout: Default timeout for browser commands
browser.stagehand.env: Environment for browser commands
browser.stagehand.headless: Whether to run browser in headless mode
browser.stagehand.verbose: Verbosity level for browser commands
browser.stagehand.debugDom: Whether to enable debug output for browser commands
browser.stagehand.enableCaching: Whether to enable caching for browser commands
browser.stagehand.model: The default model to use. See "Model Selection" below.
browser.stagehand.provider: The AI provider to use ("openai" or "anthropic"). Determines which API key is required.
browser.stagehand.timeout: Timeout for operations in milliseconds

GitHub Authentication

The GitHub commands support several authentication methods:

Environment Variable: Set GITHUB_TOKEN in your environment:
```
GITHUB_TOKEN=your_token_here
```
GitHub CLI: If you have the GitHub CLI (gh) installed and are logged in, cursor-tools will automatically use it to generate tokens with the necessary scopes.
Git Credentials: If you have authenticated git with GitHub (via HTTPS), cursor-tools will automatically:
- Use your stored GitHub token if available (credentials starting with ghp_ or gho_)
- Fall back to using Basic Auth with your git credentials

To set up git credentials:

Configure git to use HTTPS instead of SSH:

git config --global url."https://github.com/".insteadOf [email protected]:

Store your credentials:

git config --global credential.helper store  # Permanent storage
# Or for macOS keychain:
git config --global credential.helper osxkeychain

The next time you perform a git operation requiring authentication, your credentials will be stored

Authentication Status:

Without authentication:
- Public repositories: Limited to 60 requests per hour
- Private repositories: Not accessible
- Some features may be restricted
With authentication (any method):
- Public repositories: 5,000 requests per hour
- Private repositories: Full access (if token has required scopes)
- Access to all features

cursor-tools will automatically try these authentication methods in order:

GITHUB_TOKEN environment variable
GitHub CLI token (if gh is installed and logged in)
Git credentials (stored token or Basic Auth)

If no authentication is available, it will fall back to unauthenticated access with rate limits.

Repomix Configuration

When generating documentation, cursor-tools uses Repomix to analyze your repository. By default, it excludes certain files and directories that are typically not relevant for documentation:

Node modules and package directories (node_modules/, packages/, etc.)
Build output directories (dist/, build/, etc.)
Version control directories (.git/)
Test files and directories (test/, tests/, __tests__/, etc.)
Configuration files (.env, .config, etc.)
Log files and temporary files
Binary files and media files

You can customize the files and folders to exclude by adding a .repomixignore file to your project root.

Example .repomixignore file for a Laravel project:

vendor/
public/
database/
storage/
.idea
.env

This ensures that the documentation focuses on your actual source code and documentation files. Support to customize the input files to include is coming soon - open an issue if you run into problems here.

Model Selection

The browser commands support different AI models for processing. You can select the model using the --model option:

# Use gpt-4o
cursor-tools browser act "Click Login" --url "https://example.com" --model=gpt-4o

# Use Claude 3.5 Sonnet
cursor-tools browser act "Click Login" --url "https://example.com" --model=claude-3-5-sonnet-latest

You can set a default provider in your cursor-tools.config.json file under the stagehand section:

{
  "stagehand": {
    "provider": "openai", // or "anthropic"
  }
}

You can also set a default model in your cursor-tools.config.json file under the stagehand section:

{
  "stagehand": {
    "provider": "openai", // or "anthropic"
    "model": "gpt-4o"
  }
}

If no model is specified (either on the command line or in the config), a default model will be used based on your configured provider:

OpenAI: o3-mini
Anthropic: claude-3-5-sonnet-latest

Available models depend on your configured provider (OpenAI or Anthropic) in cursor-tools.config.json and your API key.

Cursor Configuration

cursor-tools automatically configures Cursor by updating your project rules during installation. This provides:

Command suggestions
Usage examples
Context-aware assistance

For new installations, we use the recommended .cursor/rules/cursor-tools.mdc path. For existing installations, we maintain compatibility with the legacy .cursorrules file. If both files exist, we prefer the new path and show a warning.

Cursor Agent Configuration:

To get the benefits of cursor-tools you should use Cursor agent in "yolo mode". Ideal settings:

cursor-tools cli

In general you do not need to use the cli directly, your AI coding agent will call the CLI but it is useful to know it exists and this is how it works.

Command Options

All commands support these general options:

--model: Specify an alternative model
--max-tokens: Control response length
--save-to: Save command output to a file (in addition to displaying it, like tee)
--help: View all available options (help has not been implemented for all commands yet)

Documentation command specific options:

--from-github: Generate documentation for a remote GitHub repository (supports @branch syntax)

GitHub command specific options:

--from-github: Access PRs/issues from a specific GitHub repository (format: owner/repo)
--repo: Alternative to --from-github, does the same thing (format: owner/repo)

Execution Methods

Execute commands in several ways:

# Global installation
cursor-tools web "query"

# without global installation
npx -y cursor-tools@latest web "query"

Troubleshooting

Command Not Found
- Ensure cursor-tools is installed (globally or as a dev dependency)
- Check your PATH if installed globally
API Key Errors
- Verify .cursor-tools.env exists and contains valid API keys
- Run cursor-tools install to reconfigure API keys
- Check that your API keys have the necessary permissions
- For GitHub operations, ensure your token has the required scopes (repo, read:user)
Model Errors
- Check your internet connection
- Verify API key permissions
- Ensure the specified model is available for your API tier
GitHub API Rate Limits
- GitHub API has rate limits for unauthenticated requests. For higher limits you must be authenticated.
- If you have the gh cli installed and logged in cursor-tools will use that to obtain a short lived auth token. Otherwise you can add a GitHub token to your environment:
```
GITHUB_TOKEN=your_token_here
```
- Private repositories always require authentication
Documentation Generation Issues
- Repository too large: Try using --hint to focus on specific parts
- Token limit exceeded: The tool will automatically switch to a larger model
- Network timeouts: The tool includes automatic retries
- For very large repositories, consider documenting specific directories or files
Cursor Integration
- If .cursorrules is outdated, run cursor-tools install . to update
- Ensure Cursor is configured to allow command execution
- Check that your Cursor version supports AI commands

Examples

Web Search Examples

# Get information about new technologies
cursor-tools web "What are the key features of Bun.js?"

# Check API documentation
cursor-tools web "How to implement OAuth2 in Express.js?"

# Compare technologies
cursor-tools web "Compare Vite vs Webpack for modern web development"

Repository Context Examples

# Architecture understanding
cursor-tools repo "Explain the overall architecture of this project"

# Find usage examples
cursor-tools repo "Show me examples of error handling in this codebase"

# Debugging help
cursor-tools repo "Why might the authentication be failing in the login flow?"

Documentation Examples

# Document specific aspects
cursor-tools doc --hint="Focus on the API endpoints and their usage"

# Document with custom output
cursor-tools doc --save-to=docs/architecture.md --hint="Focus on system architecture"

# Document dependencies
cursor-tools doc --from-github=expressjs/express --save-to=docs/EXPRESS.md

GitHub Integration Examples

# List PRs with specific labels
cursor-tools github pr --from-github facebook/react

# Check recent issues in a specific repository
cursor-tools github issue --from-github vercel/next.js

# View PR with code review comments
cursor-tools github pr 123 --from-github microsoft/typescript

# Track issue discussions
cursor-tools github issue 456 --from-github golang/go

Browser Command Examples

`open` subcommand examples:

# Open a URL and get HTML
cursor-tools browser open "https://example.com" --html

# Open and capture console logs and network activity
cursor-tools browser open "https://example.com" --console --network

# Take a screenshot
cursor-tools browser open "https://example.com" --screenshot=page.png

# Run in non-headless mode for debugging
cursor-tools browser open "https://example.com" --no-headless

`act`, `extract`, `observe` subcommands examples:

# AI-powered action
cursor-tools browser act "Click on 'Sign Up'" --url "https://example.com"

# AI-powered extraction
cursor-tools browser extract "Get the main content" --url "https://example.com/blog"

# AI-powered observation
cursor-tools browser observe "What can I do on this page?" --url "https://example.com"

Node Package Manager (npm)

cursor-tools is available on npm here

Contributing

Contributions are welcome! Please feel free to submit a Pull Request. If you used cursor-tools to make your contribution please include screenshots or videos of cursor-tools in action.

License

MIT License - see LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 105 Commits
.cursor/rules		.cursor/rules
local-docs		local-docs
scripts		scripts
src		src
test-videos		test-videos
tests/commands/browser		tests/commands/browser
.cursor-tools.env.example		.cursor-tools.env.example
.cursorrules		.cursorrules
.gitignore		.gitignore
.prettierignore		.prettierignore
.prettierrc		.prettierrc
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
TODO.md		TODO.md
build.js		build.js
cursor-tools.config.json		cursor-tools.config.json
eslint.config.js		eslint.config.js
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
test-output.txt		test-output.txt
tsconfig.json		tsconfig.json

License

eastlondoner/cursor-tools

Folders and files

Latest commit

History

Repository files navigation

Give Cursor Agent an AI team and advanced skills

The AI Team

New Skills for your existing Agent

How do I use it?

Asking Perplexity to carry out web research

Asking Gemini for a plan

What is cursor-tools

Installation

Requirements

Additional Examples

Detailed Cursor Usage

Use web search

Use repo search

Use doc generation

Use github integration

Use browser automation

Authentication and API Keys

AI Team Features

Perplexity: Web Search & Research

Gemini 2.0: Repository Context

Stagehand: Browser Automation

Browser Command Options

Video Recording

Console and Network Logging

Complex Actions

Troubleshooting Browser Commands

Skills

GitHub Integration

Documentation Generation (uses Gemini 2.0)

Configuration

Default Settings

GitHub Authentication

Repomix Configuration

Model Selection

Cursor Configuration

Cursor Agent Configuration:

cursor-tools cli

Command Options

Execution Methods

Troubleshooting

Examples

Web Search Examples

Repository Context Examples

Documentation Examples

GitHub Integration Examples

Browser Command Examples

open subcommand examples:

act, extract, observe subcommands examples:

Node Package Manager (npm)

Contributing

Sponsors

Vinta.app

Resoled.it

iterate.com

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

`open` subcommand examples:

`act`, `extract`, `observe` subcommands examples:

Packages