Generate sitemaps from GitHub repositories to train custom AI chatbots. This tool helps you create AI assistants that can answer questions about your codebase and documentation.
- Generate sitemaps from any public GitHub repository
- Support for various GitHub URL formats (HTTPS, SSH, specific branches)
- Easy integration with CustomGPT.ai for creating AI chatbots using no-code.
- Clean, user-friendly interface.
- Support for repositories up to 50,000 files
- Real-time validation and error handling
- Automatic sitemap generation and hosting
- Create AI documentation assistants for your projects
- Build code-aware chatbots for developer support
- Generate knowledge bases from GitHub wikis and docs
- Build RAG models on your repository's content
- Make your codebase more accessible through natural language
- Visit GitHub Sitemap Generator
- Enter your GitHub repository URL, Click "Generate Sitemap" and Copy the generated sitemap URL.
- Use the Sitemap URL in CustomGPT.ai to create your coding assistant chatbot.
git clone [email protected]:adorosario/github2customgpt.git
cd github2customgpt
docker-compose up --build
Visit http://localhost:8501
in your browser.
-
Clone the repository:
git clone github2customgpt cd github-sitemap-generator
-
Install dependencies:
pip install -r requirements.txt
-
Configure AWS credentials in
.streamlit/config.toml
:[aws_s3] accountid = "your-account-id" access_key_id = "your-access-key" access_key_secret = "your-secret-key"
-
Run the application:
streamlit run github_sitemap_generator.py
-
Repository Selection:
- Choose repositories with good documentation
- Include README files, wikis, and inline comments
- Markdown files work best for training
-
URL Formats:
- Main branch:
https://github.com/username/repository
- Specific branch:
https://github.com/username/repository/tree/branch-name
- SSH format:
[email protected]:username/repository.git
- Main branch:
-
Best Practices:
- Start with smaller public repositories for quicker testing
- Focus on well-documented code sections
- Include documentation directories
- Ensure repository is public
- Only works with public GitHub repositories
- Maximum 50,000 files per repository
- Some file types may not be suitable for AI training (e.g images)
- Repository must be accessible via GitHub API
- Private repositories are not supported
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature
) - Commit your changes (
git commit -m 'Add some AmazingFeature'
) - Push to the branch (
git push origin feature/AmazingFeature
) - Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Built with Streamlit
- Powered by CustomGPT
- Uses GitHub API
If you encounter any issues or have questions, open an issue in this repository