Google GenAI Starter Template

A comprehensive FastAPI starter template for working with Google's Gemini AI models, supporting text, image, and audio processing.

Features

  • Text Processing
    • Text generation with prompts
    • Structured chat conversations
    • Context-aware text generation
  • Image Processing
    • Single image analysis
    • Multiple image analysis
    • Image comparison
  • Audio Processing
    • Audio transcription
    • Content analysis
    • Audio summarization

Installation

  1. Clone the repository:
git clone https://github.com/yourusername/google-genai-starter-template.git
cd google-genai-starter-template
  2. Install dependencies:
pip install -r requirements.txt
  3. Install additional audio dependencies:
# For Ubuntu/Debian
sudo apt-get install ffmpeg

# For macOS
brew install ffmpeg

# For Windows
# Download ffmpeg from https://ffmpeg.org/download.html
  4. Set up environment variables (a sketch of reading the key in application code follows these steps):
export GOOGLE_API_KEY="your_api_key_here"
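
The server reads this key at startup. As a minimal sketch of how application code can pick the key up, assuming the google-generativeai package listed under Dependencies (the model name and module layout here are illustrative, not the template's actual code):

import os

import google.generativeai as genai

# Read the key exported in the previous step; raises KeyError if it is missing.
api_key = os.environ["GOOGLE_API_KEY"]

# Configure the Gemini client once at application startup.
genai.configure(api_key=api_key)

# Quick sanity check that the key works; the model name is illustrative.
model = genai.GenerativeModel("gemini-1.5-flash")
print(model.generate_content("Say hello in one sentence.").text)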

Usage

  1. Start the server:
uvicorn app.main:app --reload
  2. Access the API documentation:
  • Swagger UI: http://localhost:8000/docs
  • ReDoc: http://localhost:8000/redoc

API Endpoints

Text Processing

POST /api/text/generate
POST /api/text/chat
POST /api/text/context
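
The text routes accept JSON bodies like the ones shown under Example Requests. As a minimal sketch of how /api/text/generate could be wired, assuming a Pydantic request model and the google-generativeai package (the router, schema, and model names here are illustrative, not the template's actual code):

from fastapi import APIRouter
from pydantic import BaseModel
import google.generativeai as genai

router = APIRouter(prefix="/api/text")

class GenerateRequest(BaseModel):
    # Field names mirror the example request further below; the actual schema may differ.
    prompt: str
    temperature: float = 0.7

@router.post("/generate")
async def generate_text(request: GenerateRequest):
    # Generate a completion with the requested sampling temperature.
    model = genai.GenerativeModel("gemini-1.5-flash")
    response = model.generate_content(
        request.prompt,
        generation_config={"temperature": request.temperature},
    )
    return {"text": response.text}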

Image Processing

POST /api/image/analyze
POST /api/image/analyze-multiple
POST /api/image/compare

Audio Processing

POST /api/audio/transcribe
POST /api/audio/analyze
POST /api/audio/summarize
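
Unlike the text routes, the image and audio routes take multipart form uploads. As a rough sketch of that upload pattern for /api/image/analyze, assuming FastAPI's UploadFile, Pillow, and the google-generativeai package (names are illustrative, not the template's actual code); the audio routes follow the same shape with an audio file instead of an image:

import io

from fastapi import APIRouter, File, Form, UploadFile
from PIL import Image
import google.generativeai as genai

router = APIRouter(prefix="/api/image")

@router.post("/analyze")
async def analyze_image(
    image: UploadFile = File(...),
    prompt: str = Form("Describe this image"),
    temperature: float = Form(0.7),
):
    # Load the uploaded bytes into a PIL image, which Gemini accepts directly.
    pil_image = Image.open(io.BytesIO(await image.read()))

    model = genai.GenerativeModel("gemini-1.5-flash")
    response = model.generate_content(
        [prompt, pil_image],
        generation_config={"temperature": temperature},
    )
    return {"text": response.text}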

Example Requests

Text Generation

import requests

response = requests.post(
    "http://localhost:8000/api/text/generate",
    json={
        "prompt": "Write a short story about AI",
        "temperature": 0.7
    }
)
print(response.json())

Image Analysis

import requests

# Send the image as a multipart upload; prompt and temperature travel as plain form fields.
with open('image.jpg', 'rb') as image_file:
    response = requests.post(
        "http://localhost:8000/api/image/analyze",
        files={'image': image_file},
        data={
            'prompt': 'Describe this image',
            'temperature': '0.7'
        }
    )
print(response.json())

Audio Processing

import requests

# Upload the audio file as multipart form data; the language code is a plain form field.
with open('audio.mp3', 'rb') as audio_file:
    response = requests.post(
        "http://localhost:8000/api/audio/transcribe",
        files={'audio_file': audio_file},
        data={'language': 'en-US'}
    )
print(response.json())

Dependencies

  • FastAPI
  • google-generativeai
  • Pillow
  • SpeechRecognition
  • pydub
  • python-multipart
  • uvicorn
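
These correspond to a requirements.txt along the following lines (left unpinned here; the repository's actual file may pin specific versions):

fastapi
uvicorn
google-generativeai
Pillow
SpeechRecognition
pydub
python-multipart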

Development

  1. Install development dependencies:
pip install pytest black isort flake8
  2. Run tests (a minimal example test is sketched after these commands):
pytest
  3. Format code:
black .
isort .
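
A minimal example test, assuming the FastAPI instance is importable from app.main as in the Usage section (recent FastAPI versions require the httpx package for TestClient):

from fastapi.testclient import TestClient

from app.main import app

client = TestClient(app)

def test_docs_are_served():
    # FastAPI serves the interactive Swagger UI at /docs by default.
    response = client.get("/docs")
    assert response.status_code == 200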

Contributing

  1. Fork the repository
  2. Create your feature branch
  3. Commit your changes
  4. Push to the branch
  5. Create a new Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Support

For support, please open an issue in the GitHub repository.
