🔍 Semantic Search with Weaviate and Sentence Embeddings

This project demonstrates how to build a semantic search system with Weaviate and Sentence Transformers that retrieves contextually relevant content from web pages. It fetches the raw HTML from a given URL, cleans and chunks the text, generates an embedding for each chunk, and stores the embeddings in Weaviate for semantic querying.

🚀 Project Overview

🧠 Backend – FastAPI

The FastAPI backend handles:

🔗 Fetching HTML content from a given URL
🧹 Cleaning and chunking the text content
🧬 Generating sentence embeddings using Sentence Transformers
📦 Storing the embeddings and metadata in Weaviate
🔍 Performing semantic search on the stored embeddings
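
The cleaning and chunking steps above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the names clean_html, chunk_text, and max_chars are hypothetical, and a simple regex stands in for the nltk sentence tokenizer so the example runs with the standard library alone.

```python
import re
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> contents."""

    SKIP_TAGS = {"script", "style"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is outside skipped tags.
        if self._skip_depth == 0 and data.strip():
            self.parts.append(data.strip())


def clean_html(html):
    """Hypothetical helper: strip tags and collapse whitespace."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return re.sub(r"\s+", " ", " ".join(parser.parts))


def chunk_text(text, max_chars=200):
    """Hypothetical helper: group whole sentences into chunks of at most
    max_chars characters. A single sentence longer than max_chars is kept
    as its own chunk. The regex split is an assumption standing in for
    nltk's sentence tokenizer."""
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text) if s]
    chunks, current = [], ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

Each resulting chunk would then be embedded and stored in Weaviate together with its metadata. Chunking on sentence boundaries, rather than at fixed character offsets, keeps each embedded unit readable on its own.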

🖥️ Frontend – React

The React-based frontend allows users to:

Enter a webpage URL
Input a semantic search query
View the top-k relevant results retrieved from the backend
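
To make "top-k relevant results" concrete: the results are the k stored chunks whose embeddings are closest to the query embedding. In this project Weaviate performs that vector search itself. The sketch below only illustrates the idea in plain Python, assuming cosine similarity as the metric; cosine_similarity and top_k are hypothetical names, and the short vectors are toy values rather than real embeddings.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # Treat a zero vector as similar to nothing.
    return dot / (norm_a * norm_b)


def top_k(query_vec, items, k=3):
    """Return the k (score, text) pairs most similar to query_vec.

    items is a list of (text, vector) pairs.
    """
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in items]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Toy data: three chunks with 2-dimensional stand-in vectors.
items = [("a", [1.0, 0.0]), ("b", [0.0, 1.0]), ("c", [1.0, 1.0])]
results = top_k([1.0, 0.0], items, k=2)
```

A vector database avoids this exhaustive comparison by using an approximate nearest-neighbor index, which is why the stored embeddings can be queried quickly.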

📦 Requirements

Make sure you have the following installed:

Python 3.8+
FastAPI
Weaviate (local or Docker)
sentence-transformers
nltk
requests
React 18+

Install Python dependencies via:

pip install -r requirements.txt

⚙️ Setup Instructions

Start Weaviate Locally
The project root directory includes a docker-compose.yaml file for running Weaviate locally.

docker-compose up -d

Run the Backend (FastAPI)

Navigate into the backend directory and start the FastAPI server:

cd backend
uvicorn main:app --reload

Run the Frontend (React)

Navigate into the frontend directory, install dependencies, and start the development server:

cd frontend
npm install
npm run dev
