SmartSearch: Build your own conversational search engine with LLMs

🇨🇳中文 | 🌐English

Online Demo

SmartSearch: Build your own conversational search engine with LLMs

Features

Built-in support for LLM, you can build API with local model
Support OpenAI LLMs, such as gpt-4
Built-in support for search engine
Customizable pretty UI interface
Shareable, cached search results
Support for follow-up questions, continuous Q&A
Supports query analysis, rewrites queries based on context for precise search

Setup Search Engine API

There are two default supported search engines: Bing and Google.

Bing Search

To use the Bing Web Search API, please visit this link to obtain your Bing subscription key.

Google Search

You have three options for Google Search: you can use the SearchApi Google Search API from SearchApi, Serper Google Search API from Serper, or opt for the Programmable Search Engine provided by Google.

Setup LLM and KV

Note

We recommend using the built-in llm and kv functions with Lepton. Running the following commands to set up them automatically.

pip install -U leptonai && lep login
pip install -r requirements.txt

Build and Run

. Build web

cd web && npm install && npm run build

Run server

export BING_SEARCH_V7_SUBSCRIPTION_KEY=YOUR_BING_SUBSCRIPTION_KEY
BACKEND=BING python search.py

alternatively, you can run server with Google Search API.

Using Google Search Api

For Google Search using SearchApi:

export SEARCHAPI_API_KEY=YOUR_SEARCHAPI_API_KEY
BACKEND=SEARCHAPI python search.py

For Google Search using Serper:

export SERPER_SEARCH_API_KEY=YOUR_SERPER_API_KEY
BACKEND=SERPER python search.py

For Google Search using Programmable Search Engine:

export GOOGLE_SEARCH_API_KEY=YOUR_GOOGLE_SEARCH_API_KEY
export GOOGLE_SEARCH_CX=YOUR_GOOGLE_SEARCH_ENGINE_ID
BACKEND=GOOGLE python search.py

ok, now your search app running on http://0.0.0.0:8080

Deploy

You can deploy your own version via

lep photon run -n search-with-lepton-modified -m search.py --env BACKEND=BING --env BING_SEARCH_V7_SUBSCRIPTION_KEY=YOUR_BING_SUBSCRIPTION_KEY

Learn more about lep photon here.

Deployment Configurations

Here are the configurations you can set for your deployment:

Name: The name of your deployment, like "my-search"
Resource Shape: most of heavy lifting will be done by the LLM server and the search engine API, so you can choose a small resource shape. cpu.small is usually good enough.

Then, set the following environmental variables.

BACKEND: the search backend to use. If you don't have bing or google set up, simply use LEPTON to try the demo. Otherwise, do BING, GOOGLE, SERPER or SEARCHAPI.
LLM_MODEL: the LLM model to run. We recommend using mixtral-8x7b, but if you want to experiment other models, you can try the ones hosted on LeptonAI, for example, llama2-70b, llama2-13b, llama2-7b. Note that small models won't work that well.
KV_NAME: the Lepton KV to use to store the search results. You can use the default search-with-lepton.
RELATED_QUESTIONS: whether to generate related questions. If you set this to true, the search engine will generate related questions for you. Otherwise, it will not.
GOOGLE_SEARCH_CX: if you are using google, specify the search cx. Otherwise, leave it empty.
LEPTON_ENABLE_AUTH_BY_COOKIE: this is to allow web UI access to the deployment. Set it to true.

In addition, you will need to set the following secrets:

LEPTON_WORKSPACE_TOKEN: this is required to call Lepton's LLM and KV apis. You can find your workspace token at Settings.
BING_SEARCH_V7_SUBSCRIPTION_KEY: if you are using Bing, you need to specify the subscription key. Otherwise it is not needed.
GOOGLE_SEARCH_API_KEY: if you are using Google, you need to specify the search api key. Note that you should also specify the cx in the env. If you are not using Google, it is not needed.
SEARCHAPI_API_KEY: if you are using SearchApi, a 3rd party Google Search API, you need to specify the api key.
OPENAI_API_KEY: if you are using OpenAI, you need to specify the api key.
OPENAI_BASE_URL: if you are using OpenAI, you can specify the base url. It is usually https://api.openai.com/v1.

Todo

Support multi-round retrieval, mainly displaying multi-round retrieval results on the page.
Support third-party LLM's API, such as qwen, baichuan, etc.
Mini program support, currently only supports web end.

Contact

Issue(suggestion):
Email: xuming: [email protected]
Wechat: Add my Wechat ID: xuming624, note: name-company-NLP to join the NLP discussion group.

License

The license agreement is The Apache License 2.0, which can be used for commercial purposes for free. Please include SmartSearch's link and license agreement in the product description.

Contribute

The project code is still rough, if everyone has improvements to the code, welcome to submit back to this project.

Reference

leptonai/search_with_lepton

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
docs		docs
web		web
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md
requirements.txt		requirements.txt
run.sh		run.sh
search.py		search.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SmartSearch: Build your own conversational search engine with LLMs

Features

Setup Search Engine API

Bing Search

Google Search

Setup LLM and KV

Build and Run

Using Google Search Api

Deploy

Deployment Configurations

Todo

Contact

License

Contribute

Reference

About

Releases

Packages

Languages

License

anthonyyuan/SmartSearch

Folders and files

Latest commit

History

Repository files navigation

SmartSearch: Build your own conversational search engine with LLMs

Features

Setup Search Engine API

Bing Search

Google Search

Setup LLM and KV

Build and Run

Using Google Search Api

Deploy

Deployment Configurations

Todo

Contact

License

Contribute

Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages