Create you AI or LLM Strategy

Large-scale language models like GPT-4 from OpenAI serve as foundational technologies that can be applied to virtually any business issue. However, the robust power and flexibility of this technology come with a significant challenge: it is extremely difficult to pinpoint the optimal opportunities for leveraging this technology within a company.

This project is designed to assist analytics leaders, product managers, and development teams in surmounting these obstacles by demonstrating the technology's application across a variety of common business problems. The project unfolds through a series of episodes, each accompanied by the following resources:

Walkthrough videos available on Prolego's YouTube Channel.
Tagged releases on the main branch of this repository.
Conversations held within Prolego's Discord community.

Capabilities approach

We advise our clients to take a capabilties-based approach when building their AI. That is, create foundational solutions that allow you to solve many different business use cases. Unfortunately too many teams begin solving specifing business problems withough building a generalizable foundation.

Most companies are developing the following capabilties as part of their AI strategy.

Capability	Explanation	Examples
text classification	Assigning categories to documents or document sections.	Episode 2
information extraction	Pulling out names, places, or specific sections from documents.	Episode 2
semantic search	Finding information based on its meaning instead of keywords.	Episode 1, Episode 4
information summarization	Condense extensive documents into concise and essential highlights.
information comparison	Identifying similar documents or sections of documents.
document generation	Creating precisely written content consistent with style and needs. Often includes a review step.	Episode 2
unified natural language query	Empowering anyone to get answers to questions about data and documents without SQL or tools.	Episode 3, Episode 4, Episode 5
routine task automation	Automating analysis of information from various sources, reasoning across them, and making decisions.

Quickstart installation

First install the neo-sophia code on your local machine before proceeding to the examples from the Episodes below.

Install the base neo-sophia code

git clone https://github.com/prolego-team/neo-sophia.git
conda env create -f neo-sophia/env.yml
conda activate neosophia 
pip install -e neo-sophia
cd neo-sophia
cp config_example.json config.json
cp openai_api_key_example.txt openai_api_key.txt

Setup your local configuration.

Change the path locations in config.json or use the defaults.
Add your OpenAI API key to openai_api_key.txt.

Run the tests.

./test.sh

If the tests pass you are ready to run the code in one of the Episodes.

Questions? Just ask in our Discord Community.

Episodes

Episode 5 - Unified NLQ with multiple tables

Episode 4 - Unified NLQ with a table and document

Episode 3 - Intro to Unified Natural Language Query

What will be the AI "killer app" in the enterprise? Our bet is Unified Natural Language Query (NQL). It gives executives and business leaders the ability to get insights from data by asking "natural" questions, similar to how you currently use ChatGPT. In this Episode we describe the business problem and show the extensible power of a simple example of SQL generation supplemented with the reasoning power of an LLM like GPT-4.

Videos (coming soon!)

[Unified Natural Language Query is the enerprise AI "killer app"]
[SQL generation and interpretation with LLMs]

Checkout Episode 3, Release v0.3.2

    git checkout tags/v0.3.2

Start the demo by running

    python -m examples.sqlite_chat

Episode 2 - Automated document processing

Every company has businesses processes that require ingesting and processing a stream of text documents. Most of this processing requires tedious human effort to find, edit, review, summarize, score, etc. chunks of text from larger documents. In this Episode we demonstrate a generalized approach for solving many of these problems using LLMs. The example takes a set of SEC 10-Q company filings and replaces the "Basis of Presentation" section with different text based on an editable templates.

Videos (coming soon!)

[Your AI strategy “quick win” - automated document processing]
[Audomated document processing - technical walkthrough]

Running the Episode 2 application

Checkout Episode 2, Release v0.2.0

    git checkout tags/v0.2.0

Start the demo by running

    python -m examples.generate_10q_basis

Episode 1 - Embeddings, Semantic search and Document Q&A

Most companies are struggling to pick the best AI use cases from many different options. By building a core competency in document embeddings you can begin developing a set of capabilities applicable for many enterprise use cases. In Episoide 1 we provide a primer on embeddings for a business audience and demonstrate the use of embeddings in semantic search and document Q&A.

This episode uses data from the MSRB Regulatory Rulebook

Videos

Running the Episode 1 application

Checkout Episode 1, Release v0.1.1

    git checkout tags/v0.1.1

Extract text from the MSRB Rulebook:

    python -m scripts.download_and_extract_msrb

Start the demo by running

    python -m examples.interface

About Prolego & Neo Sophia

Prolego is an AI services company that started in 2017 and has helped some of the world’s biggest companies generate opportunities with AI. "Prolego" is the Greek word for "predict". We needed a name for this repo and decided to use the Greek words for "new" (neo) and "wisdom" (sophia). And we just thought that Neo Sophia sounded cool.

The team:

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
data		data
examples		examples
scripts		scripts
src/neosophia		src/neosophia
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
config_example.json		config_example.json
env.yml		env.yml
openai_api_key_example.txt		openai_api_key_example.txt
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
test.sh		test.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Create you AI or LLM Strategy

Capabilities approach

Quickstart installation

Install the base neo-sophia code

Setup your local configuration.

Run the tests.

Episodes

Episode 5 - Unified NLQ with multiple tables

Episode 4 - Unified NLQ with a table and document

Episode 3 - Intro to Unified Natural Language Query

Episode 2 - Automated document processing

Running the Episode 2 application

Episode 1 - Embeddings, Semantic search and Document Q&A

Running the Episode 1 application

About Prolego & Neo Sophia

About

Releases

Packages

Languages

License

sachins-eng/neo-sophia

Folders and files

Latest commit

History

Repository files navigation

Create you AI or LLM Strategy

Capabilities approach

Quickstart installation

Install the base neo-sophia code

Setup your local configuration.

Run the tests.

Episodes

Episode 5 - Unified NLQ with multiple tables

Episode 4 - Unified NLQ with a table and document

Episode 3 - Intro to Unified Natural Language Query

Episode 2 - Automated document processing

Running the Episode 2 application

Episode 1 - Embeddings, Semantic search and Document Q&A

Running the Episode 1 application

About Prolego & Neo Sophia

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages