Skip to content

DSensei is a Slack bot that helps you do data analytics with natural language.

License

Notifications You must be signed in to change notification settings

jeffskafi/dsensei

 
 

Repository files navigation

DSensei

Are you tired of struggling with complex SQL queries just to retrieve data from your databases? Meet DSensei, the open source Slack bot that makes it easy and natural to find the information you need. By harnessing the power of ChatGPT, DSensei understands natural language commands and can query databases such as BigQuery, MySQL, and PostgreSQL on your behalf.

But DSensei is more than just a chatbot — it's your personal data sensei. With its intuitive interface and powerful capabilities, DSensei empowers you to effortlessly explore and analyze your data. Say goodbye to tedious SQL queries and hello to a smarter way of working. Install DSensei in your Slack workspace today and start discovering new insights in seconds.

We're passionately developing this project and would love for you to be a part of our community on Discord where you can receive the latest news, report bugs, and make feature requests. Please also feel free to submit any feedback on github directly.

Table of Contents

Demo

Complex query, get number of movies, most popular movie and most popular actor.

Installation

Prerequisites

Before installing DSensei, ensure that you have the following:

  • Node version >= 18
  • A Slack workspace
  • Admin access to the Slack workspace
  • OpenAI API key for GPT-3 authentication
  • Database credentials for MySQL, PostgreSQL, or BigQuery

Set Up the Slack App

  1. Navigate to the Slack API website and sign in to your Slack workspace.
  2. Click on the "Create an app" button to create a new app, and select "From an app manifest".
  3. Select the workspace you want to install the app into.
  4. Copy and paste the following manifest as YAML format.
display_information:
  name: sensei
features:
  app_home:
    home_tab_enabled: false
    messages_tab_enabled: true
    messages_tab_read_only_enabled: false
  bot_user:
    display_name: sensei
    always_online: true
  slash_commands:
    - command: /info
      description: Get information about DB
      usage_hint: /info [dbs] | [tables db] | [schema db.table]
      should_escape: false
oauth_config:
  scopes:
    bot:
      - app_mentions:read
      - chat:write
      - commands
      - im:history
settings:
  event_subscriptions:
    bot_events:
      - app_mention
      - message.im
  interactivity:
    is_enabled: true
  org_deploy_enabled: false
  socket_mode_enabled: true
  token_rotation_enabled: false
  1. Click "Create", then click "Install the App to Your Workspace".
  2. Verify the app has been installed by checking the "Apps" section in the sidebar of your Slack workspace.

Setup OpenAI API

Setup the App

  • Run npm install
  • Rename .env.example to .env
  • Setup Slack credentials:
    • Goto https://api.slack.com/apps and select the app you just created.
    • Slack bot token (SLACK_BOT_TOKEN) can be found in the "OAuth & Permissions" tab, it should start with xoxb- (screenshot).
    • Slack signing secret (SLACK_SIGNING_SECRET) can be found under the "Basic Information" tab, as "Signing Secret" (screenshot).
    • You need to generate a Slack app token (SLACK_APP_TOKEN) in the "Basic Information" tab, by clicking the "Generate Token and Scopes" button, and choose the connections:write scope. The Slack App Token should start with xapp- (screenshot).
  • Setup the OpenAI API key:
    • You can find your OpenAI API key on this page.
  • Setup the DB Access. Currently only BigQuery, MySQL and pgSQL are supported, and the system only allows for one single type of DB connection.
    • For BigQuery, a gcp access key is needed. Please follow this gcp doc to generate an service account key, and set the BQ_KEY field to the path to your key file.
      • If you use the default roles, please grant the following two roles to the acocunt:
        • BigQuery Data Viewer
        • BigQuery Job User
      • Or if you prefer to use a custom role, please make user the following permissions are granted to the role:
        • bigquery.datasets.get
        • bigquery.jobs.create
        • bigquery.tables.get
        • bigquery.tables.getData
        • bigquery.tables.list
    • For MySQL and PgSQL, a standard db connection string should be used (see the example in the .env.example).
  • [Optional] Whitelist databases and tables.
    • You might want to limit the databases / tables this tool can access, you can do so by list the databases in a comma separated string in the DATABASES field, and / or comma separated dbname.tablename list in the TABLES field.

Start the App

Run npm run prod

In addition to the environment variables mentioned above, here are some additional variables:

  • ENABLE_EMBEDDING_INDEX (boolean): Toggle the embedding index for tables. By default it is disabled, enable this if you have more than 10 tables.
  • PORT (int): Port to listen on for the app.
  • ENABLE_DEBUG_LOGGING (boolean): Toggle the debug logging (this needs be set outside the .env file).

Usage

Accessing Database Metadata

You can list all databases / tables and check table schema by using the /info slach command.

  • /info dbs command lists all databases accessible via DSensei.
  • /info tables dbname command lists all tables in the database dbname.
  • /info schema dbname.tablename command shows the schema of table dbname.tablename.

Querying Database

To interact with DSensei in your Slack workspace, type @dsensei in any channel where you want to run a query and enter your query request, for example: "@dsensei, show me the number of daily active users in the past 7 days".

DSensei will translate your request to an SQL statement, run the SQL, and reply with the result in a thread. If you think the query is incorrect, you can directly edit the query in Slack, and make it re-run the query. You can also ask follow-up questions in the same thread if you want to dive deeper. DSensei includes the SQL statement being used to get the result, which allows you to understand how the query was constructed, and make changes or improvements as needed. This feature helps you to build upon previous queries, without starting from scratch every time.

With DSensei, you can easily access data insights through natural language processing, without the need for complex coding skills or extensive knowledge of SQL syntax. DSensei makes it easy to fetch the data you require and focus on analyzing it to drive business value.

More Demos

Show databases, tables and table schema: Edit queries in place if there are errors in the query:

Known Issues

  • No high availability support. Multiple instances will not work because this is a stateful service.
  • Multiple people chatting in the same thread may confuse ChatGPT. Please keep only one active questions in one thread.
  • Restarting the service will cause the server lose track of all states, so you will not be able to follow up in threads created before the restart.

For any issues not covered here and feature request, please submit via the Github issue or join our Discord.

About

DSensei is a Slack bot that helps you do data analytics with natural language.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • TypeScript 98.6%
  • JavaScript 1.4%