Starred repositories
Simple Chainlit UI for running LLMs locally using Ollama and LangChain
A curated list of open source tools used in analytics platforms and the data engineering ecosystem
The Data Contract Specification Repository
A one-of-a-kind resume builder that keeps your privacy in mind. Completely secure, customizable, portable, open-source and free forever. Try it out today!
A non-profit, open source project to make Vedic Astrology easily available to all.
Claude Engineer is an interactive command-line interface (CLI) that leverages the power of Anthropic's Claude-3.5-Sonnet model to assist with software development tasks. This framework enables Claud…
Free, simple, and intuitive online database diagram editor and SQL generator.
The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
DataOps Data Quality TestGen is part of DataKitchen's Open Source Data Observability. DataOps TestGen delivers simple, fast data quality test generation and execution by data profiling, new datase…
Installer for DataKitchen's Open Source Data Observability Products. Data breaks. Servers break. Your toolchain breaks. Ensure your team is the first to know and the first to solve with visibility …
Actively curated list of awesome BI tools. PRs welcome!
Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observability. Configure data quality checks from the UI or in YAM…
Data quality checking framework for PySpark in Databricks based on the Great Expectations framework
PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your PDF files into a chatbot!
Local PDF Chat Application with Mistral 7B LLM, LangChain, Ollama, and Streamlit
Official Electron build of draw.io
"Computing should be taught as a rigorous - but fun - discipline covering topics like programming, database structures, and algorithms. That doesn't have to be boring." -- Geoff Mulgan
SQL Queries & Alerts for Databricks System Tables access.audit Logs
Building a real-time alert monitoring pipeline that sends email notifications using Azure Event Hubs, Azure Databricks, and an Azure Logic App
Examples of metadata driven SQL processes implemented in Databricks
Tutorial and examples of Data Quality in Big Data Systems
A simple VS Code devcontainer setup for local PySpark development