Stars
SuperSonic is the next-generation AI+BI platform that unifies Chat BI (powered by LLM) and Headless BI (powered by semantic layer) paradigms.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
"Hung-yi Lee Deep Learning Tutorial" (recommended by Professor Hung-yi Lee 👍, the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases
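"A Guide to Using Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment.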
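Regularly sharing high-quality, interesting, and practical open-source technical tutorials, developer tools, programming websites, and tech news from GitHub. A list of cool, interesting projects on GitHub.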
Greenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.
SQL Lineage Analysis Tool powered by Python
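🔥🔥 AllData is a definable data middle platform. With the data platform as its foundation, the data middle platform as its bridge, the machine learning platform as its factory, and large-model applications as its upstream products, it provides end-to-end digital solutions. To purchase the commercial edition or join the technical community: https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo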
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
Question and Answer based on Anything.
🔥🔥🔥 AI-driven database tool and SQL client. The hottest GUI client, supporting MySQL, Oracle, PostgreSQL, DB2, SQL Server, SQLite, H2, ClickHouse, and more.
A series of large language models developed by Baichuan Intelligent Technology
A 13B large language model developed by Baichuan Intelligent Technology
An Autonomous LLM Agent for Complex Task Solving
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
A BI visualization system with free drag-and-drop layout. Supports mainstream relational databases such as MySQL, Oracle, and PostgreSQL, as well as Apache Doris.
Apache Atlas - Open Metadata Management and Governance capabilities across the Hadoop platform and beyond
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
Dapr is a portable, event-driven runtime for building distributed applications across cloud and edge.
collabH / flinkx-web
Forked from birdLark/LarkMidTable
A distributed data middle platform product based on flinkx
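A visual interface integrated with DataX: select a data source to generate a data synchronization task with one click. Supports data sources such as RDBMS, Hive, HBase, ClickHouse, and MongoDB; batch creation of RDBMS data synchronization tasks; integration with an open-source scheduling system; distributed and incremental data synchronization; real-time viewing of run logs; monitoring of executor resources; killing running processes; and encryption of data source information.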
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
Extends the real-time SQL of open-source Flink. It mainly implements joins between streams and dimension tables, and supports all native Flink SQL syntax.
A Cloud Native traffic orchestration system