Qbeast Analytics
- Barcelona
- https://qbeast.io/
- @paolapardoat
Starred repositories
Open, Multi-modal Catalog for Data & AI
The official home of the Presto distributed SQL query engine for big data
A simple macOS application that will prevent iTunes or Apple Music from launching.
Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination…
The Open-Source toolkit to build your own reliable and secure Industrial IoT platform.
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks
Upserts, Deletes And Incremental Processing on Big Data.
Full stack application platform for building stateful microservices, streaming APIs, and real-time UIs
MinIO is a high-performance, S3 compatible object store, open sourced under GNU AGPLv3 license.
QuestDB is a high performance, open-source, time-series database
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Write data & AI pipelines in (SQL, Spark, Pandas) and deploy to the cloud, simplified
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
The missing star history graph of GitHub repos - https://star-history.com
This repository started out as a learning in public project for myself and has now become a structured learning map for many in the community. We have 3 years under our belt covering all things Dev…
A Scala API for Apache Beam and Google Cloud Dataflow.
A GitHub API client to extract events and actions, and load them into a database
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
An sbt plugin for publishing Scala/Java projects to Maven Central.
Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such …
A simple Spark-powered ETL framework that just works 🍺