Stars
- All languages
- ABAP
- ANTLR
- AsciiDoc
- C
- C#
- C++
- CSS
- Clojure
- CoffeeScript
- Common Workflow Language
- Dart
- Dockerfile
- Elm
- Erlang
- Fluent
- FreeMarker
- Go
- Groovy
- HTML
- Haskell
- Java
- JavaScript
- Julia
- Jupyter Notebook
- Kotlin
- Less
- Lua
- MATLAB
- Makefile
- Markdown
- NSIS
- OCaml
- PHP
- PLpgSQL
- Perl
- Prolog
- Python
- R
- Racket
- Ruby
- Rust
- SCSS
- Scala
- Shell
- Starlark
- Stata
- TLA
- TeX
- TypeScript
- Vala
- Vue
- XSLT
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
A library that brings useful functions from various modern database management systems to Apache Spark
Sistema de vendas presenciais para Feira Outras, autogeridas, do Instituto Pontes e Borboletas (IPB). Stacks: Next.js, Node.js, TypeScript, Tailwind, ShadcnUI
An extremely fast Python package and project manager, written in Rust.
Turning PySpark Into a Universal DataFrame API
A library to expose more of Apache Spark's metrics system
Qubole Sparklens tool for performance tuning Apache Spark
PySpark test helper methods with beautiful error messages
iximiuz Labs control - start remote microVM playgrounds from the command line.
end-to-end data engineering project to get insights from PyPi using python, duckdb, MotherDuck & Evidence
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
A static site generator and markdown indexer inspired by Hugo and DEV, written in PHP
An attempt to answer the age old interview question "What happens when you type google.com into your browser and press enter?"
Next generation of automated data exploratory analysis and visualization platform.
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
A model set of guidelines for RESTful APIs and Events, created by Zalando
Set yourself dynamic writing goals for notes and folders to help you hit your long form writing targets with Obsidian.
Turns Data and AI algorithms into production-ready web applications in no time.
BacenSimulator is a docker image to simulate bacen, a official brazilian payment infrastructure
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…
Proof of concept of a big data cluster using open source tools
The 4CAT Capture and Analysis Toolkit provides modular data capture & analysis for a variety of social media platforms.