- Little Red Dot
- https://leongkui.me
Stars
Source code for Twitter's Recommendation Algorithm
CMAK is a tool for managing Apache Kafka clusters
Removes large or troublesome blobs like git-filter-branch does, but faster, and written in Scala
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
The leader in Next-Generation Customer Data Infrastructure
Arnold Schwarzenegger based programming language
Apache OpenWhisk is an open source serverless cloud platform
A machine learning package built for humans.
Fault-tolerant job scheduler for Mesos which handles dependencies and ISO 8601-based schedules
Deploy and manage containers (including Docker) on top of Apache Mesos at scale.
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
A distributed, fault-tolerant graph database
Streaming MapReduce with Scalding and Storm
TextTeaser is an automatic summarization algorithm.
DataStax Connector for Apache Spark to Apache Cassandra
Base classes to use when writing tests with Spark
Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala
KillrWeather is a reference application (work in progress) showing how to easily integrate streaming and batch data processing with Apache Spark Streaming, Apache Cassandra, Apache Kafka and Akka f…
The software used to extract structured data from Wikipedia
[DEPRECATED] Tensorflow wrapper for DataFrames on Apache Spark
Simplifying robust end-to-end machine learning on Apache Spark.
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.