Stars
A curated and opinionated list of resources for Chief Technology Officers, with an emphasis on startups
OpenCore EFI for AMD Ryzen Hackintosh
Implementation of sbt's test interface for JUnit 5's Jupiter module
Compliance automation framework, focused on SOC2
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
A Scala API for Apache Beam and Google Cloud Dataflow.
Seeing Theory is a project designed and created by Daniel Kunin with support from Brown University's Royce Fellowship Program and National Science Foundation group STATS4STEM. The goal of the proje…
A topic-centric list of HQ open datasets.
REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models
A small project to show how to add lineage to Atlas when using Spark as an ETL tool
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
A design pattern to achieve (almost) zero-downtime deployments of Docker-based web services.
Housing loan risk assessment from its origination data
Ready-to-run Docker images containing Jupyter applications
Lightweight, modular, and extensible library for functional programming.
Loan-level analysis of Fannie Mae and Freddie Mac data
Deep Learning papers reading roadmap for anyone who is eager to learn this amazing tech!
Dockerized Apache Zeppelin with SQL Server and SQL Azure support
Step-by-step guide on how to create a GPG key on keybase.io, add it to a local GPG setup, and use it with Git and GitHub.
Examples for High Performance Spark
The best React-based framework with performance, scalability and security built in.
A library for time series analysis on Apache Spark
Advanced Java Redis client for thread-safe sync, async, and reactive usage. Supports Cluster, Sentinel, Pipelining, and codecs.