Stars
enchywastaken / videoprocessor
Forked from defl/videoprocessorVideo processing on live data.
Make any web page a desktop application
A curated and opinionated list of resources for Chief Technology Officers, with the emphasis on startups
OpenCore EFI for AMD Ryzen Hackintosh
A self-service password management tool for Active Directory
Implementation of sbt's test interface for JUnit 5's Jupiter module
Compliance automation framework, focused on SOC2
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
A Scala API for Apache Beam and Google Cloud Dataflow.
Normalizes nested JSON according to a schema
Seeing Theory is a project designed and created by Daniel Kunin with support from Brown University's Royce Fellowship Program and National Science Foundation group STATS4STEM. The goal of the proje…
A topic-centric list of HQ open datasets.
REST web service for the true real-time scoring (<1 ms) of Scikit-Learn, R and Apache Spark models
A small project to show how to add lineage to Atlas when using Spark as ETL tool
Compile-time tools for working with Avros in Scala
Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster
Framework for easy management of docker-based components across machines
A design pattern to achieve (almost) zero-downtime deployments of Docker-based web services.
Housing loan risk assessment from its origination data
David-Durst / TopNotch
Forked from blackrock/TopNotchA framework for systematically quality controlling big data.
rzilla / dplyr.spark.hive
Forked from piccolbo/dplyr.spark.hivespark and hive backends for dplyr
spark and hive backends for dplyr
Ready-to-run Docker images containing Jupyter applications
Lightweight, modular, and extensible library for functional programming.
Loan-level analysis of Fannie Mae and Freddie Mac data