- San Francisco
- https://www.linkedin.com/in/tao-f-17195814/
- @photoft45
Lists (1)
Sort Name ascending (A-Z)
Stars
Source code for Twitter's Recommendation Algorithm
Apache Spark - A unified analytics engine for large-scale data processing
💾 Database Tools incl. ORM, Migrations and Admin UI (Postgres, MySQL & MongoDB) [deprecated]
PredictionIO, a machine learning server for developers and ML engineers.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
A Scala API for Apache Beam and Google Cloud Dataflow.
🔍 Elasticsearch Scala Client - Reactive, Non Blocking, Type Safe, HTTP Client
A Scala port of the popular Python Requests HTTP client: flexible, intuitive, and straightforward to use.
simple combinator-based parsing for Scala. formerly part of the Scala standard library, now a separate community-maintained module
Neo4j Connector for Apache Spark, which provides bi-directional read/write access to Neo4j from Spark, using the Spark DataSource APIs
Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.
Extensible Rules Engine for custom Dataframe / Dataset validation
Custom state store providers for Apache Spark
A tool to get better debug info on spark's memory usage