Skip to content
View praveenr019's full-sized avatar

Organizations

@datafarer

Block or report praveenr019

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

31 stars written in Scala
Clear filter

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,261 28,404 Updated Dec 27, 2024

Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3

Scala 14,362 3,116 Updated Dec 22, 2024

A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.

Scala 13,072 3,589 Updated Dec 6, 2024

PredictionIO, a machine learning server for developers and ML engineers.

Scala 12,541 1,927 Updated Jan 9, 2021

A Git platform powered by Scala with easy installation, high extensibility & GitHub API compatibility

Scala 9,184 1,251 Updated Dec 25, 2024

A fault tolerant, protocol-agnostic RPC system

Scala 8,793 1,450 Updated Dec 18, 2024

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,701 1,735 Updated Dec 21, 2024

sbt, the interactive build tool

Scala 4,815 937 Updated Dec 23, 2024

A Scala API for Cascading

Scala 3,505 707 Updated May 28, 2023

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,336 542 Updated Dec 18, 2024

Spark: The Definitive Guide's Code Repository

Scala 2,901 2,783 Updated Aug 26, 2020

REST job server for Apache Spark

Scala 2,839 994 Updated Dec 26, 2024

The easy way to learn Scala.

Scala 2,634 543 Updated May 16, 2023

Abstract Algebra for Scala

Scala 2,290 346 Updated Aug 19, 2024

Streaming MapReduce with Scalding and Storm

Scala 2,134 265 Updated Jan 19, 2022

a command line tool to apply templates defined on GitHub

Scala 1,743 223 Updated Dec 26, 2024

Lightning-fast cluster computing in Java, Scala and Python.

Scala 1,425 385 Updated Apr 8, 2014

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,236 447 Updated Dec 27, 2024

Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster

Scala 1,041 200 Updated Nov 21, 2022

Development in Shark has been ended.

Scala 992 327 Updated Aug 11, 2015

A connector for Spark that allows reading and writing to/from Redis cluster

Scala 942 371 Updated Oct 22, 2024

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Scala 841 239 Updated Dec 25, 2024

BlinkDB: Sub-Second Approximate Queries on Very Large Data.

Scala 660 123 Updated Feb 6, 2014

Redshift data source for Apache Spark

Scala 605 349 Updated Aug 10, 2023

A library that provides useful extensions to Apache Spark and PySpark.

Scala 203 27 Updated Nov 30, 2024

A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR

Scala 118 62 Updated Mar 28, 2016

Movie recommendations and more in MapReduce and Scalding

Scala 117 25 Updated Feb 11, 2013

The official repository for the Rock the JVM Spark Optimization 2 course

Scala 38 42 Updated Dec 4, 2023

Scala framework for iterative and interactive cluster computing.

Scala 13 10 Updated Sep 23, 2013
Next