praveenr019

Praveen R praveenr019

72 followers · 410 following

Hyderabad
@praveenr019

Achievements

Organizations

Lists (2)

Sort

Data

1 repository

🚀 My stack

1 repository

Starred repositories

31 stars written in Scala

Clear filter

apache / spark

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,261 28,404 Updated Dec 27, 2024

scala / scala

Scala 2 compiler and standard library. Scala 2 bugs at https://github.com/scala/bug; Scala 3 at https://github.com/scala/scala3

Scala 14,362 3,116 Updated Dec 22, 2024

akka / akka

A platform to build and run apps that are elastic, agile, and resilient. SDK, libraries, and hosted environments.

Scala 13,072 3,589 Updated Dec 6, 2024

apache / predictionio

PredictionIO, a machine learning server for developers and ML engineers.

Scala 12,541 1,927 Updated Jan 9, 2021

gitbucket / gitbucket

A Git platform powered by Scala with easy installation, high extensibility & GitHub API compatibility

Scala 9,184 1,251 Updated Dec 25, 2024

twitter / finagle

A fault tolerant, protocol-agnostic RPC system

Scala 8,793 1,450 Updated Dec 18, 2024

delta-io / delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,701 1,735 Updated Dec 21, 2024

sbt / sbt

sbt, the interactive build tool

Scala 4,815 937 Updated Dec 23, 2024

twitter / scalding

A Scala API for Cascading

Scala 3,505 707 Updated May 28, 2023

awslabs / deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,336 542 Updated Dec 18, 2024

databricks / Spark-The-Definitive-Guide

Spark: The Definitive Guide's Code Repository

Scala 2,901 2,783 Updated Aug 26, 2020

spark-jobserver / spark-jobserver

REST job server for Apache Spark

Scala 2,839 994 Updated Dec 26, 2024

scala-exercises / scala-exercises

The easy way to learn Scala.

Scala 2,634 543 Updated May 16, 2023

twitter / algebird

Abstract Algebra for Scala

Scala 2,290 346 Updated Aug 19, 2024

twitter / summingbird

Streaming MapReduce with Scalding and Storm

Scala 2,134 265 Updated Jan 19, 2022

foundweekends / giter8

a command line tool to apply templates defined on GitHub

Scala 1,743 223 Updated Dec 26, 2024

mesos / spark

Lightning-fast cluster computing in Java, Scala and Python.

Scala 1,425 385 Updated Apr 8, 2014

apache / incubator-gluten

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,236 447 Updated Dec 27, 2024

TIBCOSoftware / snappydata

Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in one cluster

Scala 1,041 200 Updated Nov 21, 2022

graphframes / graphframes

Scala 1,012 240 Updated Dec 27, 2024

amplab / shark

Development in Shark has been ended.

Scala 992 327 Updated Aug 11, 2015

RedisLabs / spark-redis

A connector for Spark that allows reading and writing to/from Redis cluster

Scala 942 371 Updated Oct 22, 2024

NVIDIA / spark-rapids

Spark RAPIDS plugin - accelerate Apache Spark with GPUs

Scala 841 239 Updated Dec 25, 2024

sameeragarwal / blinkdb

BlinkDB: Sub-Second Approximate Queries on Very Large Data.

Scala 660 123 Updated Feb 6, 2014

databricks / spark-redshift

Redshift data source for Apache Spark

Scala 605 349 Updated Aug 10, 2023

G-Research / spark-extension

A library that provides useful extensions to Apache Spark and PySpark.

Scala 203 27 Updated Nov 30, 2024

snowplow-archive / spark-example-project

A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR

Scala 118 62 Updated Mar 28, 2016

echen / scaldingale

Movie recommendations and more in MapReduce and Scalding

Scala 117 25 Updated Feb 11, 2013

rockthejvm / spark-performance-tuning

The official repository for the Rock the JVM Spark Optimization 2 course

Scala 38 42 Updated Dec 4, 2023

kayousterhout / spark

Forked from mesos/spark

Scala framework for iterative and interactive cluster computing.

Scala 13 10 Updated Sep 23, 2013

Praveen R praveenr019

Organizations

Lists (2)

Data

🚀 My stack

Starred repositories

big-data

Awesome Lists

Firebase

Flutter

Java

Apache Spark

hadoop