mateiz

Highlights

  • Pro

Organizations

@mesos @radlab

Large World Model -- Modeling Text and Video with Millions Context

Python 7,188 555 Updated Oct 19, 2024

A Bulletproof Way to Generate Structured JSON from Language Models

Jupyter Notebook 4,499 159 Updated Feb 24, 2024

A hyper-fast local vector database for use with LLM Agents. Now accepting SAFEs at $135M cap.

Python 1,382 85 Updated Aug 29, 2024

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,819 1,160 Updated Jun 30, 2023

Python 1,233 176 Updated Nov 20, 2024

DSPy: The framework for programming—not prompting—language models

Python 20,505 1,546 Updated Dec 27, 2024

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,701 1,735 Updated Dec 21, 2024

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Java 968 121 Updated Dec 26, 2024

Sample base images for Databricks Container Services

Dockerfile 169 117 Updated Nov 13, 2024

An open protocol for secure data sharing

Scala 785 172 Updated Dec 20, 2024

Offload IoT computation to local hardware while justifying any network accesses.

Rust 7 2 Updated May 31, 2023

A native Rust library for Delta Lake, with bindings into Python

Rust 2,405 417 Updated Dec 26, 2024

API for manipulating time series on top of Apache Spark: lagged time values, rolling statistics (mean, avg, sum, count, etc), AS OF joins, downsampling, and interpolation

Jupyter Notebook 316 53 Updated Dec 15, 2024

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,147 395 Updated Nov 18, 2024

The library for web and native user interfaces.

JavaScript 230,784 47,230 Updated Dec 26, 2024

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Python 26,664 4,396 Updated Dec 27, 2024

Joblib Apache Spark Backend

Python 243 26 Updated Aug 14, 2024

The Tensor Algebra SuperOptimizer for Deep Learning

C++ 694 91 Updated Jan 26, 2023

Python 384 117 Updated Nov 4, 2022

An open-source toolkit for large-scale genomic analysis

Scala 272 109 Updated Dec 22, 2024

Puffer is a free live TV streaming website and a research study at Stanford using machine learning to improve video streaming

C++ 847 132 Updated Aug 4, 2024

Koalas: pandas API on Apache Spark

Python 3,347 358 Updated Mar 20, 2024

A Python-embedded modeling language for convex optimization problems.

C++ 5,531 1,072 Updated Dec 23, 2024

The Legion Parallel Programming System

C++ 696 145 Updated Dec 19, 2024

GoCD plugins to work with MLFlow as model repository in a CD flow

Java 30 5 Updated Nov 1, 2023

MLflow App Library

Python 76 37 Updated Dec 25, 2018

Intellij Jsonnet Plugin

Java 88 17 Updated Mar 9, 2024

Open source platform for the machine learning lifecycle

Python 19,093 4,292 Updated Dec 26, 2024

The "Command Line Interactive Controller for Kubernetes"

Rust 1,496 84 Updated Jan 8, 2024

Accelerating network inference over video

Python 436 122 Updated Mar 6, 2020