Starred repositories
The Prometheus monitoring system and time series database.
Perforator is a cluster-wide continuous profiling tool designed for large data centers
list of papers, code, and other resources
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.
Large silver standart Russian corpus with NER, morphology and syntax markup
List of Top 500 SQL Interview Questions & Answers with queries and more
ruptures: change point detection in Python
SKAB - Skoltech Anomaly Benchmark. Time-series data for evaluating Anomaly Detection algorithms.
PlayStation 4 emulator for Windows, Linux and macOS written in C++
Easiest and laziest way for building multi-agent LLMs applications.
A curated list of awesome Go frameworks, libraries and software
Streamlit — A faster way to build and share data apps.
Data, Benchmarks, and methods submitted to the M4 forecasting competition
An opinionated list of awesome Python frameworks, libraries, software and resources.
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports comp…
ClickHouse® is a real-time analytics database management system
An extremely fast Python linter and code formatter, written in Rust.
PyScript is an open source platform for Python in the browser. Try PyScript: https://pyscript.com Examples: https://tinyurl.com/pyscript-examples Community: https://discord.gg/HxvBtukrg2
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
Коллекция готовых SQL запросов для PostgreSQL по часто возникающим задачам (получение и модификация данных, ускорение запросов, обслуживание БД)
This is a repo with links to everything you'd ever want to learn about data engineering
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
TensorFlow code and pre-trained models for BERT
Visualize Different Text Splitting Methods
CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.