Skip to content
View tdoehmen's full-sized avatar

Highlights

  • Pro

Organizations

@duckdb

Block or report tdoehmen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DuckDB-powered Postgres for high performance apps & analytics.

C++ 2,102 94 Updated Mar 21, 2025

A blazing fast AI Gateway with integrated guardrails. Route to 200+ LLMs, 50+ AI Guardrails with 1 fast & friendly API.

TypeScript 7,417 549 Updated Mar 24, 2025

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 3,900 303 Updated Mar 18, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 104,136 16,846 Updated Mar 24, 2025
JavaScript 562 39 Updated Oct 24, 2023

Simple DDL Parser to parse SQL (HQL, TSQL, AWS Redshift, BigQuery, Snowflake and other dialects) ddl files to json/python dict with full information about columns: types, defaults, primary keys, et…

Python 194 42 Updated Oct 4, 2024

Fast, high-quality forecasts on relational and multivariate time-series data powered by new feature learning algorithms and automated ML.

Jupyter Notebook 112 13 Updated Jan 9, 2025

Python SQL Parser and Transpiler

Python 7,363 808 Updated Mar 22, 2025

Dora is an experiment management framework. It expresses grid searches as pure python files as part of your repo. It identifies experiments with a unique hash signature. Scale up to hundreds of exp…

Python 282 22 Updated Oct 5, 2023

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,738 1,146 Updated Apr 24, 2024

Data Sketches for Apache Spark

Scala 22 4 Updated Dec 22, 2022

Extensible Rules Engine for custom Dataframe / Dataset validation

Scala 134 30 Updated May 7, 2024

PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf

Python 2,725 496 Updated Oct 23, 2024

An Industrial Graph Neural Network Framework

C++ 1,296 266 Updated Jul 1, 2024

Learn2Clean: Optimizing the Sequence of Tasks for Data Preparation and Cleaning

Python 51 19 Updated Dec 26, 2022
Python 8 1 Updated Sep 24, 2020
14 1 Updated Mar 13, 2021

A collection of demos showcasing automated feature engineering and machine learning in diverse use cases

Jupyter Notebook 500 170 Updated Aug 7, 2023

A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.

Python 504 47 Updated Nov 24, 2024

Featuretools' DFS as a scikit-learn transformer

Python 11 8 Updated Mar 24, 2022

Monitor the stability of a Pandas or Spark dataframe ⚙︎

Python 499 36 Updated Jan 24, 2025

Propositionalization and embeddings.

Python 2 1 Updated Jul 25, 2024

That is the code corresponding to paper "Towards Automatic Complex Feature Engineering" in WISE

Python 4 2 Updated Sep 19, 2018

Feature engineering package with sklearn like functionality

Python 2,016 321 Updated Mar 20, 2025

Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with the Python Database API Specification 2.0.

C++ 628 85 Updated Mar 4, 2025

Linear Prediction Model with Automated Feature Engineering and Selection Capabilities

Python 509 62 Updated Mar 23, 2025
Jupyter Notebook 29 7 Updated Nov 10, 2021

Koalas: pandas API on Apache Spark

Python 3,351 361 Updated Mar 20, 2024

DuckDB is an analytical in-process SQL database management system

C++ 27,925 2,174 Updated Mar 24, 2025