A collaborative note-taking, wiki, and documentation platform that scales. Built with Django and React. Open-source alternative to Notion or Outline.

Python 11,717 276 Updated Apr 23, 2025

Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!

Jupyter Notebook 20 3 Updated Apr 8, 2025

Lawma: A lightly fine-tuned Llama model for legal classification tasks.

Jupyter Notebook 18 Updated Sep 14, 2024

Code to reproduce the experiments in the paper Training on the Test Task Confounds Evaluation and Emergence.

Jupyter Notebook 9 1 Updated Dec 3, 2024

BenchBench is a Python package to evaluate multi-task benchmarks.

Python 15 1 Updated Jul 18, 2024

The official Meta Llama 3 GitHub site

Python 28,639 3,366 Updated Jan 26, 2025

Code to reproduce the paper "Do causal predictors generalize better to new domains?"

Python 9 3 Updated Feb 7, 2025

The accompanying code of "What Makes ImageNet Look Unlike LAION."

Jupyter Notebook 10 2 Updated Dec 15, 2024

Code to reproduce the paper "Questioning the Survey Responses of Large Language Models"

Jupyter Notebook 7 1 Updated Dec 8, 2024

Achieve error-rate fairness between societal groups for any score-based classifier.

Python 17 4 Updated Apr 26, 2024

Test-time-training on nearest neighbors for large language models

Python 40 5 Updated Apr 18, 2024

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 15,280 3,691 Updated Apr 23, 2025

Compute the inverse of a matrix using the Gauss-Jordan method.

C++ 3 Updated Jul 6, 2024

Retro programming in Borland C++ 3.1

C 55 6 Updated Jul 22, 2019

Datasets derived from US census data

Python 258 21 Updated May 15, 2024

This repository provides example code for loading and analyzing data from AHRQ's Medical Expenditure Panel Survey (MEPS). More information about the survey and access to public use data files is av…

SAS 171 73 Updated Apr 10, 2025

Replication materials for "Measuring the predictability of life outcomes using a scientific mass collaboration"

R 3 2 Updated Nov 9, 2021

Jupyter Notebook 31 5 Updated Jan 13, 2022

A Python package to assess and improve fairness of machine learning models.

Python 2,053 452 Updated Apr 13, 2025

Package for typesetting a book into PDF and HTML using pandoc and a bunch of other tools

JavaScript 15 7 Updated Jul 21, 2020

Causal estimators for use with WhyNot

Python 11 1 Updated Mar 20, 2020

A Python sandbox for decision making in dynamics

Python 422 43 Updated Aug 21, 2023

Differentially private synthetic data

Julia 46 18 Updated Dec 18, 2020

Compile Markdown into an HTML and PDF book based on pandoc.

Python 189 21 Updated Sep 10, 2021

A work-in-progress, open-source, multi-player city simulation game.

Rust 7,834 336 Updated Jan 7, 2023

signal-cli provides an unofficial command-line, JSON-RPC, and D-Bus interface for the Signal messenger.

Java 3,466 315 Updated Apr 9, 2025

Starter files for using Pandoc Markdown with Tufte CSS

CSS 327 33 Updated Jul 5, 2022

Universal markup converter

Haskell 37,031 3,492 Updated Apr 23, 2025

A tool that translates augmented Markdown into HTML or LaTeX

Java 466 32 Updated Jun 19, 2022