Stars
Machine learning metrics for distributed, scalable PyTorch applications.
Paper list of misinformation research using (multi-modal) large language models, i.e., (M)LLMs.
Documentation and source code powering Twitter's Community Notes
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
A python module to repair invalid JSON, commonly used to parse the output of LLMs
📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more
Implementation of X/Twitter v1, v2, and GraphQL APIs
Twitter API Scraper | Without an API key | Twitter Internal API | Free | Twitter scraper | Twitter Bot
Replications data and code for "LaLonde (1986) after Nearly Four Decades: Lessons Learned"
Terminal-based CPU stress and monitoring utility
This is a Python tool to employ stratified sampling or treatment randomization with uneven numbers in some strata using pandas. Mainly thought with RCTs in mind, it also works for any other scenari…
Python implementation of an extension of the Kolmogorov-Smirnov test for multivariate samples
Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
Code base for "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature".
A Python module to bypass Cloudflare's anti-bot page.