Lists (3)
Sort Name ascending (A-Z)
Starred repositories
[ICLR 2025] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
Rule Snippet & Rule Set for Surge / Mihomo (Clash.Meta) / sing-box
Freeze variations and features in font.
Testing Language Models for Memorization of Tabular Datasets.
Touying is a powerful package for creating presentation slides in Typst.
Uncomplicated Observability for Python and beyond! 🪵🔥
BigCodeBench: Benchmarking Code Generation Towards AGI
[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?
Codebase of ACL2024 paper "Spiral of Silence: How is Large Language Model Killing Information Retrieval?—A Case Study on Open Domain Question Answering"
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
A Comprehensive Benchmark for Software Development.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?
SGLang is a fast serving framework for large language models and vision language models.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
aider is AI pair programming in your terminal
An End-to-End Evaluation Framework for Entity Resolution Systems
A lightweight window border system for macOS
A simple framework for creating typed OpenAI functions from Python functions
Hysteria is a powerful, lightning fast and censorship resistant proxy.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.