  • Tasmania, Australia

Highlights

  • Pro

Organizations

@rdytech


Learning embeddings for classification, retrieval and ranking.

C++ 3,951 528 Updated Dec 4, 2022

Python tool for converting files and office documents to Markdown.

Python 35,765 1,591 Updated Jan 24, 2025

This is a text-based game engine that implements the D&D 5th edition ruleset. A sample adventure is included in this repository.

Ruby 76 4 Updated Nov 3, 2023

The LLM Evaluation Framework

Python 4,546 381 Updated Jan 27, 2025

Jupyter Notebook 343 52 Updated Jan 7, 2024

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

Python 1,748 122 Updated Jan 22, 2025

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,487 432 Updated Jan 3, 2025

Lightweight library for scraping web-sites with LLMs

Python 1,001 61 Updated Jan 27, 2025

Data processing with ML, LLM and Vision LLM

Python 4,139 403 Updated Jan 24, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️ 🍸 🍹 🍷

Python 3,501 197 Updated Jan 27, 2025

Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]

Python 16,682 1,977 Updated Jan 28, 2025

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,772 186 Updated Nov 14, 2024

An open source multi-tool for exploring and publishing data

Python 9,763 702 Updated Jan 16, 2025

Pulumi - Infrastructure as Code in any programming language 🚀

Go 22,313 1,151 Updated Jan 27, 2025

Shared data types for building collaborative software

JavaScript 18,053 639 Updated Jan 17, 2025

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 15,948 1,022 Updated Jan 27, 2025

HTML to Markdown converter and crawler.

TypeScript 510 33 Updated Jan 9, 2024

🦙 Integrating LLMs into structured NLP pipelines

Python 1,174 92 Updated Jan 8, 2025

Curate better data for LLMs

Python 1,003 96 Updated Mar 19, 2024

Automatically Discovering Fast Parallelization Strategies for Distributed Deep Neural Network Training

C++ 1,750 234 Updated Jan 28, 2025

LLM inference in C/C++

C++ 71,642 10,358 Updated Jan 27, 2025

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, …

Python 410 41 Updated Jul 11, 2023

A playbook for systematically maximizing the performance of deep learning models.

27,894 2,300 Updated Jun 18, 2024

🧙 Valtio makes proxy-state simple for React and Vanilla

TypeScript 9,315 265 Updated Jan 26, 2025

ruptures: change point detection in Python

Python 1,688 161 Updated Jan 7, 2025

A list of awesome compiler projects and papers for tensor computation and deep learning.

2,472 307 Updated Oct 19, 2024

A universal package of scraper scripts for humans

Python 310 20 Updated May 22, 2022

Tiny, no-nonsense, self-contained, Tensorflow and ONNX inference

Rust 2,303 217 Updated Jan 27, 2025

Run ONNX and TensorFlow inference in the browser.

Rust 75 7 Updated Jan 20, 2023

Fixes mojibake and other glitches in Unicode text, after the fact.

Python 3,848 120 Updated Oct 30, 2024