Skip to content
View TheodoreGalanos's full-sized avatar

Block or report TheodoreGalanos

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A golang-based data loader which can be used from Python. Focused on a VectorDB stack at the moment, fetching and processing data per sample at GB/s speeds.

Go 55 Updated Oct 10, 2024
Python 64 7 Updated Oct 11, 2024

[NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personali…

Python 1,299 110 Updated Jul 29, 2024
Python 18 4 Updated Jun 26, 2024

PyTorch code for hierarchical k-means -- a data curation method for self-supervised learning

Jupyter Notebook 125 8 Updated Jun 21, 2024

scalable and robust tree-based speculative decoding algorithm

Python 307 31 Updated Aug 13, 2024

FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes

Python 184 13 Updated Oct 6, 2024

Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app

Python 1,177 81 Updated Oct 12, 2024

Recursive Enriching Pterodactyl Tree Augmented Retrieval (REPTAR) is a system that uses a recursive summarization approach to generate thoughtful summaries of text data.

Python 1 Updated Apr 28, 2024

A Collection of Pydantic Models to Abstract IRL

Jupyter Notebook 15 Updated Apr 28, 2024

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 624 42 Updated Oct 10, 2024

Convert PDF to markdown quickly with high accuracy

Python 16,934 963 Updated Sep 7, 2024

DSPy: The framework for programming—not prompting—foundation models

Python 17,703 1,348 Updated Oct 12, 2024

An extensible benchmark for evaluating large language models on planning

PDDL 275 30 Updated May 21, 2024

Inference code for Persimmon-8B

Python 416 23 Updated Sep 9, 2023

Forward-Looking Active REtrieval-augmented generation (FLARE)

Python 579 51 Updated Nov 20, 2023
Jupyter Notebook 92 14 Updated Jul 26, 2023

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 4,909 372 Updated Aug 7, 2024

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Python 1,852 134 Updated Aug 25, 2024

(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

Python 265 27 Updated Apr 14, 2024

Curate better data for LLMs

Python 944 89 Updated Mar 19, 2024

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

Go 92,956 7,331 Updated Oct 12, 2024

Llama 2 Everywhere (L2E)

C 1,511 41 Updated Jul 24, 2024

Chat language model that can use tools and interpret the results

Python 1,382 107 Updated Oct 11, 2024

ReactJS library for "Cells, Generators, and Lenses": object-oriented UI components to compose LLM-powered writing interfaces that support iteration and exploration.

TypeScript 17 2 Updated Nov 10, 2023

Structured Text Generation

Python 8,554 431 Updated Oct 8, 2024

[ICLR 2023] Codebase for Copy-Generator model, including an implementation of kNN-LM

Python 182 22 Updated Jul 20, 2023

Datasets collection and preprocessings framework for NLP extreme multitask learning

Python 145 7 Updated Jul 10, 2024

🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation

Python 5,657 716 Updated Sep 19, 2024
Next