Skip to content
View marcospiau's full-sized avatar

Block or report marcospiau

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Python tool for converting files and office documents to Markdown.

Python 35,356 1,567 Updated Jan 16, 2025

A CLI tool for managing OpenAI batch processing jobs with ease.

Python 29 4 Updated Aug 25, 2024

Retrieval and Retrieval-augmented LLMs

Python 8,295 606 Updated Jan 22, 2025

Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering

Python 168 11 Updated Jun 6, 2021

EMNLP 2021 - Pre-training architectures for dense retrieval

Python 244 23 Updated Mar 18, 2022

Codebase for RetroMAE and beyond.

Python 246 19 Updated Jun 7, 2024

⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.

Python 571 73 Updated Apr 24, 2023

Late Interaction Models Training & Retrieval

Python 225 15 Updated Jan 21, 2025

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Python 2,004 171 Updated Dec 15, 2024
Python 3 Updated Jan 28, 2024

Tiny client for LLMs with vision and tool calling. As simple as it gets.

Python 80 8 Updated Dec 28, 2024

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 20,735 1,821 Updated Jan 22, 2025

Full text search in your Pandas dataframe

Python 213 6 Updated Dec 7, 2024
Python 8 1 Updated Nov 10, 2024

A simple, performant and scalable Jax LLM!

Python 1,590 310 Updated Jan 22, 2025

Library for reading and processing ML training data.

Python 365 27 Updated Jan 20, 2025
Python 17 7 Updated Oct 18, 2024
Python 1 Updated Sep 10, 2024

A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.

Python 1,250 71 Updated Jan 18, 2025

The prime repository for state-of-the-art Multilingual Question Answering research and development.

Python 732 57 Updated Jan 8, 2025

Run PyTorch models in the browser using ONNX.js

Python 375 46 Updated Apr 18, 2022

A pair of tiny foundational models trained in Brazilian Portuguese.🦙🦙

Python 30 5 Updated Jan 15, 2025

Code for experiments with transfer learning (on deep neural language models) between languages.

Jupyter Notebook 3 5 Updated Jan 22, 2025

Rax is a Learning-to-Rank library written in JAX.

Python 324 11 Updated Jan 3, 2025

🌠 Manage your shell commands.

Rust 5,241 135 Updated Jan 21, 2025

Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint

Python 371 24 Updated Mar 26, 2024

Minimalistic large language model 3D-parallelism training

Python 1,396 140 Updated Jan 17, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,163 164 Updated Jan 22, 2025
Python 46 6 Updated Feb 7, 2024
Next