Stars
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
The easiest way to get started with LlamaIndex
AdalFlow: The library to build & auto-optimize LLM applications.
lalanamika / SriLankaChapter_RegulatoryDecisionMaking
Forked from OmdenaAI/SriLankaChapter_RegulatoryDecisionMakingData gathering for "University of Ruhuna, Sri Lanka Chapter" - Enhancing Regulatory Decision-Making through a Retrieval-Augmented Generation (RAG) Based Large Language Model
Generate a comprehensive review from an arXiv paper, then turn it into a blog post. This project powers the website below for the HuggingFace's Daily Papers (https://huggingface.co/papers).
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Developer APIs to Accelerate LLM Projects
"University of Ruhuna, Sri Lanka Chapter" - Enhancing Regulatory Decision-Making through a Retrieval-Augmented Generation (RAG) Based Large Language Model
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Collection of articles listing reasons why data science projects fail.
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
An end-to-end implementation of intent prediction with Metaflow and other cool tools
Template for a data science project
A curated list of roadmaps covering different skills data science & AI careers and skills
Writing clean and optimized Python code
An ongoing list of pandas quirks
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)
A booklet on machine learning systems design with exercises. NOT the repo for the book "Designing Machine Learning Systems"
A simple Python function tool to convert DICOM files into jpg/png/bmp/tiff files and numpy.ndarray
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Free Data Engineering course!
A collection of libraries to optimise AI model performances
Machine Learning Model and Deployment for Classification of Mango Varieties
A starter notebook for the Kitchenware classification competition on Kaggle
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing d…