Stars
Setu is a comprehensive pipeline designed to clean, filter, and deduplicate diverse data sources including Web, PDF, and Speech data. Built on Apache Spark, Setu encompasses four key stages: docume…
A 4-hour coding workshop to understand how LLMs are implemented and used
The practical LLM guide: from the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices
A Comprehensive Toolkit for High-Quality PDF Content Extraction
A tool that facilitates easy, efficient, and high-quality fine-tuning of Cohere's models
Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
This repository offers a comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. It serves as an essen…
Speech To Speech: an effort toward an open-source and modular GPT-4o
Baichuan-Omni: Towards Capable Open-source Omni-modal LLM
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
This repository contains all the setup required to execute Trainium training jobs.
Build resilient language agents as graphs.
Generative AI with Large Language Models on Coursera, offered by DeepLearning.AI and AWS.
Learn about LLMs, LLMOps, and vector DBs for free by designing, training, and deploying a real-time financial advisor LLM system ~ source code + video & reading materials
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Practical course about Large Language Models.
Curated list of datasets and tools for post-training.
Learn for free how to build an end-to-end production-ready LLM & RAG system using LLMOps best practices ~ source code + 12 hands-on lessons
LLM Workshop at Data Hack Summit 2023
Due to LLaMA's license restrictions, we reimplement BLOOM-LoRA (the BLOOM license is much less restrictive: https://huggingface.co/spaces/bigscience/license) using Alpaca-LoRA and Alpaca_data_cleaned.json
Instruct-tune LLaMA on consumer hardware
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-Tuning) for easy use. We welcome open-source enthusiasts…
Text-Prompted Generative Audio Model