Skip to content
View usmanxia's full-sized avatar
  • NUST, Pakistan
  • Pakistan

Block or report usmanxia

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Setu is a comprehensive pipeline designed to clean, filter, and deduplicate diverse data sources including Web, PDF, and Speech data. Built on Apache Spark, Setu encompasses four key stages: documeโ€ฆ

HTML 14 Updated May 17, 2024

A 4-hour coding workshop to understand how LLMs are implemented and used

Jupyter Notebook 873 291 Updated Jan 13, 2025

The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices

Python 2,766 544 Updated Jan 10, 2025
Jupyter Notebook 8,124 581 Updated Jun 16, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,848 459 Updated Jan 3, 2025

A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models

Python 67 5 Updated Feb 4, 2025

Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]

TypeScript 79 11 Updated Jan 18, 2025

This repository offers a comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. It serves as an essenโ€ฆ

Jupyter Notebook 2,926 302 Updated Feb 26, 2025

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 3,781 412 Updated Dec 4, 2024

Baichuan-Omni: Towards Capable Open-source Omni-modal LLM ๐ŸŒŠ

263 7 Updated Jan 27, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contโ€ฆ

Jupyter Notebook 12,585 1,290 Updated Feb 24, 2025

The repository contains all the set-up required to execute trainium training jobs.

Python 4 2 Updated Feb 11, 2025

Build resilient language agents as graphs.

Python 9,471 1,560 Updated Feb 27, 2025

Generative AI with Large Language Models on Coursera offered by Deeplearning.AI and AWS.

Jupyter Notebook 44 47 Updated Jul 22, 2023

๐Ÿฆ– ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป about ๐—Ÿ๐—Ÿ๐— ๐˜€, ๐—Ÿ๐—Ÿ๐— ๐—ข๐—ฝ๐˜€, and ๐˜ƒ๐—ฒ๐—ฐ๐˜๐—ผ๐—ฟ ๐——๐—•๐˜€ for free by designing, training, and deploying a real-time financial advisor LLM system ~ ๐˜ด๐˜ฐ๐˜ถ๐˜ณ๐˜ค๐˜ฆ ๐˜ค๐˜ฐ๐˜ฅ๐˜ฆ + ๐˜ท๐˜ช๐˜ฅ๐˜ฆ๐˜ฐ & ๐˜ณ๐˜ฆ๐˜ข๐˜ฅ๐˜ช๐˜ฏ๐˜จ ๐˜ฎ๐˜ข๐˜ต๐˜ฆ๐˜ณ๐˜ช๐˜ข๐˜ญ๐˜ด

Jupyter Notebook 3,204 521 Updated Dec 9, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 47,146 5,007 Updated Jan 22, 2025

Practical course about Large Language Models.

Jupyter Notebook 1,506 387 Updated Feb 21, 2025

Curated list of datasets and tools for post-training.

2,742 235 Updated Jan 29, 2025

The official Meta Llama 3 GitHub site

Python 28,398 3,294 Updated Jan 26, 2025

๐Ÿฅท Run AI-agents with an API

TypeScript 5,636 860 Updated Oct 20, 2024

๐Ÿค– ๐—Ÿ๐—ฒ๐—ฎ๐—ฟ๐—ป for ๐—ณ๐—ฟ๐—ฒ๐—ฒ how to ๐—ฏ๐˜‚๐—ถ๐—น๐—ฑ an end-to-end ๐—ฝ๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐˜๐—ถ๐—ผ๐—ป-๐—ฟ๐—ฒ๐—ฎ๐—ฑ๐˜† ๐—Ÿ๐—Ÿ๐—  & ๐—ฅ๐—”๐—š ๐˜€๐˜†๐˜€๐˜๐—ฒ๐—บ using ๐—Ÿ๐—Ÿ๐— ๐—ข๐—ฝ๐˜€ best practices: ~ ๐˜ด๐˜ฐ๐˜ถ๐˜ณ๐˜ค๐˜ฆ ๐˜ค๐˜ฐ๐˜ฅ๐˜ฆ + 12 ๐˜ฉ๐˜ข๐˜ฏ๐˜ฅ๐˜ด-๐˜ฐ๐˜ฏ ๐˜ญ๐˜ฆ๐˜ด๐˜ด๐˜ฐ๐˜ฏ๐˜ด

Python 3,629 596 Updated Dec 26, 2024

LLM Workshop at Data Hack Summit 2023

Jupyter Notebook 13 6 Updated Aug 4, 2023

Inference Llama 2 in one file of pure C

C 18,085 2,199 Updated Aug 6, 2024

llama fine-tuning with lora

Python 140 14 Updated May 8, 2024

Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigscience/license) using Alpaca-LoRA and Alpaca_data_cleaned.json

Jupyter Notebook 185 39 Updated Jun 18, 2023

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,813 2,223 Updated Jul 29, 2024

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiastsโ€ฆ

Jupyter Notebook 2,695 251 Updated Dec 12, 2023

๐Ÿ”Š Text-Prompted Generative Audio Model

Jupyter Notebook 37,072 4,372 Updated Aug 19, 2024
Next