Stars
Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.
A data preprocessor for the Quranic Treebank using neural networks. Divides longer verses into smaller chunks.
Hey 👋, Glad to see you here! Check out this repository to learn more about me 🤓. You can also use it to make your awesome GitHub README ✨ (Don't Just Fork, Star Too 😅)
[NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
Transcribe text and generate SRT and VTT files using Whisper models and wit.ai.
Maha is a text processing library specially developed to deal with Arabic text.
Several deep learning models for restoring Arabic diacritics using PyTorch.
Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
This directory gathers the tools developed by the Data Sourcing Working Group
Toolkit for creating, sharing and using natural language prompts.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A set of functionalities that enable Arabic website developers to provide professional search and to present and process Arabic content in PHP
End-to-end Arabic TTS system based on Tacotron
Repo for reproducing ALUE benchmark baselines
Magenta: Music and Art Generation with Machine Intelligence
Source code for ECCV20 "GANwriting: Content-Conditioned Generation of Styled Handwritten Word Images"
[CVPR2022 oral] A Simple and Effective Baseline for Text-to-Image Synthesis
Text to image synthesis using thought vectors
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch
As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes