nlp-datasets
Here are 154 public repositories matching this topic...
🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Paper.
-
Updated
Oct 1, 2024 - Python
This project is a sentiment analysis model built to classify IMDB movie reviews as either positive or negative using the **IMDB dataset**. It uses various machine learning models and deep learning techniques to handle the text data.
-
Updated
Sep 29, 2024 - Jupyter Notebook
All the Dataset realted to machine learning, Deep learning, NLP and Data Science will be uploaded Here.
-
Updated
Sep 28, 2024
Repository for the paper STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions (EMNLP 2024)
-
Updated
Sep 24, 2024 - Python
NLP projects, which I worked on utilising different natural language processing libraries's.
-
Updated
Sep 12, 2024 - Jupyter Notebook
An Algorithm that can generate conversation history dataset for your own custom LLM/ChatBot finetuning
-
Updated
Aug 9, 2024 - Jupyter Notebook
Official repository for "Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning".
-
Updated
Aug 8, 2024 - Jupyter Notebook
This project applies BERT for emotion detection and sentiment analysis, utilizing a dataset of annotated documents to classify various emotions from text. The main.ipynb file contains the complete code, outputs, and results.
-
Updated
Aug 4, 2024 - Jupyter Notebook
🌍 Leaderboard Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL2024
-
Updated
Jul 27, 2024
A collection of datasets for Ukrainian language
-
Updated
Jul 25, 2024 - Python
A list of Romanian NLP Datasets
-
Updated
Jul 21, 2024
In today's digital age, misinformation spreads rapidly, significantly impacting public perception and decision-making. This project employs word embeddings using spaCy to effectively distinguish between fake and real news, enhancing the accuracy of information verification and contributing to the fight against misinformation. 📰🔍💡
-
Updated
Jun 29, 2024 - Jupyter Notebook
The SMS Spam Collection v.1 📱 is a curated dataset consisting of 5,574 SMS messages in English, meticulously categorized as either legitimate (ham) or spam. This corpus serves as a valuable resource for research in SMS spam detection and filtering.🔍💬
-
Updated
Jun 26, 2024 - Python
Implementation of DFMR for Multimodal Sentiment Analysis in Malayalam (Native Indian Dravida Language)
-
Updated
Jun 10, 2024 - Jupyter Notebook
Data preprocessing and training on Drug Review Dataset using Hugging Face library
-
Updated
May 19, 2024 - Jupyter Notebook
This repo contains everything about transformers and NLP.
-
Updated
May 18, 2024 - Python
Hub for the Portuguese language NLP Resources
-
Updated
Apr 18, 2024 - PHP
A dataset of Moin Persian 🇮🇷 dictionary 📖 words.
-
Updated
Apr 5, 2024
Improve this page
Add a description, image, and links to the nlp-datasets topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the nlp-datasets topic, visit your repo's landing page and select "manage topics."