Skip to content
View mmaguero's full-sized avatar
:octocat:
Focusing
:octocat:
Focusing

Block or report mmaguero

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

s1: Simple test-time scaling

Python 4,821 532 Updated Feb 10, 2025
Jupyter Notebook 3 Updated Jul 10, 2024

Repository containing data and baselines for the two 2024 AmericasNLP shared tasks.

JavaScript 6 4 Updated Apr 4, 2024

Compute Inter Annotator Agreement from Brat files

Python 2 1 Updated Nov 19, 2021
Python 3 1 Updated Feb 9, 2021

Library to download PubMed abstracts with metadata. Originally created to obtain the DrugProt (BioCreative VII) background set

Python 2 1 Updated Jan 4, 2022

Different useful snippets I create while I am a working at BSC

Python 1 Updated Apr 26, 2021
Jupyter Notebook 6 1 Updated Apr 13, 2023

Train transformer language models with reinforcement learning.

Python 11,383 1,525 Updated Feb 10, 2025

Transformer models from BERT to GPT-4, environments from Hugging Face to OpenAI. Fine-tuning, training, and prompt engineering examples. A bonus section with ChatGPT, GPT-3.5-turbo, GPT-4, and DALL…

Jupyter Notebook 847 324 Updated Jan 4, 2024

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,931 1,990 Updated Sep 26, 2024

BLOOM+1: Adapting BLOOM model to support a new unseen language

Python 70 15 Updated Mar 2, 2024

Finetuning InstructLLaMA with portuguese data

Jupyter Notebook 558 68 Updated Jun 6, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 39,208 6,386 Updated Dec 9, 2024

Multiple NER-tool's combined in one output. Incovating mutliple NER-engine's in parallel.

Python 6 1 Updated Aug 25, 2021

Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research

Jupyter Notebook 3,393 253 Updated Mar 15, 2024

Tool to convert CoNLL-U format files to CoNLL format files and manipulate training, validation and test sets.

Python 4 Updated Jul 6, 2023

My notes / works on deep learning from Coursera

Jupyter Notebook 445 362 Updated May 8, 2024

✨ Innovative and open-source visualization application that transforms various data formats, such as JSON, YAML, XML, CSV and more, into interactive graphs.

TypeScript 36,323 2,355 Updated Feb 7, 2025

🚀 State-of-the-art parsers for natural language.

Python 845 145 Updated Sep 3, 2023

Machine learning-based classifier that identifies sentences that contains evidence of social impact of research

Jupyter Notebook 1 Updated Dec 8, 2022

Biomedical Named Entity Recognition and Normalization of Diseases, Chemicals and Genenetic entity classes through the use of state-of-the-art models.

Jupyter Notebook 108 23 Updated Dec 17, 2021

Jojajovai Guarani-Spanish Parallel Corpus

12 Updated Jul 5, 2022

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,780 3,620 Updated Jul 28, 2024

python library for working with IIIF Image and Presentation APIs

Python 19 5 Updated Jan 27, 2025

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Python 4,433 754 Updated Nov 27, 2024

NLP, before and after spaCy

Python 2,215 250 Updated Sep 22, 2023

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

Python 1,517 247 Updated Nov 29, 2024

ROADMAP(Mind Map) and KEYWORD for students those who have interest in learning NLP

3,237 518 Updated Sep 29, 2019

📖 A curated list of resources dedicated to Natural Language Processing (NLP)

16,941 2,595 Updated Nov 13, 2023
Next