Skip to content
View mandee13's full-sized avatar

Block or report mandee13

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Transformer: PyTorch Implementation of "Attention Is All You Need"

Python 3,203 456 Updated Aug 6, 2024

A playbook for systematically maximizing the performance of deep learning models.

27,692 2,288 Updated Jun 18, 2024

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 7,043 790 Updated Aug 24, 2023

Code for the ICLR'22 paper "Improving Non-Autoregressive Translation Models Without Distillation"

Python 17 2 Updated Mar 11, 2022

ParaGen is a PyTorch deep learning framework for parallel sequence generation.

Python 186 23 Updated Nov 21, 2022

Token Drop mechanism for Neural Machine Translation

Python 5 Updated Mar 15, 2021

Unsupervised Word Segmentation for Neural Machine Translation and Text Generation

Python 2,211 464 Updated Aug 7, 2024

A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a partially masked target translation.

Python 243 39 Updated Sep 17, 2021

MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville

Java 13,000 2,730 Updated Oct 9, 2023
Python 870 107 Updated May 24, 2024

DisCo Transformer for Non-autoregressive MT

Python 78 9 Updated Jul 28, 2022

Code for Synchronous Bidirectional Neural Machine Translation (SB-NMT)

Python 64 17 Updated May 16, 2019

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Python 2,896 497 Updated Feb 14, 2023

Beam search for neural network sequence to sequence (encoder-decoder) models.

Python 34 17 Updated Apr 4, 2019

A general-purpose encoder-decoder framework for Tensorflow

Python 5,607 1,301 Updated Oct 15, 2020

A simple module consistently outperforms self-attention and Transformer model on main NMT datasets with SoTA performance.

Python 87 9 Updated Jul 24, 2023

GIZA++ is a statistical machine translation toolkit that is used to train IBM Models 1-5 and an HMM word alignment model. This package also contains the source for the mkcls tool which generates th…

C++ 265 82 Updated Mar 31, 2023

结巴中文分词

Python 33,547 6,726 Updated Aug 21, 2024

A PyTorch implementation of the 1d and 2d Sinusoidal positional encoding/embedding.

Jupyter Notebook 251 23 Updated Nov 17, 2020

This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, suc…

Jupyter Notebook 3,500 2,136 Updated Jun 14, 2021

MASS: Masked Sequence to Sequence Pre-training for Language Generation

Python 1,119 206 Updated Nov 28, 2022

TED parallel Corpora is growing collection of Bilingual parallel corpora, Multilingual parallel corpora and Monolingual corpora extracted from TED talks www.ted.com for 109 world languages.

246 81 Updated Jan 4, 2016

自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,bert+bilstm+crf),数据增强(text augment, data enhance),同义句同义词生成,句子…

Python 1,528 395 Updated Sep 23, 2021

Code inspired by Unsupervised Machine Translation Using Monolingual Corpora Only

Jupyter Notebook 50 9 Updated Jul 25, 2024

📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).

Python 100 22 Updated Aug 1, 2023

Command-line program to download videos from YouTube.com and other video sites

Python 133,390 10,136 Updated Dec 31, 2024

Stand-alone language identification system

Python 2,338 320 Updated Jan 1, 2020

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

Jupyter Notebook 2,020 206 Updated Jan 9, 2024

Transformers without Tears: Improving the Normalization of Self-Attention

Python 130 17 Updated May 29, 2024

An implementation of Performer, a linear attention-based transformer, in Pytorch

Python 1,107 145 Updated Feb 2, 2022
Next