fandongmeng
  • Tencent WeChat AI
  • Beijing


DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

201 9 Updated Dec 31, 2024

Code for paper "Patch-Level Training for Large Language Models"

Python 78 3 Updated Nov 15, 2024
Python 21 Updated Sep 5, 2023
Python 5 Updated Aug 15, 2023

Code for "Teaching LM to Translate with Comparison"

Python 39 6 Updated Dec 15, 2023

[NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li, Peng Fu, Yanan Cao, Weiping Wang, Jie Zhou

Python 21 2 Updated Jan 9, 2024

Making large AI models cheaper, faster and more accessible

Python 39,042 4,363 Updated Feb 3, 2025

Code and Data for the ACL22 main conference paper "MSCTD: A Multimodal Sentiment Chat Translation Dataset"

Python 41 3 Updated Dec 25, 2024
Python 14 1 Updated Aug 6, 2022

EMNLP 2022: ClidSum: A Benchmark Dataset for Cross-Lingual Dialogue Summarization

Python 35 2 Updated Jan 13, 2024

A benchmark for the task of translation suggestion

Mask 59 25 Updated Jun 23, 2022

PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.

Python 755 58 Updated Apr 7, 2023
C++ 125 20 Updated Jul 6, 2021

《机器翻译:基础与模型》 (Machine Translation: Foundations and Models), by Xiao Tong (肖桐) and Zhu Jingbo (朱靖波)

TeX 2,747 762 Updated Sep 14, 2024

A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.) on CPU and GPU.

C++ 1,504 199 Updated Jun 12, 2023

Code for the paper: GCDT: A Global Context Enhanced Deep Transition Architecture for Sequence Labeling

PLSQL 68 15 Updated Sep 20, 2019

Source code for "Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation"

Python 1 Updated Jun 25, 2019