Skip to content
View marvinzh's full-sized avatar
  • Tokyo Institute of Technology
  • Tokyo

Highlights

  • Pro

Block or report marvinzh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

maximal update parametrization (µP)

Jupyter Notebook 1,427 94 Updated Jul 17, 2024

Ongoing research training transformer models at scale

Python 11,092 2,477 Updated Jan 14, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,231 4,192 Updated Jan 14, 2025

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,637 1,881 Updated Apr 30, 2024

爬取关注列表中微博账号的微博

Python 185 52 Updated May 21, 2024

Rasa-Doctor-Friende.A chinese medical chatbot based on Neo4j knowledge graph and Rasa.

Python 279 102 Updated Dec 27, 2022

A platform for building proxies to bypass network restrictions.

Go 45,654 8,953 Updated Nov 15, 2024

常用中国网站白名单,纯列表,用于 SwitchyOmega,控制不走代理的网站。

457 68 Updated Nov 30, 2024

自己使用的白名单pac文件,不定时更新常见域名

JavaScript 352 111 Updated Feb 21, 2024

gfw_whitelist

JavaScript 3,132 746 Updated Feb 1, 2021

Chinese, English NER, English-Chinese machine translation dataset. 中英文实体识别数据集,中英文机器翻译数据集, 中文分词数据集

Python 360 76 Updated Feb 3, 2021

Collections of Chinese NLP corpus

Python 886 209 Updated Dec 28, 2020

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 20,800 2,679 Updated Aug 15, 2024

Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

Python 6,919 2,904 Updated Oct 21, 2024

KdConv: A Chinese Multi-domain Dialogue Dataset Towards Multi-turn Knowledge-driven Conversation

Python 469 61 Updated May 8, 2023

A complete computer science study plan to become a software engineer.

310,145 77,591 Updated Dec 5, 2024

刷算法全靠套路,认准 labuladong 就够了!English version supported! Crack LeetCode, not only how, but also why.

Markdown 126,507 23,286 Updated Sep 22, 2024

Dialogue model that produces empathetic responses when trained on the EmpatheticDialogues dataset.

Python 464 64 Updated Dec 3, 2021

Multi-turn dialogue baselines written in PyTorch

Python 162 23 Updated Mar 10, 2020

我的 OI 课件

518 95 Updated Aug 31, 2020
Python 26 3 Updated Jan 27, 2018

HRED VHRED VHCR for Multi-Turn Dialogue Systems

Python 42 7 Updated Dec 16, 2019

Google Research

Jupyter Notebook 34,678 7,983 Updated Jan 13, 2025

Reformer, the efficient Transformer, in Pytorch

Python 2,140 255 Updated Jun 21, 2023

BERT score for text generation

Jupyter Notebook 1,650 224 Updated Jul 30, 2024

Embedding-based evaluation metrics for dialogue generation.

Python 16 1 Updated Jan 8, 2023

PyTorch Implementation of "A Hierarchical Latent Structure for Variational Conversation Modeling" (NAACL 2018 Oral)

Python 173 45 Updated Jul 25, 2024

A dataset containing human-human knowledge-grounded open-domain conversations.

Python 643 98 Updated Aug 2, 2024

OpenDialKG: Explainable Conversational Reasoning with Attention-based Walks over Knowledge Graphs

137 14 Updated Oct 12, 2021

Tutorials on implementing a few sequence-to-sequence (seq2seq) models with PyTorch and TorchText.

Jupyter Notebook 5,439 1,349 Updated Jan 20, 2024
Next