Skip to content

Commit

Permalink
update readme.
Browse files Browse the repository at this point in the history
  • Loading branch information
xuming06 committed Aug 3, 2019
1 parent 2a943b9 commit d5dd961
Showing 1 changed file with 11 additions and 11 deletions.
22 changes: 11 additions & 11 deletions pycorrector/conv_seq2seq/README.md
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
# Neural Text Error Correction with CNN Sequence-to-Sequence Model
# Neural Text Error Correction with Conv Seq2Seq Model


## Features
Expand All @@ -25,16 +25,7 @@ The OOV words UNK in summaries are manually replaced with words in source articl
cd conv_seq2seq
python preprocess.py
```

- big train data

download from https://pan.baidu.com/s/1BkDru60nQXaDVLRSr7ktfA 密码:m6fg [130W sentence pair,215MB]



generate toy train data(`train.src` and `train.trg`) and valid data(`val.src` and `val.trg`), segment by char


result:
```
# train.src:
吸 烟 对 人 的 健 康 有 害 处 , 这 是 各 个 人 都 知 道 的 事 实 。
Expand All @@ -47,6 +38,15 @@ generate toy train data(`train.src` and `train.trg`) and valid data(`val.src` an
如 服 装 , 若 有 一 个 很 流 行 的 样 式 , 人 们 就 赶 快 地 追 求 。
```

- big train data

download from https://pan.baidu.com/s/1BkDru60nQXaDVLRSr7ktfA 密码:m6fg [130W sentence pair,215MB]



generate toy train data(`train.src` and `train.trg`) and valid data(`val.src` and `val.trg`), segment by char



## train

Expand Down

0 comments on commit d5dd961

Please sign in to comment.