Skip to content

dgk_lost_conv 中文对白语料 chinese conversation corpus

Notifications You must be signed in to change notification settings

qhduan/dgk_lost_conv

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

========从字幕文件构建的中文对话脚本 conversation corpus========

结果:
dkg_lost.conv

方法:
asstosrt -s utf-8
注意输出 utf-8编码的 srt 文件
ass ----asstosrt---->srt
srt ----cvgen.py---->.conv

dkg_lost.conv 格式:
//M 表示话语,E 表示分割。
E
M 话语 a
M 话语 b
M 话语 c
M 话语 d
E
M 话语 a
M 话语 b
M 话语 c
M 话语 d

No commercial use, all rights reserved.

About

dgk_lost_conv 中文对白语料 chinese conversation corpus

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • SRecode Template 99.5%
  • Other 0.5%