forked from NVIDIA/NeMo
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* refactor serial class Signed-off-by: ekmb <[email protected]> * add and for cardinal part Signed-off-by: ekmb <[email protected]> * move and to the cardinal tagger Signed-off-by: ekmb <[email protected]> * fix tests Signed-off-by: ekmb <[email protected]> * grammar folder on jenkins updated Signed-off-by: ekmb <[email protected]> * resolve merge conflicts, update test cases Signed-off-by: ekmb <[email protected]> * jenkins Signed-off-by: ekmb <[email protected]> Co-authored-by: Yang Zhang <[email protected]>
- Loading branch information
Showing
36 changed files
with
460 additions
and
407 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
24 changes: 12 additions & 12 deletions
24
nemo_text_processing/text_normalization/en/data/time_suffix.tsv
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,12 +1,12 @@ | ||
p.m. p m | ||
p.m p m | ||
pm p m | ||
P.M. P M | ||
P.M P M | ||
PM P M | ||
a.m. a m | ||
a.m a m | ||
am a m | ||
A.M. A M | ||
A.M A M | ||
AM A M | ||
p.m. PM | ||
p.m PM | ||
pm PM | ||
P.M. PM | ||
P.M PM | ||
PM PM | ||
a.m. AM | ||
a.m AM | ||
am AM | ||
A.M. AM | ||
A.M AM | ||
AM AM |
28 changes: 14 additions & 14 deletions
28
nemo_text_processing/text_normalization/en/data/time_zone.tsv
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,14 +1,14 @@ | ||
cst c s t | ||
c.s.t c s t | ||
cet c e t | ||
c.e.t c e t | ||
pst p s t | ||
p.s.t p s t | ||
est e s t | ||
e.s.t e s t | ||
pt p t | ||
p.t p t | ||
et e t | ||
e.t e t | ||
gmt g m t | ||
g.m.t g m t | ||
cst CST | ||
c.s.t CST | ||
cet CET | ||
c.e.t CET | ||
pst PST | ||
p.s.t PST | ||
est EST | ||
e.s.t EST | ||
pt PT | ||
p.t PT | ||
et ET | ||
e.t ET | ||
gmt GMT | ||
g.m.t GMT |
21 changes: 21 additions & 0 deletions
21
nemo_text_processing/text_normalization/en/data/whitelist_symbols.tsv
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,21 @@ | ||
& and | ||
# hash | ||
@ at | ||
§ section | ||
™ trademark | ||
® registered trademark | ||
© copyright | ||
_ underscore | ||
% percent sign | ||
* asterisk | ||
+ plus | ||
/ slash | ||
= equal sign | ||
^ circumflex | ||
| vertical bar | ||
~ tilde | ||
$ dollar | ||
£ pound | ||
€ euro | ||
₩ won | ||
¥ yen |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
24 changes: 16 additions & 8 deletions
24
nemo_text_processing/text_normalization/en/data/year_suffix.tsv
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,8 +1,16 @@ | ||
A. D a d | ||
A.D a d | ||
B. C b c | ||
B.C b c | ||
A. D. a d | ||
A.D. a d | ||
B. C. b c | ||
B.C. b c | ||
A. D AD | ||
A.D AD | ||
a. d AD | ||
a.d AD | ||
a. d. AD | ||
a.d. AD | ||
B. C BC | ||
B.C BC | ||
b. c BC | ||
b.c BC | ||
A. D. AD | ||
A.D. AD | ||
B. C. BC | ||
B.C. BC | ||
b. c. BC | ||
b.c. BC |
Oops, something went wrong.