Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

原本那个不加多音字表 没办法返回多个拼音么 #1

Closed
joker-fu opened this issue Jan 1, 2019 · 6 comments
Closed

原本那个不加多音字表 没办法返回多个拼音么 #1

joker-fu opened this issue Jan 1, 2019 · 6 comments
Labels
good first issue Good for newcomers question Further information is requested

Comments

@joker-fu
Copy link

joker-fu commented Jan 1, 2019

No description provided.

@pansong291
Copy link
Owner

pansong291 commented Jan 2, 2019

嗯,原本那个是根据词组返回拼音的,比如 重量 返回的是 ZHONGLIANG重庆 返回的是 CHONGQING。而一个 字只返回 CHONG

@joker-fu
Copy link
Author

joker-fu commented Jan 2, 2019

实在没闹明白算法 我的意思 一个字重 能不能返回它所有的拼音 不加多音字表的情况下

@joker-fu
Copy link
Author

joker-fu commented Jan 2, 2019

还有 哦和哈等多音都不对样 哦 正确的应该是E/O 多音查出来是 E/O/WO 哈 正确的应该是HA 多音查出来是HA/KA 类似的好像还有很多

@pansong291
Copy link
Owner

实在没闹明白算法 我的意思 一个字重 能不能返回它所有的拼音 不加多音字表的情况下

算法你可以参考打造最好的Java拼音库TinyPinyin(一):单字符转拼音的极致优化,因为原本TinyPinyin就没有做单字的多音,所以不加多音表没办法返回多个拼音,只能返回其中一个。换句话说,每个字都成了单音字,没有多音字的存在。因为我是在TinyPinyin的基础上改的,所以我加了一个多音表,如果全部重新写的话,就稍微麻烦一点。

@pansong291
Copy link
Owner

还有 哦和哈等多音都不对样 哦 正确的应该是E/O 多音查出来是 E/O/WO 哈 正确的应该是HA 多音查出来是HA/KA 类似的好像还有很多

关于这个问题,汉字转拼音的数据库最终来源是pinyin4j项目, 字的Unicode码是 54E6,在pinyin4j源代码里有一个unicode_to_hanyu_pinyin.txt文件,里面有一行内容是 54E6 (e2,o2,o4,wo2,wo4),数字表示声调,不看声调的话,pinyin4j里面的 字的确是有 eowo 这三个拼音,如果确实拼音有错误,那应该是pinyin4j的错误。

@joker-fu
Copy link
Author

joker-fu commented Jan 4, 2019

好的 谢谢

@joker-fu joker-fu closed this as completed Jan 4, 2019
@pansong291 pansong291 added good first issue Good for newcomers question Further information is requested labels Jan 5, 2019
@pansong291 pansong291 pinned this issue Sep 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants