Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

use SymSpell with hunspell dictionaries? #35

Open
danielnaber opened this issue Jul 9, 2018 · 3 comments
Open

use SymSpell with hunspell dictionaries? #35

danielnaber opened this issue Jul 9, 2018 · 3 comments

Comments

@danielnaber
Copy link

This is not really a bug report, sorry for using the issue tracker for this, but I'd like to see if someone has worked on this. Please point me to a better place if you know one:

Has anybody worked on using SymSpell with German hunspell dictionaries? German uses compounds, so you cannot just export a long list from the hunspell dictionaries and use them as input for SymSpell. The hunspell dictionary has special flags that indicate which words can be used in a compound, these would need to be considered somehow.

@wolfgarbe
Copy link
Owner

You could use the German dictionary from Aspell .
This is a plain word list. In order to add the word frequencies required by SymSpell you can combine/intersect it with Google Books Ngram data .
The required code to create such custom word frequency dictionaries you can find here: #15

That being said, making SymSpell compatible with hunspell dictionaries would be certainly useful. I will add this to the todo list.

@akaSourav
Copy link

Please add suffix and prefix support like Hunspell. It will be very useful. Hunspell dictionaries are pretty good and up to date.

@zephyrous
Copy link

Wolf, should we expect any update with regards to Hunspell dictionaries anytime soon? Thank you for your excellent work so far.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants