-
Notifications
You must be signed in to change notification settings - Fork 107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Input contains NaN, infinity or a value too large for ('float64') #105
Comments
I also tried running locally, although with a different version and environment Octis: 1.10.2 And I got this full traceback, and by inspection I got a value of -inf for f_val
|
It was related to a topic that was absent in the dataset --- due to some bug, I had a vocabulary with words that were not in the primary dataset. |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Description
I am trying to run the Google Colab example provided in the repo README. I only changed the dataset, to load a custom dataset using the
load_custom_dataset_from_folder()
in the .tsv format. I executed the algorithm with a small vocab (39 words) without problems, but with a "big" vocabulary (7894 words), I got an error from sklearn.utils.validation.py as follows:Also, note that my dataset is split into train (70%), val (10%) and test (20%).
What I Did
The text was updated successfully, but these errors were encountered: