How to use with gridsearch? #1
Comments
It's been a while since I made this. Back then I don't think having a
For your error with the Feel free to comment if you have further questions! |
@StevenReitsma I think perplexity and topic coherence can serve as quantitative metrics for goodness of fit. Since we need to determine the number of topics for LDA, I think the score() function can help choose the best number of topics.
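To make the idea above concrete, here is a minimal sketch of choosing the number of topics by grid search on `score()`. It assumes scikit-learn's own `LatentDirichletAllocation` as a stand-in (not this project's wrapper), whose `score()` returns an approximate log-likelihood, and it uses a random toy count matrix rather than real documents. When no `scoring` argument is given, `GridSearchCV` falls back to the estimator's `score()` method.

```python
import numpy as np
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.model_selection import GridSearchCV

# Toy document-term count matrix (assumption: stands in for real bag-of-words data).
rng = np.random.RandomState(0)
X = rng.poisson(1.0, size=(60, 30))

# No `scoring` argument: GridSearchCV calls the estimator's own score(),
# which for LatentDirichletAllocation is an approximate log-likelihood.
search = GridSearchCV(
    LatentDirichletAllocation(max_iter=5, random_state=0),
    param_grid={"n_components": [2, 4, 6]},
    cv=3,
)
search.fit(X)  # unsupervised, so no y is passed

best_k = search.best_params_["n_components"]
print("best number of topics:", best_k)
```

On real data the candidate values and `max_iter` would need to be much larger; the small numbers here only keep the sketch fast.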
Thanks for those links. It looks like you can definitely use those metrics to approximate the goodness of fit, and that should be fine if your ultimate goal is good topic coherence or good perplexity. However, in a real-world use case your goal is usually not good topic coherence or perplexity but good classification or regression performance. Hence my comment about adding a classifier to your pipeline, so that you measure performance on your actual problem. But again, if you're working on a research problem where the goal is good topic coherence, perplexity, or another metric, then using those in a GridSearch should be a perfect solution! Adding that as a
@StevenReitsma |
I got this error while fitting with GridSearchCV.
So I read the manual (http://scikit-learn.org/stable/developers/contributing.html#rolling-your-own-estimator).
Some functions should be implemented to use GridSearchCV.
How did you do it?
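For readers with the same question, here is a minimal sketch of what GridSearchCV needs from an estimator. It is not this project's implementation: `ToyTopicModel` and its `num_topics` parameter are hypothetical, and a truncated SVD stands in for a real topic model. Inheriting from `BaseEstimator` supplies `get_params()` and `set_params()` as long as `__init__` only stores its keyword arguments; `fit()` and `score()` are written by hand.

```python
import numpy as np
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.model_selection import GridSearchCV


class ToyTopicModel(BaseEstimator, TransformerMixin):
    """Hypothetical estimator; a truncated SVD stands in for a topic model."""

    def __init__(self, num_topics=2):
        # Only store the arguments: BaseEstimator derives
        # get_params()/set_params() from the __init__ signature.
        self.num_topics = num_topics

    def fit(self, X, y=None):
        X = np.asarray(X, dtype=float)
        _, _, vt = np.linalg.svd(X, full_matrices=False)
        self.components_ = vt[: self.num_topics]
        return self  # fit must return the estimator itself

    def transform(self, X):
        return np.asarray(X, dtype=float) @ self.components_.T

    def score(self, X, y=None):
        # Higher is better: negative mean squared reconstruction error.
        X = np.asarray(X, dtype=float)
        reconstruction = self.transform(X) @ self.components_
        return -float(np.mean((X - reconstruction) ** 2))


rng = np.random.RandomState(0)
X = rng.poisson(1.0, size=(30, 8))  # toy count matrix

search = GridSearchCV(ToyTopicModel(), {"num_topics": [1, 2, 3]}, cv=3)
search.fit(X)
print(search.best_params_)
```

The key design point is that every tunable value is an explicit `__init__` keyword stored under the same attribute name, since that is how GridSearchCV clones the estimator and sets each candidate parameter.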