max_features #203

mvalsania · 2024-12-08T00:30:20Z

How should we talk about the impact of max_features on the dvc?

I am trying to reference the formula we have ( O(klog(kd)) ) but I am still conflicted with the fact that we are not technically affecting the number of dimensions with this hyperparameter, we are affecting the number of dimensions that we take into consideration before deciding which dimension would lead to the best split. What is the right way to think about this? Should we think of this hyperparameter as indirectly and artificially affecting d?

Thanks!

mikeizbicki · 2024-12-09T07:28:51Z

The $d$ in the VC dimension formula for the decision tree is the dimension of the data space and unaffected by the max_features hyperparameter. The later does not affect the VC dimension, and is primarily used for reducing the runtime of training.

mvalsania · 2024-12-09T08:30:09Z

Thanks!

mikeizbicki added the Question label Dec 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

max_features #203

max_features #203

mvalsania commented Dec 8, 2024

mikeizbicki commented Dec 9, 2024

mvalsania commented Dec 9, 2024

max_features #203

max_features #203

Comments

mvalsania commented Dec 8, 2024

mikeizbicki commented Dec 9, 2024

mvalsania commented Dec 9, 2024