Problem 6.2 - Optimization #215

mvalsania · 2024-12-11T09:06:03Z

Is there a reason to believe that using SGD on the full dataset would not always be a better idea compared to sampling a smaller dataset and using 2GD on it?

Isn't it true that "[in the realm of big data], approximate optimization can achieve better expected risk because more training examples can be processed within the allowed time"?

I feel like there is something I am missing.

mvalsania changed the title ~~Problem 6.2~~ Problem 6.2 - Optimization Dec 11, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Problem 6.2 - Optimization #215

Problem 6.2 - Optimization #215

mvalsania commented Dec 11, 2024

Problem 6.2 - Optimization #215

Problem 6.2 - Optimization #215

Comments

mvalsania commented Dec 11, 2024