Hi Dear Course Support Team,
I reviewed the Recommended Datasets for Machine Learning Course Project and while there is diverse set of datasets therein, I wonder which dataset is more manageable with regards to the project time/resources constraint given the project acceptance criteria which might not be in line with some of the datasets.
For example, majority of datasets require model training for several hours without any interruption in Jupyter Notebook connectivity to Server (GPU-powered machine needed) or require extensive data preparation, merging & math-heavy feature engineering beyond the scope of the course.
Majority requires transformation beyond imputation, scaling & encoding which may in turn cause some hiccups in the middle of the project up to the point of switching to other datasets in order to meet the project criteria.
I would be grateful of any recommendation in terms of manageable dataset selection.