Today’s challenge is about creating your own dataset. I have already discussed how important it is to understand this topic: since a lot of introductory Machine Learning examples use toy datasets, we don’t always get the opportunity to face the issues that come with creating our own dataset.
Since every project is different and you may choose to build a dataset of basically anything, there’s no notebook to share today 🙁
For my part, I am scraping a social media website to create marketing personas. I am looking for information such as city, job, education, personal tastes, and so on.
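To make the target shape of such a dataset a bit more concrete, here is a minimal sketch of how the fields listed above could be stored, one row per profile. It is not the actual scraper: the `Profile` class, its field names, the `profiles_to_csv` helper, and the sample rows are all hypothetical and only illustrate one possible layout.

```python
import csv
import io
from dataclasses import dataclass, field, asdict, fields


@dataclass
class Profile:
    """Hypothetical record for one profile; the field names are assumptions."""
    city: str = ""
    job: str = ""
    education: str = ""
    tastes: list = field(default_factory=list)


def profiles_to_csv(profiles):
    """Serialize profiles to CSV text, joining the tastes list with ';'."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=[f.name for f in fields(Profile)],
        lineterminator="\n",
    )
    writer.writeheader()
    for profile in profiles:
        row = asdict(profile)
        # A list does not fit in a single CSV cell, so flatten it to text.
        row["tastes"] = ";".join(row["tastes"])
        writer.writerow(row)
    return buffer.getvalue()


# Made-up placeholder rows, not real scraped data.
sample = [
    Profile(city="Lisbon", job="Designer", education="BA", tastes=["jazz", "hiking"]),
    Profile(city="Porto", job="Engineer"),
]
csv_text = profiles_to_csv(sample)
print(csv_text)
```

Giving every field an empty default means incomplete profiles still produce a full row, which is likely to matter here because scraped profiles rarely expose every attribute.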
Over the next few days, I will explore the data for at least 1 hour per day and post the notebooks, data, and models to this repository as they become available.