Lima Vallantin
Lima Vallantin
Data scientist, Master's student, and interested in everything concerning Data, Natural Language Processing, and modern web.

Contents

Don't forget to share:

Share on linkedin
Share on twitter
Share on facebook

Don't forget to share:

Share on linkedin
Share on twitter
Share on whatsapp
Share on facebook

Today’s challenge is about creating your own dataset. I have already discussed how important is to understand this topic: since a lot of introductory Machine Learning examples use toy datasets, we don’t always have the opportunity to understand the issues of creating our own dataset.

Since every project is different and you may choose to build a dataset of basically anything, there’s no notebook to share today 🙁

From my side, I am scrapping a social media website to be able to create marketing personas. I am looking for information such as city, job, education, personal tastes and so on.

During the next days, I will explore data for at least 1 hour per day and post the notebooks, data and models, when they are available, to this repository.

Don't forget to share:

Share on linkedin
Share on twitter
Share on whatsapp
Share on facebook

Leave a Reply