Lima Vallantin
Wilame
Marketing data scientist and Master's student interested in everything concerning Data, Text Mining, and Natural Language Processing. Currently speaking Brazilian Portuguese, French, English, and a tiiiiiiiiny bit of German. Want to connect? You can send me a message. For more information about me, you can visit this page.

Today’s challenge is about creating your own dataset. I have already discussed how important it is to understand this topic: since a lot of introductory Machine Learning examples use toy datasets, we rarely get the opportunity to face the issues that come with creating our own dataset.

Since every project is different and you may choose to build a dataset of basically anything, there’s no notebook to share today 🙁

On my side, I am scraping a social media website so that I can create marketing personas. I am looking for information such as city, job, education, personal tastes and so on.
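To make the idea of a persona dataset more concrete, here is a minimal sketch of how scraped profile fields could be cleaned and stored as rows. It is only an illustration under assumptions: the `Profile` class, the helper names, the `;` separator for tastes, and the sample records are all hypothetical, and the actual scraping step is left out because it depends entirely on the website.

```python
import csv
import io
from dataclasses import dataclass, asdict, fields
from typing import Optional


@dataclass
class Profile:
    """One row of the persona dataset (field names are illustrative)."""
    city: Optional[str] = None
    job: Optional[str] = None
    education: Optional[str] = None
    tastes: Optional[str] = None  # assumed format: items joined with ";"


def clean(value):
    """Collapse whitespace in a scraped value and map empty text to None."""
    if value is None:
        return None
    text = " ".join(str(value).split())
    return text or None


def to_profile(raw: dict) -> Profile:
    """Build a Profile from a raw scraped dict, ignoring unknown keys."""
    names = [f.name for f in fields(Profile)]
    return Profile(**{name: clean(raw.get(name)) for name in names})


def write_csv(profiles, handle) -> None:
    """Write profiles to an open text handle as CSV with a header row."""
    writer = csv.DictWriter(handle, fieldnames=[f.name for f in fields(Profile)])
    writer.writeheader()
    for profile in profiles:
        writer.writerow(asdict(profile))


# Made-up records standing in for whatever the scraper returns.
raw_records = [
    {"city": "  Paris ", "job": "Data\n analyst", "education": "", "tastes": "music;travel"},
    {"city": "Lyon", "followers": 120},  # missing fields and an extra key
]

profiles = [to_profile(record) for record in raw_records]

buffer = io.StringIO()  # swap for open("profiles.csv", "w", newline="") to save to disk
write_csv(profiles, buffer)
print(buffer.getvalue())
```

Keeping missing values as `None` instead of guessing them makes the gaps visible later, during data exploration, which is one of the issues toy datasets tend to hide.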

Over the next few days, I will explore data for at least 1 hour per day and post the notebooks, data and models to this repository as they become available.
