Hey guys,
I need help with a data mining assignment. I have attached dummy data that will help you understand how my data is structured.
What I need to achieve:
- Being able to filter the data by multiple queries (people who have used the app "MO" + they are from "USA")
- Being able to filter the data by interests as well (people who have an interest in "Movies" & they are from "GB")
- After the filtering is done, I need to know the numbers of records that meet my criteria as well as their interests in the case of 1. So I want to know the top 5 interests of the users who used the app "MO" & that are from "USA".
After that, creating charts to show these numbers would be of added value.
My objective from this assignment is to answer the question HOW MANY & not WHO. So if I query "MO" users + from "GB" then I want to know how many people that fit these criteria exist as well as what are their top 5 interests.
Which direction should I take to achieve this objective? Should I use pivot tables? (I'm having problems with the interests data being divided in 3) Should I not use excel at all for this?
Awaiting your answers.
Cheers.
data-mining.xlsx
Bookmarks