Pivot table- How does excel calculate the average of a column sliced by another column?

**trenzalore888** · 04-07-2017, 06:34 AM

Hi there, I am learning the kaggle tutorial on learning patterns from data regarding the sinking of the titanic:

One method is using pivot tables, but, I don't understand the math excel is using! Please could some help explain the calculation. Thank you!

There are 891 rows of data, each row for a passenger.

There are several columns which have data regarding this passengers. One column is "survived", which has binary data, 0 representing not survived, and 1 = survived.

342 of these passengers survived. In a pivot table, this would put the Average of survived as 0.38383838 etc .
I understand this calculation, because it is simply 342(number of survivors) divided by 891 (number of passengers) No problem!

However, the calculation I do not understand is when I introduce another column into the row labels in the pivot.

This new column is the gender of the passengers. Now, the pivot table breaks down the Average of Survived by female and male. But I don't understand the calculation.

Row Labels	Average of Survived	Sum of Survived	count of gender
female	0.742038217	233	314
male	0.188908146	109	577
Grand Total	0.383838384	342	891

I don't understand this, there is no missing data, but the average of survived for female (0.742038217) and male (0.188908146) do not add up me? There were 233 women, and 342 survivors, so the percentage of women who survived would be about 68%?

Essentially what I don't understand is the difference between the the percentage of women who survived, and the average of women who survived. How does Excel come up with this figure of 0.742038217 for women and 0.188908146 for men?

Sorry I cant upload the excel spreadsheet as I am unable to from my work computer. But the link to the CSV file if needed is https://www.kaggle.com/c/titanic/data (it is the csv file called train)

Thank you for your help!

**trenzalore888** · 04-07-2017, 07:01 AM

Sorry I messed up! I have now included the Count of gender, (314 females, 577 males)

I can now see excel has calculated
Female average survived: 233/314 = 0.742038217
male average survived: 109/577=0.188908146

But I still find it weird it doesn't add up to 100%?

**trenzalore888** · 04-07-2017, 07:15 AM

Ok I nearly understand it! so it is because I have not taken into account the not survived women and the not survived men.

not survived women = 81 (91/314 = 0.257961783 , which plus 0.742038217 = 1

not survived men = 468 (468/577)= 0.81109186, which plus 0.188908146 = 1

My only question now though is how can you create this column on the pivot table? I had to manually put this in separate cells. When you drag the "survived" column to values, the sum of survived automatically takes those who survived, is there a way to pull it into the values to show as not survived??

Thank you

**Richard Buttrey** · 04-07-2017, 07:20 AM

Are you able to upload the csv file here.

Kaggle is wanting me to sign up to its service

**trenzalore888** · 04-07-2017, 07:47 AM

Unfortunately I cannot upload it to this website, but I can upload it to a file host website? Is there one that you can use/recommend and I can upload it there? Thank you

when I try to upload it here I just get a blank box: upload error.png

**trenzalore888** · 04-07-2017, 07:56 AM

Here is a screenshot of my pivot table so far.

You can see below the pivot table I manually worked out the count of not survived women/men and the average of not survived women/men

But is there a way to do this within the pivot table? Ie get those yellow highlighted cells into the pivot table pivot table unfinished.png

thank you

**Richard Buttrey** · 04-07-2017, 08:23 AM

Have you played around with the 'Show Values As % of....' options where you can choose Parent Rows/Columns or Total Rows/Columns.

What about putting two helper columns in your data for 'Not Survived Men' & Not survived Women' and using these in the PT?

Are you able to put the file in DropBox and attach a link here?

**trenzalore888** · 04-07-2017, 08:45 AM

https://www.dropbox.com/l/s/AABftbDZ...kqTw8V1xKCZdM8

Does this link work?

I was hoping to avoid helper columns if possible, I tried using shows values as % of, but couldn't get it to work. I thought difference from would work but no luck. I think the key is to get the new column to be Count of gender minus sum of survived.

Thanks for your help!

Pivot table- How does excel calculate the average of a column sliced by another column?

LinkBack

Thread Tools

Rate This Thread

Display

Pivot table- How does excel calculate the average of a column sliced by another column?

Re: Pivot table- How does excel calculate the average of a column sliced by another column

Re: Pivot table- How does excel calculate the average of a column sliced by another column

Re: Pivot table- How does excel calculate the average of a column sliced by another column

Re: Pivot table- How does excel calculate the average of a column sliced by another column

Re: Pivot table- How does excel calculate the average of a column sliced by another column

Re: Pivot table- How does excel calculate the average of a column sliced by another column

Re: Pivot table- How does excel calculate the average of a column sliced by another column

Thread Information

Users Browsing this Thread

Similar Threads

Pivot Table, average of sums in column

I want to add an average column to a pivot table

Pivot Table - calculate a new column

[SOLVED] Pivot Table -- Add column to average Grand Total

[SOLVED] Pivot Table -- Add column to average Grand Total

Bookmarks

Bookmarks

Posting Permissions