Creating standard deviations with complicated data set

**mounstahman** · 09-06-2015, 09:52 PM

Hello,

I am trying to create standard deviations for a dataset of chat, which consists of lines of chat coded with 1 or 0 along a multitude of categories. For example, a chat line that talks about money will be coded 1 in the money column and 0 in the others. Each line has a player name associated with it, and the chat is organized into different 'phases'.

Now here is where it gets tricky. I want to be able to calculate the standard deviation of each player's mention of money, for example, within each phase. SO essentially comparing the average times money was mentioned versus each player's mention of money, but only in that phase. I am really struggling to be able to organize the data in a way that allows me to compare the data this way.

Any tips are greatly appreciated, and clarifying questions are encouraged!

Thank you so much for your time and advice!

**MarvinP** · 09-06-2015, 11:53 PM

Hi mounstahman and welcome to the forum.

It seems to me you need to decide what the dimensions of your study are. Say you have 5000 pharses and 10 categories you are tracking. Do you want to say category "Money" was used in 100 of the 5000 phrases? Do you want to say the Bill mentioned it most and he was 2 standard deviations above the others in mentioning money? If you are using the standard deviation you need an average and spread of somethings. Say you have 5 people who are in this chat study and person 1 mentions money 4 times in 100 posts and person 2 mentions money 15 times in 15 posts. How do you compare these two different people? Do you want to compare this chat by people, posts or what???

**mounstahman** · 09-07-2015, 02:52 PM

Hey MarviP,

Thanks for answering. It is the variance of players' use of each speech act within each phase. So what I am essentially trying to do is compare the total number of mentions of money, for example, in each of the phases of the game to each player's individual mentioning of money within that same phase.

There will be as many measures of variance as there are averages and as many as there are phases, and each average will have a measure of variance that corresponds to it.

What I am really struggling with is how to gather such information efficiently, when I am looking at a long list of data listing seconds since start, player name, chat data, and the categories coded as 1 or 0.

Thanks so much for the help!

Creating standard deviations with complicated data set

LinkBack

Thread Tools

Rate This Thread

Display

Creating standard deviations with complicated data set

Re: Creating standard deviations with complicated data set

Re: Creating standard deviations with complicated data set

Thread Information

Users Browsing this Thread

Similar Threads

Conditional Formatting for Standard Deviations

Conditional Formatting with standard deviations

4 Standard Deviations

[SOLVED] Calculate 2 Standard Deviations

Mean of standard deviations across columns?

Graph Standard Deviations

Standard deviations in Excel

Tags for this Thread

Bookmarks

Bookmarks

Posting Permissions