Hello,
I am trying to calculate standard deviations for a chat dataset, which consists of lines of chat coded with 1 or 0 along a multitude of categories. For example, a chat line that talks about money will be coded 1 in the money column and 0 in the others. Each line has a player name associated with it, and the chat is organized into different 'phases'.
Now here is where it gets tricky. I want to calculate the standard deviation of each player's mentions of money, for example, within each phase. So essentially I want to compare each player's number of money mentions against the average number of money mentions, but only within that phase. I am really struggling to organize the data in a way that allows this comparison.
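To make the goal concrete, here is a rough sketch of the kind of calculation I have in mind. It assumes Python with pandas, which may not be the right tool for this, and the column names (`player`, `phase`, `money`) and the toy rows are made up for illustration. It also assumes that "standard deviation" here means the spread of per-player money counts within a phase, which is only one possible reading of what I described.

```python
import pandas as pd

# Hypothetical toy data: one row per chat line, with a 0/1 code per category.
chat = pd.DataFrame({
    "player": ["A", "A", "B", "B", "A", "B", "B", "C"],
    "phase":  [1, 1, 1, 1, 2, 2, 2, 2],
    "money":  [1, 0, 1, 1, 0, 1, 0, 1],
})

# Step 1: count each player's money mentions within each phase.
# Note: a player with no chat lines in a phase does not appear for that phase.
per_player = (
    chat.groupby(["phase", "player"])["money"]
        .sum()
        .rename("money_mentions")
        .reset_index()
)

# Step 2: mean and sample standard deviation of those counts, per phase.
phase_stats = per_player.groupby("phase")["money_mentions"].agg(["mean", "std"])

# Step 3: attach the phase-level values to each player row and express
# each player's count as a distance from the phase mean in SD units.
grouped = per_player.groupby("phase")["money_mentions"]
per_player["phase_mean"] = grouped.transform("mean")
per_player["phase_sd"] = grouped.transform("std")
per_player["z"] = (
    per_player["money_mentions"] - per_player["phase_mean"]
) / per_player["phase_sd"]

print(phase_stats)
print(per_player)
```

If proportions make more sense than raw counts (for example, because some players talk far more than others), swapping `.sum()` for `.mean()` in step 1 would give each player's share of lines that mention money instead.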
Any tips are greatly appreciated, and clarifying questions are encouraged!
Thank you so much for your time and advice!