+ Reply to Thread
Results 1 to 10 of 10

How to assume a distribution to be normal distribution or log-normal distribution?

  1. #1
    Registered User
    Join Date
    01-17-2015
    Location
    Japan
    MS-Off Ver
    2007
    Posts
    7

    How to assume a distribution to be normal distribution or log-normal distribution?

    I have a 2 column and 100 rows table with internal body concentration and accumulation distribution(any two data column). How do I "assume" it to be in normal distribution or log normal distribution?
    I need to calculate the average and std after i assume it to be in those distribution.
    Thank you!

    Exact question: Assume that the distribution of internal body concentration of chemical A among Japanese can be approximated to normal distribution. Estimate its average and std.
    Last edited by k4k4shi; 07-23-2015 at 11:07 PM.

  2. #2
    Forum Expert
    Join Date
    05-01-2014
    Location
    California, US
    MS-Off Ver
    Excel 2010
    Posts
    1,795

    Re: How to assume a distribution to be normal distribution or log-normal distribution?

    [.... deleted ....]
    Last edited by joeu2004; 07-23-2015 at 11:18 PM.

  3. #3
    Registered User
    Join Date
    01-17-2015
    Location
    Japan
    MS-Off Ver
    2007
    Posts
    7

    Re: How to assume a distribution to be normal distribution or log-normal distribution?

    Thank you very much for the quick reply. So I can just use the general AVERAGE and STDEV formula? Sorry for my ignorance. So I will get three values in Normal distrubtion right? Average, stdev and the one using NORMDIST?

  4. #4
    Forum Expert shg's Avatar
    Join Date
    06-20-2007
    Location
    The Great State of Texas
    MS-Off Ver
    2003, 2010
    Posts
    40,678

    Re: How to assume a distribution to be normal distribution or log-normal distribution?

    I'm no statistician, but ...

    Things that are constrained to be >= 0 (like the concentration of a chemical, which cannot be < 0), cannot be normally distributed -- but most natural distributions (Poisson and five others I can't think of) converge toward normal, and become negligably different.

    If the directed assumption is that is it normal, then the measures you have suggested are its parameters.

    If your preferred assumption is that it is log-normal, then the only way I know to measure it is via regression.
    Last edited by shg; 07-23-2015 at 11:54 PM.
    Entia non sunt multiplicanda sine necessitate

  5. #5
    Forum Expert
    Join Date
    05-01-2014
    Location
    California, US
    MS-Off Ver
    Excel 2010
    Posts
    1,795

    Re: How to assume a distribution to be normal distribution or log-normal distribution?

    Quote Originally Posted by shg View Post
    Things that are constrained to be >= 0 (like the concentration of a chemical, which cannot be < 0), cannot be normally distributed
    Theoretically correct, since the normal distribution goes from -infinity to +infinity.

    (So even data with negative values cannot be truly normally distributed, since +/-infinity are unattainable. )

    But in actual practice, I would disagree.

    For example, consider the distribution of salaries from 100,000 to 1,000,000 with a mean of 550,000 and a std dev of 56,250.

    If the salaries are (or should be) normally distributed, the probability of non-positive salaries (including zero) is 7.01E-23 = NORMDIST(0,550000,56250,1). Even the probability of salaries less than 100,000 is about 6.22E-16 = NORMDIST(99999.99,550000,56260,1).

    Both probabilities are so small that we do not need to be concerned. We can still say the salaries are (or can be) normally distributed within the prescribed range.

  6. #6
    Forum Expert
    Join Date
    05-01-2014
    Location
    California, US
    MS-Off Ver
    Excel 2010
    Posts
    1,795

    Re: How to assume a distribution to be normal distribution or log-normal distribution?

    Quote Originally Posted by k4k4shi View Post
    Thank you very much for the quick reply. So I can just use the general AVERAGE and STDEV formula?
    I think you are responding to my first response, which I subsequently "deleted". I deleted the posting because the details of your description are unclear, and my original response might have compounded the confusion.

    But the substance of my original response is correct.

    Yes, AVERAGE and STDEV are just arithmetic calculations. They do not depend on the type of distribution.

    However, the usefulness and interpretation of those statistics, especially the std dev, does depend on assumptions about the type of distribution.

    For example, we might want to say that 99.99% of a larger set of data should between mean - 4*stdDev and mean + 4*stdDev.

    That is based on the assumption that the larger set of data is normally distributed, and that the data that you have was randomly selected ("sampled") from the larger set of data.

    Note: I suggested that you use STDEV instead of STDEVP because your assignment asks you to "estimate" the std dev. I interpret that to mean: estimate the std dev of a theoretical larger set of data based on a presumedly random sample.


    Quote Originally Posted by k4k4shi View Post
    So I will get three values in Normal distrubtion right? Average, stdev and the one using NORMDIST?
    As I recall, in my original response, NORMDIST was merely one example of how we might rely on the assumption of normal distribution when we apply the AVERAGE and STDEV.

    We are not limited to just those 3 statistics.

  7. #7
    Forum Expert shg's Avatar
    Join Date
    06-20-2007
    Location
    The Great State of Texas
    MS-Off Ver
    2003, 2010
    Posts
    40,678

    Re: How to assume a distribution to be normal distribution or log-normal distribution?

    Quote Originally Posted by shg
    ... but most natural distributions converge toward normal, and become negligably different.

    Quote Originally Posted by joe
    Theoretically correct, since ... So even data with negative values cannot be truly normally distributed, since ... But in actual practice, ... For example, consider ... If the salaries are (or should be) ... Even the probability of salaries less than 100,000 is about ... Both probabilities are so small We can still say ... are (or can be) normally distributed within the prescribed range.
    Thank you for the world's longest definition of "negligable"

  8. #8
    Forum Expert
    Join Date
    05-01-2014
    Location
    California, US
    MS-Off Ver
    Excel 2010
    Posts
    1,795

    Re: How to assume a distribution to be normal distribution or log-normal distribution?

    Quote Originally Posted by shg View Post
    Thank you for the world's longest definition of "negligable"
    And thank you for presenting a distinction without a difference, in the first place.

    I'm sorry that I presumed you thought you were making an important point, which I thought required clarification lest it be misunderstood by k4k4shi, who obviously does not have enough grounding in statistical application to realize just how negligible your original comment was.

  9. #9
    Forum Expert
    Join Date
    05-01-2014
    Location
    California, US
    MS-Off Ver
    Excel 2010
    Posts
    1,795

    Re: How to assume a distribution to be normal distribution or log-normal distribution?

    Quote Originally Posted by k4k4shi View Post
    How do I "assume" it to be in normal distribution or log normal distribution?
    Quote Originally Posted by shg View Post
    If your preferred assumption is that it is log-normal, then the only way I know to measure it is via regression.
    When we say data is log-normally distributed ("log normal"), we simply mean that the log of the data is normally distributed. That does not require regression to "measure it", whatever that means.

    However, it does require that the log of the data is computable. That is, the data must be greater than zero.

    If the data is in B1:B100, we calculate the mean and std dev of the log data simply by array-entering the following formulas (press ctrl+shift+Enter instead of just Enter):

    =AVERAGE(LOG(B1:B100))
    =STDEV(LOG(B1:B100))

    Again, we can do those calcuations without assuming the log data is normally distributed. And the fact that we can do those calculations does not imply the log data is normally distributed.

    However, if we assume the data is log-normally distributed, we might expect, for example, that 99.99% of a larger set of log data -- the log of the "population" of data from which B1:B100 is a random sample -- lies between mean - 4*stdDev and mean + 4*stdDev.

  10. #10
    Forum Expert
    Join Date
    05-01-2014
    Location
    California, US
    MS-Off Ver
    Excel 2010
    Posts
    1,795

    Re: How to assume a distribution to be normal distribution or log-normal distribution?

    @k4k4kshi: Also see my responses #6 and #9.

    Quote Originally Posted by k4k4shi View Post
    Exact question: Assume that the distribution of internal body concentration of chemical A among Japanese can be approximated to normal distribution. Estimate its average and std.
    I apologize for my incessant posting, but it continues to bother me that the assignment says to assume a normal distribution when no such assumption is necessary to calculate an average and std dev.

    I wonder if by "estimate its average", the assignment means: calculate a confidence interval around the mean based on some degree of confidence (typically 95%). In other words, calculate mean +/- delta, where delta is z*stdErr or t*stdErr.

    The stdErr of the mean is: STDEV(B1:B100)/SQRT(COUNT(B1:B100)).

    Alternatively: STDEVP(B1:B100)/SQRT(COUNT(B1:B100)-1).

    For 95% confidence, z is -NORMSINV((1-95%)/2), which is about 1.96. t is TINV(1-95%,COUNT(B1:B100)-1), which is about 1.98. The larger the sample size, the closer Student's t is to z.

    Although we can calculate the CI around the mean of random samples from a non-normal distribution, it tends to require large samples. So it is indeed "better" if we can assume the data is a random sample from a normal distribution.

+ Reply to Thread

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Similar Threads

  1. Normal Distribution
    By dalai in forum Excel General
    Replies: 1
    Last Post: 08-23-2014, 05:58 PM
  2. Normal distribution
    By tnknsnj in forum Excel General
    Replies: 1
    Last Post: 01-17-2012, 08:47 PM
  3. Replies: 5
    Last Post: 01-31-2011, 12:21 AM
  4. different colors in a normal distribution
    By Pasha81 in forum Excel Charting & Pivots
    Replies: 11
    Last Post: 10-29-2009, 11:55 AM
  5. Normal distribution
    By smaug82 in forum Excel Programming / VBA / Macros
    Replies: 3
    Last Post: 03-07-2008, 02:56 AM
  6. [SOLVED] NORMAL DISTRIBUTION
    By FLKULCHAR in forum Excel Formulas & Functions
    Replies: 13
    Last Post: 10-27-2005, 06:05 PM
  7. Normal Distribution?
    By Anthony Slater in forum Excel General
    Replies: 3
    Last Post: 03-08-2005, 04:06 PM

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts

Search Engine Friendly URLs by vBSEO 3.6.0 RC 1