+ Reply to Thread
Results 1 to 10 of 10

Simple statistical analysis

  1. #1
    Steve Wylie
    Guest

    Simple statistical analysis

    I have a workbook going across several dozen sheet tabs, containing
    demographic information on about 10,000 people (I work for a local authority
    - I'm not some direct marketer or spammer or anything!)

    One of the items of data is their date of birth. I need to use this in
    Excel somehow to extract percentages & counts of how many people fit into
    certain age groups (18-30, 31-40, 41-50 etc).

    Is this possible without moving the data out of Excel, using formulas? We
    have a survey/data analysis program but I am loath to transfer all that data
    unnecessarily when it could just be done in Excel...

    Thank you for any help anyone can give.

    Steve Wylie
    Canterbury
    United Kingdom


  2. #2
    Jerry W. Lewis
    Guest

    Re: Simple statistical analysis

    =SUMPRODUCT((ROUNDDOWN((TODAY()-dates)/365.25,0)<=40)*(ROUNDDOWN((TODAY()-dates)/365.25,0)>30))
    will count the number on a single sheet that are in the 31-40 age group.
    Unfortunately, it will not work with 3-D ranges.

    Jerry

    Steve Wylie wrote:

    > I have a workbook going across several dozen sheet tabs, containing
    > demographic information on about 10,000 people (I work for a local authority
    > - I'm not some direct marketer or spammer or anything!)
    >
    > One of the items of data is their date of birth. I need to use this in
    > Excel somehow to extract percentages & counts of how many people fit into
    > certain age groups (18-30, 31-40, 41-50 etc).
    >
    > Is this possible without moving the data out of Excel, using formulas? We
    > have a survey/data analysis program but I am loath to transfer all that data
    > unnecessarily when it could just be done in Excel...
    >
    > Thank you for any help anyone can give.
    >
    > Steve Wylie
    > Canterbury
    > United Kingdom
    >
    >



  3. #3
    Tom Ogilvy
    Guest

    Re: Simple statistical analysis

    =countif(sheet1!A:A,">=01/01/1985")-countif(sheet1!A:A,">31/12/1995")
    would give you the count of people born between 1985 and 1995 inclusive and
    listed on sheet1 assuming birthdate is in column A (as an example)

    from this, you should be able to figure out how to address other sheets.
    Perhaps a separate sheet where you gather data from each individual sheet
    and consolidate or combine formulas.

    --
    Regards,
    Tom Ogilvy



    "Steve Wylie" <[email protected]> wrote in message
    news:[email protected]...
    > I have a workbook going across several dozen sheet tabs, containing
    > demographic information on about 10,000 people (I work for a local

    authority
    > - I'm not some direct marketer or spammer or anything!)
    >
    > One of the items of data is their date of birth. I need to use this in
    > Excel somehow to extract percentages & counts of how many people fit into
    > certain age groups (18-30, 31-40, 41-50 etc).
    >
    > Is this possible without moving the data out of Excel, using formulas? We
    > have a survey/data analysis program but I am loath to transfer all that

    data
    > unnecessarily when it could just be done in Excel...
    >
    > Thank you for any help anyone can give.
    >
    > Steve Wylie
    > Canterbury
    > United Kingdom
    >




  4. #4

    Re: Simple statistical analysis

    Unfortunately, I cannot get this formula to work on the sheet I am
    using - it just says #VALUE! in the cell.

    On reflection, I think the sheet I am using is too messed-about with to
    use a formula. I'll paste the dates into my analysis program.

    Thanks anyway

    Steve


  5. #5
    Jerry W. Lewis
    Guest

    Re: Simple statistical analysis

    If you are not trying to use 3D references, then the only way to get
    #VALUE! is if there is a #VALUE error in your dates range, or if at
    least one of the cells in the dates range contains text that cannot be
    coerced into a date.

    Jerry

    [email protected] wrote:

    > Unfortunately, I cannot get this formula to work on the sheet I am
    > using - it just says #VALUE! in the cell.
    >
    > On reflection, I think the sheet I am using is too messed-about with to
    > use a formula. I'll paste the dates into my analysis program.
    >
    > Thanks anyway
    >
    > Steve



  6. #6
    Steve Wylie
    Guest

    Re: Simple statistical analysis

    Yeah, that's the trouble. The dates have not been inputted consistently.
    There are many false entries where people have put "16 Dec" and no year (it
    should all be dd.mm.yy) or just "age 42" or rubbish like that. The analysis
    program I use just ignores all that, whereas Excel throws up an error.

    And I suspect your formula doesn't like the years in two-digit format
    either...

    Thanks, but I did it using the analysis program in the end. Shame tho.

    Steve


  7. #7
    Steve Wylie
    Guest

    Re: Simple statistical analysis

    I just did a quick "example" run of your formula on some dummy data in
    uniform format, and needless to say it worked. I shall make a note of the
    formula for future use if I ever get any decent data sent to me that allows
    me to use it!

    Thanks again
    Steve


  8. #8
    Jerry W. Lewis
    Guest

    Re: Simple statistical analysis

    Regardless of how they are formatted, Excel dates are stored as the
    number of days since the beginning of 1900. Provided that the entry is
    an Excel date or can be coerced into an Excel date, the formula should work.

    Data QC is often the biggest portion of data analysis.

    Jerry

    Steve Wylie wrote:

    > Yeah, that's the trouble. The dates have not been inputted consistently.
    > There are many false entries where people have put "16 Dec" and no year (it
    > should all be dd.mm.yy) or just "age 42" or rubbish like that. The analysis
    > program I use just ignores all that, whereas Excel throws up an error.
    >
    > And I suspect your formula doesn't like the years in two-digit format
    > either...
    >
    > Thanks, but I did it using the analysis program in the end. Shame tho.
    >
    > Steve



  9. #9
    Myrna Larson
    Guest

    Re: Simple statistical analysis

    Hi, Jerry:

    I was surprised by the OP's statement that "the analysis program ... just
    ignores all that". If it just throws out the data, the results will be
    worthless. If it in fact interprets those entries by calculating a date, the
    user should be aware of that.

    But the bottom line is that there should be data validation in place that
    disallows entries that aren't dd.mm.yy; and after all of the fuss about Y2K, 2
    digit years should have been disallowed too.



    On Mon, 09 May 2005 14:46:11 -0400, "Jerry W. Lewis"
    <post_a_reply@no_e-mail.com> wrote:

    >Regardless of how they are formatted, Excel dates are stored as the
    >number of days since the beginning of 1900. Provided that the entry is
    >an Excel date or can be coerced into an Excel date, the formula should work.
    >
    >Data QC is often the biggest portion of data analysis.
    >
    >Jerry
    >
    >Steve Wylie wrote:
    >
    >> Yeah, that's the trouble. The dates have not been inputted consistently.
    >> There are many false entries where people have put "16 Dec" and no year (it
    >> should all be dd.mm.yy) or just "age 42" or rubbish like that. The

    analysis
    >> program I use just ignores all that, whereas Excel throws up an error.
    >>
    >> And I suspect your formula doesn't like the years in two-digit format
    >> either...
    >>
    >> Thanks, but I did it using the analysis program in the end. Shame tho.
    >>
    >> Steve



  10. #10
    Jerry W. Lewis
    Guest

    Re: Simple statistical analysis

    Ignoring inappropriate values is not unreasonable, provided that it
    calls your attention to what it has done. The accuracy of (pre-2003)
    LINEST is comparable to PROC GLM in SAS. Excel gets slammed and SAS
    doesn't because SAS warns the user when results are not numerically
    trustworthy.

    Jerry

    Myrna Larson wrote:

    > Hi, Jerry:
    >
    > I was surprised by the OP's statement that "the analysis program ... just
    > ignores all that". If it just throws out the data, the results will be
    > worthless. If it in fact interprets those entries by calculating a date, the
    > user should be aware of that.
    >
    > But the bottom line is that there should be data validation in place that
    > disallows entries that aren't dd.mm.yy; and after all of the fuss about Y2K, 2
    > digit years should have been disallowed too.
    >
    >
    >
    > On Mon, 09 May 2005 14:46:11 -0400, "Jerry W. Lewis"
    > <post_a_reply@no_e-mail.com> wrote:
    >
    >
    >>Regardless of how they are formatted, Excel dates are stored as the
    >>number of days since the beginning of 1900. Provided that the entry is
    >>an Excel date or can be coerced into an Excel date, the formula should work.
    >>
    >>Data QC is often the biggest portion of data analysis.
    >>
    >>Jerry
    >>
    >>Steve Wylie wrote:
    >>
    >>
    >>>Yeah, that's the trouble. The dates have not been inputted consistently.
    >>>There are many false entries where people have put "16 Dec" and no year (it
    >>>should all be dd.mm.yy) or just "age 42" or rubbish like that. The
    >>>

    > analysis
    >
    >>>program I use just ignores all that, whereas Excel throws up an error.
    >>>
    >>>And I suspect your formula doesn't like the years in two-digit format
    >>>either...
    >>>
    >>>Thanks, but I did it using the analysis program in the end. Shame tho.
    >>>
    >>>Steve



+ Reply to Thread

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts

Search Engine Friendly URLs by vBSEO 3.6.0 RC 1