+ Reply to Thread
Results 1 to 3 of 3

Parsing a word document into a csv or other machine readable format

  1. #1
    Registered User
    Join Date
    03-19-2021
    Location
    New York, USA
    MS-Off Ver
    Microsoft Office 2016
    Posts
    2

    Parsing a word document into a csv or other machine readable format

    Hi all,

    I have a non-delimited text file of health data called "2014_data.txt". This text file doesn't have any structure to it and the only way to structure it into columns and rows is to use the specified fixed length positions for each variable as indicated by the data dictionary "UserGuide2014_sample.docx".

    So basically I would like to transfer each variable's character fixed positions (e.g. variable MAGER is characters 75-76) into a csv or machine-readable format. All of the other information about the variable I don't really care.

    I want "UserGuide2014_sample.docx" -> properly formatted csv (it's ok if it's not PERFECT, I can always do manually edits, but as long as it takes some of the work out of it, because I have many files like this)

    Many thanks,
    David
    Attached Files Attached Files
    Last edited by dal2111; 03-19-2021 at 03:53 PM.

  2. #2
    Forum Expert Logit's Avatar
    Join Date
    12-23-2012
    Location
    North Carolina
    MS-Off Ver
    Excel 2019 Professional Plus - 2007 Enterprise
    Posts
    7,014

    Re: Parsing a word document into a csv or other machine readable format

    Is the following how you want the data in final presentation ?
    Attached Files Attached Files

  3. #3
    Registered User
    Join Date
    03-19-2021
    Location
    New York, USA
    MS-Off Ver
    Microsoft Office 2016
    Posts
    2

    Re: Parsing a word document into a csv or other machine readable format

    Hi Logit,

    Thanks for your help. Not exactly - it has to be according to the position as indicated by the "User_Guide_2014_sample.docx‎". For example, the first 9 characters to 16 characters should be the variable "Birth Year" or 2014. Characters 13 to 14 should be the Birth Month. Characters 19 to 22 should be Time of Birth, etc...

    Much appreciated

+ Reply to Thread

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Similar Threads

  1. Unable to do data parsing into readable and useful format in Office 2010
    By theo28 in forum Excel Formulas & Functions
    Replies: 6
    Last Post: 01-16-2014, 04:29 AM
  2. [SOLVED] How to import formatted Excel data into a readable Word document?
    By 12 Major Chords in forum Excel General
    Replies: 4
    Last Post: 01-06-2014, 12:59 PM
  3. Replies: 3
    Last Post: 02-27-2013, 01:54 PM
  4. Replies: 4
    Last Post: 08-31-2012, 11:52 AM
  5. Replies: 2
    Last Post: 01-14-2008, 09:41 PM
  6. Replies: 0
    Last Post: 06-23-2006, 01:15 PM
  7. send document from Excel to Word in readable form
    By BillyG in forum Excel General
    Replies: 4
    Last Post: 05-01-2006, 07:50 PM
  8. Retain text format from a WORD document
    By Willisbar in forum Excel - New Users/Basics
    Replies: 2
    Last Post: 02-09-2005, 11:06 PM

Tags for this Thread

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts

Search Engine Friendly URLs by vBSEO 3.6.0 RC 1