Cleaning up data from scraping website URL

  1. #1
    Registered User
    Join Date
    03-27-2016
    Location
    UK
    MS-Off Ver
    2013
    Posts
    18

    Cleaning up data from scraping website URL

    Hi, I'm building a workbook that scrapes multiple URLs using macros in Excel 2013.

    The problem is the way the website is constructed: it only lets me select the entire web page.
    That then gets dumped into a worksheet along with all the links and loads of other stuff I don't need.

    I only want to keep two parts: the values on the lines directly below two key headers, Last Activity and Join Date.

    Is there any way to select just these two parts? I know Data > From Web will not let me select just those fields, which is a shame.

    Is there any way to clean out all the stuff I don't want, or is there another way to go about it to get just this data?

    Thanks in advance.

  2. #2
    Registered User
    Join Date
    01-30-2016
    Location
    Legazpi City, Philippines
    MS-Off Ver
    2013
    Posts
    1

    Re: Cleaning up data from scraping website URL

    Have you tried Microsoft Power Query? It can be downloaded free from Microsoft.

  3. #3
    Registered User
    Join Date
    03-27-2016
    Location
    UK
    MS-Off Ver
    2013
    Posts
    18

    Re: Cleaning up data from scraping website URL

    Thank you, I will check it out now and get back to you if I have any problems.

    Cheers

  4. #4
    Registered User
    Join Date
    03-27-2016
    Location
    UK
    MS-Off Ver
    2013
    Posts
    18

    Re: Cleaning up data from scraping website URL

    I'm not liking it, to be honest, but it is probably the best thing to use.

    I just need a little more advice and guidance.

    I have a list of URL endings (page names) in the first column of a spreadsheet and a name in the second column.

    Page | Name | Join Date | Activity
    user1.html | User1 | |
    user2.html | User2 | |
    user3.html | User3 | |

    The base URL is the same for every page (http://blah.com).

    I'm trying to extract Join Date and Activity from the base URL + the page name in A1-A5 (down to the blank line), either when a refresh button is clicked or when the worksheet is opened (a button is preferred).
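
    As a rough illustration of the "URL + column A" idea, here is a minimal Python sketch (not Excel VBA or Power Query) that joins the shared base address to each page name. The function name build_url and the hard-coded page list are assumptions for illustration only; in the workbook the page names would come from column A.

```python
# Placeholder base address taken from the post; the real site is not given.
BASE_URL = "http://blah.com"


def build_url(base, page):
    """Join the shared base address and one page name with a single slash."""
    return base.rstrip("/") + "/" + page.lstrip("/")


# Hypothetical stand-in for the page names listed in column A.
pages = ["user1.html", "user2.html", "user3.html"]

# One full URL per row, in the same order as the rows.
urls = [build_url(BASE_URL, page) for page in pages]

for url in urls:
    print(url)
```

    Stripping the slashes on both sides means the result has exactly one separator whether or not the base address was typed with a trailing slash.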

    The dates are within the HTML <dd> tags at index (0) and (1).

    Is Power Query able to do this?

    I've managed to find where the Join Date and Activity are when viewing the page in Power Query, after expanding 20 or so columns.

    If you could help further I'd really appreciate it.

    Thank you

  5. #5
    Registered User
    Join Date
    03-27-2016
    Location
    UK
    MS-Off Ver
    2013
    Posts
    18

    Re: Cleaning up data from scraping website URL

    This is what I get when I put the website in.

    [Attached image: 1.png]

    I know it does not say 'Table 0' or whatever. The data is like this:

    <div>
    <dl>
    <dt>Last Activity</dt>
    <dd>26.10.15 <span>00:26</span></dd>
    </dl>
    </div>
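
    To illustrate how values could be pulled out of markup shaped like the above, here is a small Python sketch using only the standard library's html.parser. It pairs each <dt> label with the <dd> that follows it. The class name, the sample HTML string, and the Join Date value in it are assumptions made up for the example; only the Last Activity line mirrors the post.

```python
from html.parser import HTMLParser


class DefinitionListParser(HTMLParser):
    """Collect {dt text: dd text} pairs from dt/dd elements."""

    def __init__(self):
        super().__init__()
        self.pairs = {}
        self._tag = None     # "dt" or "dd" while inside one, else None
        self._label = None   # most recent dt text still waiting for its dd
        self._buf = []       # text fragments seen inside the current dt/dd

    def handle_starttag(self, tag, attrs):
        if tag in ("dt", "dd"):
            self._tag = tag
            self._buf = []

    def handle_data(self, data):
        # Keep text only while inside a dt or dd (nested spans included).
        if self._tag:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag != self._tag:
            return  # e.g. the closing </span> inside a dd
        text = " ".join("".join(self._buf).split())  # collapse whitespace
        if tag == "dt":
            self._label = text
        elif self._label is not None:
            self.pairs[self._label] = text
            self._label = None
        self._tag = None


# Made-up sample page fragment; the Join Date value is invented.
SAMPLE_HTML = """
<div><dl>
<dt>Last Activity</dt>
<dd>26.10.15 <span>00:26</span></dd>
<dt>Join Date</dt>
<dd>27.03.16</dd>
</dl></div>
"""

parser = DefinitionListParser()
parser.feed(SAMPLE_HTML)
parser.close()
pairs = parser.pairs

print(pairs["Last Activity"])
print(pairs["Join Date"])
```

    Looking the values up by their label rather than by position means the result should not change if the page lists the two fields in a different order.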

    The site is a vBulletin profile page.

    Then, once I've loaded 'Document', it comes to this:

    [Attached image: 2.png]

    The data is in there somewhere. I just need to know how to extract it specifically, put it into the worksheet as described in my previous post, and then loop on to the next page listed in the worksheet.

    Many thanks. Hope the pictures help in some sort of way.

    Cheers
    Last edited by king0r1; 03-27-2016 at 08:44 AM.
