+ Reply to Thread
Results 1 to 15 of 15

How to extract all the URLs from this list with excel?

  1. #1
    Registered User
    Join Date
    11-25-2010
    Location
    Australia
    MS-Off Ver
    Excel 2007
    Posts
    82

    How to extract all the URLs from this list with excel?

    Hi Guys,

    I want to extract all the URLs from this sitemap: http://tinyurl.com/zhhd2vj

    The URLs are between a lot of different text which makes it tricky.

    The URLs start with https:// and then end with /

    Example:

    screenshot.2051.jpg

    I have tried using text to columns with excel but have had no luck.

    Any ideas on other ways to do this?

    I have uploaded the sitemap in notepad format at this link if it helps: http://www.filedropper.com/sitemapdata

    Thanks,
    Chris

  2. #2
    Administrator FDibbins's Avatar
    Join Date
    12-29-2011
    Location
    Duncansville, PA USA
    MS-Off Ver
    Excel 7/10/13/16/365 (PC ver 2310)
    Posts
    52,944

    Re: How to extract all the URLs from this list with excel?

    Please upload your file to the forum. Not all members are able - or willing - to access file-hosting sites
    1. Use code tags for VBA. [code] Your Code [/code] (or use the # button)
    2. If your question is resolved, mark it SOLVED using the thread tools
    3. Click on the star if you think someone helped you

    Regards
    Ford

  3. #3
    Registered User
    Join Date
    11-25-2010
    Location
    Australia
    MS-Off Ver
    Excel 2007
    Posts
    82

    Re: How to extract all the URLs from this list with excel?

    Sorry attached it here.

    Thankyou.
    Attached Files Attached Files

  4. #4
    Forum Expert bebo021999's Avatar
    Join Date
    07-22-2011
    Location
    Vietnam
    MS-Off Ver
    Excel 2016
    Posts
    9,481

    Re: How to extract all the URLs from this list with excel?

    Could you paste it into a worksheet?
    We could, but we want to follow your desired layout.
    Quang PT

  5. #5
    Forum Expert
    Join Date
    08-12-2012
    Location
    Sydney, Australia
    MS-Off Ver
    Excel 2010
    Posts
    5,636

    Re: How to extract all the URLs from this list with excel?

    is this what you desire as output?
    Attached Files Attached Files
    If you are satisfied with the solution(s) provided, please mark your thread as Solved.
    Select Thread Tools-> Mark thread as Solved. To undo, select Thread Tools-> Mark thread as Unsolved.

  6. #6
    Registered User
    Join Date
    11-25-2010
    Location
    Australia
    MS-Off Ver
    Excel 2007
    Posts
    82

    Re: How to extract all the URLs from this list with excel?

    Hi, excel sheet attached.

    Thankyou.
    Attached Files Attached Files

  7. #7
    Registered User
    Join Date
    11-25-2010
    Location
    Australia
    MS-Off Ver
    Excel 2007
    Posts
    82

    Re: How to extract all the URLs from this list with excel?

    Quote Originally Posted by humdingaling View Post
    is this what you desire as output?
    yes that is correct!

  8. #8
    Administrator FDibbins's Avatar
    Join Date
    12-29-2011
    Location
    Duncansville, PA USA
    MS-Off Ver
    Excel 7/10/13/16/365 (PC ver 2310)
    Posts
    52,944

    Re: How to extract all the URLs from this list with excel?

    I dont see any easy formula way to do this, that is a real mess. However, here is a bit of a work around that does work, but is going to take some work from you.

    1. Copy/paste 5-5 rows of that data into excel (excel wont work with more than 8000~ characters, so keep that in mind)
    2. repeat that until you have copied over all your data
    3. Use Find/Replace with FIND http and REPLACE with &http
    4 use Text2Columns using & as the delimiter

  9. #9
    Administrator FDibbins's Avatar
    Join Date
    12-29-2011
    Location
    Duncansville, PA USA
    MS-Off Ver
    Excel 7/10/13/16/365 (PC ver 2310)
    Posts
    52,944

    Re: How to extract all the URLs from this list with excel?

    @ humding, how did you do that?

  10. #10
    Forum Expert
    Join Date
    08-12-2012
    Location
    Sydney, Australia
    MS-Off Ver
    Excel 2010
    Posts
    5,636

    Re: How to extract all the URLs from this list with excel?

    its not quite excel but here is what i did ...bear with me its wordy

    open file in notepad or wordpad or whatever word processor you want

    Find the timestamp information
    in this case it was "daily2016-01-29T14:35:01+11:00"
    find and replace with blank
    then save File

    open this file into excel and it should all be in cell
    text to column using ":" delimiter
    discard the first https and copy and paste special>>transpose

    then find and replace "https" with blank

    after which you can add "https:" & cell value to combine into the URL
    copy and paste values and then bobs your uncle, done

    ps need some minor touch up at the last entry
    Last edited by humdingaling; 01-29-2016 at 01:22 AM.

  11. #11
    Administrator FDibbins's Avatar
    Join Date
    12-29-2011
    Location
    Duncansville, PA USA
    MS-Off Ver
    Excel 7/10/13/16/365 (PC ver 2310)
    Posts
    52,944

    Re: How to extract all the URLs from this list with excel?

    OK thanks, mine was kinda a messy work around as well

  12. #12
    Forum Expert bebo021999's Avatar
    Join Date
    07-22-2011
    Location
    Vietnam
    MS-Off Ver
    Excel 2016
    Posts
    9,481

    Re: How to extract all the URLs from this list with excel?

    Quote Originally Posted by humdingaling View Post
    in this case it was "daily2016-01-29T14:35:01+11:00"
    find and replace with blank
    then save File
    Expanded issue:
    What if it was not fixed?
    It should be "2016-01-29" or "2016-01-30", if data is downloaded from 2 days?

  13. #13
    Forum Expert
    Join Date
    08-12-2012
    Location
    Sydney, Australia
    MS-Off Ver
    Excel 2010
    Posts
    5,636

    Re: How to extract all the URLs from this list with excel?

    OP can correct me if i am wrong

    from what i can gather, this is a time stamp of when the document was downloaded
    meaning it would be the same throughout the document in question

  14. #14
    Forum Expert
    Join Date
    08-12-2012
    Location
    Sydney, Australia
    MS-Off Ver
    Excel 2010
    Posts
    5,636

    Re: How to extract all the URLs from this list with excel?

    ok so i found a better way of doing it instead without the need to worry about the timestamp being consistent

    open file in MS WORD
    FIND "https:"
    replace all with "^lhttps:"

    ^l = manual Line Break

    what you end up with is a line break for every single URL (attached text file after doing this)

    after which you can use a formula to delete everything after the last "/"
    Formula: copy to clipboard
    Please Login or Register  to view this content.
    Attached Files Attached Files
    Last edited by humdingaling; 01-29-2016 at 01:56 AM.

  15. #15
    Forum Expert bebo021999's Avatar
    Join Date
    07-22-2011
    Location
    Vietnam
    MS-Off Ver
    Excel 2016
    Posts
    9,481

    Re: How to extract all the URLs from this list with excel?

    Thx humdingaling

    I try to open a blank word document, then open that txt file
    Then copy/paste to excel.
    It works for me, without replacing "http"

    OK. I've learnt a lot from you, thx again.

+ Reply to Thread

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Similar Threads

  1. Extract Specific Data From URLs
    By censo in forum Excel Formulas & Functions
    Replies: 6
    Last Post: 01-08-2016, 12:41 AM
  2. Extract xml elements from mutiple urls into Excel
    By superdesi in forum Excel Programming / VBA / Macros
    Replies: 2
    Last Post: 06-17-2015, 01:42 PM
  3. [SOLVED] Scraping Data from a URLs List Into Excel Using VBA
    By mayaincaztec in forum Excel Programming / VBA / Macros
    Replies: 8
    Last Post: 03-20-2015, 09:12 AM
  4. Replies: 4
    Last Post: 07-28-2014, 05:37 PM
  5. EXCEL VBA: web query from a list of URLs within cells
    By IDONTTELLYOU in forum Excel Programming / VBA / Macros
    Replies: 1
    Last Post: 09-11-2012, 01:10 AM
  6. Loop thru URLs to extract web data
    By trwlch in forum Excel Programming / VBA / Macros
    Replies: 1
    Last Post: 02-20-2012, 04:26 AM
  7. Making hyperlinks to a whole list of urls in excel
    By max11 in forum Excel - New Users/Basics
    Replies: 2
    Last Post: 09-03-2007, 12:24 PM

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts

Search Engine Friendly URLs by vBSEO 3.6.0 RC 1