Closed Thread
Results 1 to 4 of 4

Compare lists of URLs for duplicates, and remove duplicates

  1. #1
    Registered User
    Join Date
    06-13-2018
    Location
    Perth, Australia
    MS-Off Ver
    Office 365
    Posts
    1

    Compare lists of URLs for duplicates, and remove duplicates

    Hi everyone,

    First post here!

    For work at the moment I need to sort through a bunch of URLs, which may have duplicates in them in different formats, eg:

    They may start with:

    http
    https
    have www
    not have www
    have a trailing slash /
    have a trailing slash with a # at the end, eg. /#

    (Sorry not to be more specific, can't post links yet)

    These could sometimes be very long and URLs in general are all different lengths, but the bits I want to match should always ignore the final 2 characters in the url, and everything before the root domain on the site, eg. http, www, etc.

    I'd like to look for and delete duplicates, while still retaining one instance of the URL - which one doesn't matter so much, as they all the different variations point to the same content anyway.

    Is there any way, or any functions in excel I can use to make this as easy and hands-free as possible?

    Thanks!

  2. #2
    Banned User!
    Join Date
    02-05-2015
    Location
    San Escobar
    MS-Off Ver
    any on PC except 365
    Posts
    12,168

    Re: Compare lists of URLs for duplicates, and remove duplicates

    try to use PowerQuery aka Get&Transform (Data tab, Get&Transform section) which change urls to text, split url, then remove duplicates from appropriate column(s), after that merge columns with appropriate delimiteres and then compare result to source list (because you want working urls not a text - I suppose) - this is idea only but attached example excel file would help
    Last edited by sandy666; 06-13-2018 at 05:36 AM.

  3. #3
    Registered User
    Join Date
    05-25-2021
    Location
    nagar
    MS-Off Ver
    2018
    Posts
    3

    Re: Compare lists of URLs for duplicates, and remove duplicates

    First post here!

    For work at the moment I need to sort through a bunch of URLs, which may have duplicates in them in different formats, eg:

    They may start with:

    http
    https
    have www
    not have www
    have a trailing slash /
    have a trailing slash with a # at the end, eg. /#

  4. #4
    Forum Moderator AliGW's Avatar
    Join Date
    08-10-2013
    Location
    Retired in Ipswich, Suffolk, but grew up in Sawley, Derbyshire (England)
    MS-Off Ver
    MS 365 Subscription Insider Beta Channel v. 2404 (Windows 11 22H2 64-bit)
    Posts
    80,647

    Re: Compare lists of URLs for duplicates, and remove duplicates

    Administrative Note:

    Welcome to the forum.

    We are happy to help, however whilst you feel your request is similar to this thread, experience has shown that things soon get confusing when answers refer to particular cells/ranges/sheets which are unique to your post and not relevant to the original.

    Please see Forum Rule #4 about hijacking and start a new thread for your query.

    If you are not familiar with how to start a new thread see the FAQ: How to start a new thread
    Ali


    Enthusiastic self-taught user of MS Excel who's always learning!
    Don't forget to say "thank you" in your thread to anyone who has offered you help.
    You can reward them by clicking on * Add Reputation below their user name on the left, if you wish.

    Forum Rules (updated August 2023): please read them here.

Closed Thread

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Similar Threads

  1. Compare columns and remove duplicates
    By Goodstart14 in forum Excel Programming / VBA / Macros
    Replies: 2
    Last Post: 07-21-2015, 10:59 AM
  2. compare two short lists for duplicates
    By stockgoblin42 in forum Excel Formulas & Functions
    Replies: 4
    Last Post: 08-25-2013, 08:18 AM
  3. Remove duplicates from both Lists
    By Sircrayon in forum Excel General
    Replies: 1
    Last Post: 09-16-2010, 10:38 AM
  4. How to compare two columns and remove duplicates?
    By username123 in forum Excel General
    Replies: 15
    Last Post: 07-05-2006, 11:06 AM
  5. how do i compare two columns and remove duplicates?
    By aljernon805 in forum Excel - New Users/Basics
    Replies: 1
    Last Post: 12-09-2005, 12:10 PM
  6. [SOLVED] compare two columns and remove duplicates
    By Moni39 in forum Excel Formulas & Functions
    Replies: 3
    Last Post: 05-05-2005, 02:06 PM
  7. How do I compare 2 lists and show duplicates as a new list?
    By AnaBannana in forum Excel Programming / VBA / Macros
    Replies: 3
    Last Post: 01-07-2005, 12:06 PM

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts

Search Engine Friendly URLs by vBSEO 3.6.0 RC 1