How do I import in from power query the pages that has multiple pages
From webpage I insert url , I click inside the table, but import the first page only in power query
Can anyone help how to import all the pages
How do I import in from power query the pages that has multiple pages
From webpage I insert url , I click inside the table, but import the first page only in power query
Can anyone help how to import all the pages
might be helpful if you posted URL and page you wished to retrieve so that we can test our potential solution.
Alan עַם יִשְׂרָאֵל חַי
Change an Ugly Report with Power Query
Database Normalization
Complete Guide to Power Query
Man's Mind Stretched to New Dimensions Never Returns to Its Original Form
It will depend on "How" the website is constructed.
There is no one size fits all step/code that will handle all web sites. Each site has it's own structure, and you'll need to change your query to accommodate.
First step is to use "Developer Tool" or something similar to trace what requests are sent to URL when data is populated.
There are couple of common method for your case.
1. Request to URL sends back JSON, XML or other formatted data and that's parsed by java, ASP .NET or some other means to populate data (i.e. all data in single request).
2. Each page is separate request and request has page # as parameter. (i.e. data in separate requests)
3. Only end-point accessible is via API. How to set up will depend on authentication method, along with other header requirements (if any) set by the site.
Each will require significantly different method to parse query. I'd recommend that you supply URL of the site that you are trying to scrape (if it does not require login. i.e. public).
If it is private... it would help if you can trace the request string that's being sent to URL to get the data.
?Progress isn't made by early risers. It's made by lazy men trying to find easier ways to do something.?
― Robert A. Heinlein
Below web page but however need to extract all pages storage guide table
http://www.chcl.mu/info/?id=30&page=1
Last edited by JEAN1972; 10-01-2019 at 01:23 PM. Reason: typo
Ok, here's what you could do since it's pattern 2 (each page is separate request).
Personally, since it has total of 115 pages. I'd recommend breaking it apart (ex: do 12 to 15 at a time).
Sample is for first 12 pages.
Please Login or Register to view this content.
FYI - If you want to delay your query steps (i.e. finish first 12 then move to next 12 etc), since 115 query executed concurrently may get you blocked by site's security measures...
You'll need to combine filtering operation with timing delay.
https://blog.crossjoin.co.uk/2014/11...n-power-query/
To measure how long it takes for one set of query to finish...
https://blog.crossjoin.co.uk/2014/11...query-queries/
thanks for assistance , have a try and some reading .
You are welcome and thanks for the rep.
I have another site where I need to extract data either
option 1. from date to date through out date pickler
option 2 by vessel name
there are other combination but I prefer to use either option 1 or option 2
but in doing a query i got message unable to connect
Details: "The request was aborted: Could not create SSL/TLS secure channel."
However, when using for first time it asks me to use my windows credential , I put the password I use .but it fails.
my goal is either to make query either by date range or vessel name only .
Needs assistance
url
https://tradenetmns.mu/MASTActionSer...anifestSea.xml
Your error message indicates issue with SSL/TLS. It's likely that you are using Windows 7 and is on TLS 1.0 instead of current standard TLS 1.2
You may need to enable TLS 1.2 by editing registry key. Or install .NET 4.5 runtime.
Have a careful read of...
https://stackoverflow.com/questions/...col-in-net-4-5
I'm afraid I can't help you further with this one as it requires login.
I am using windows 8.1 and windows 10 on 2 different laptops.but I have no issue .
Just using power query give me that message .
However thanks as this one is a bit complicated using login.
Last edited by JEAN1972; 10-02-2019 at 10:53 AM. Reason: typo
No you wouldn't have issue with browser. As all modern browsers should be updated to current standard (IE 11 not included).
The error message you are receiving is due to SSL/TSL using older non-secure protocol.
There are couple of cases where this error is raised in PowerQuery.
1. .NET 4.0 is used rather than .NET 4.5 (4.0 uses TSL 1.0 by default)
2. Registry Key is set to use older protocol as default rather than new one (this is always the case for 4.0, but in rare instance, may be true for 4.5 as well)
The site complies with GDPR, and definitely not using TSL 1.0.
Last edited by CK76; 10-02-2019 at 10:55 AM.
There are currently 1 users browsing this thread. (0 members and 1 guests)
Bookmarks