I am currently using my script to extract information from various webpages with very similar layout and type of information, som examples;
http://faktaark.naturbase.no/naturtype?id=BN00058122
http://faktaark.naturbase.no/naturtype?id=BN00087722
http://faktaark.naturbase.no/naturtype?id=BN00067722
So I made this script;
The issue here is that I have coded it to extract the information from "dd" at the 22end line BUT I just noticed thatPlease Login or Register to view this content.
its not always the same. On other links the wanted data might be at "dd" at the 21st line! Which is a big problem.
Also the paragraph with text that I am wanting does not seem to have a name, see picture;
dd.noname.JPG
Can anyone tell me how I can always select the appropiate dd without saying which line its on, if its possible?
Thanks
Bookmarks