'VBA options to scrape contents from currently loaded webpage in edge (or IE), or Excel directly?
Background: Many years ago, I used Excel VBA to scrape some dynamic web pages; I had to scrape it from IE after it was loaded (the page wasn't directly accessible from Excel without a lot of sendkeys & password hassles). I find myself in a similar situation and am wondering if MS tools have progressed in recent years to provide an easier way.
Situation: I'm not a UI person, haven't looked at the page code yet to untangle anything; I have a dynamic, visual org chart where each person has a box/tile (image, title, name, etc) and the whole chart changes depending on who is selected. The page is served from a passworded SAAS, so I'm not excited about trying to access the site directly with Excel - I think the right answer is still to scrape what is already loaded in the browser.
The core problem is formatting - I need to be able to move the boxes around (closer together, and change the layout a little) so it fits better on powerpoint slides.
Since it has been a while since the last time I did this, I'm just posting to see what I should know before starting: am I stuck with IE, or does Edge allow other programs to access the loaded pages via VBA? Do any of the 'data source' options in Excel do a good job of parsing visual page content in a way that would make it easier to scrape? Is there a completely different method I should learn?
Thank you in advance for any advice!
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
| Solution | Source |
|---|
