Skip to content
Advertisement

Scrape Goutte and waiting for full loading

I try to scrape via Goutte list from popular auction web service, but pieces of they code are rendered by javascript and problem is that Goutte return me only DOM without JS job. So if symphony are headless browser is it posiibile to wait with scraping until full content is rendered? I checked phantomjs where is normal, but I’ve never found that ability in Goute..

Advertisement

Answer

https://github.com/FriendsOfPHP/Goutte

Goutte does not parse Javascript, it is not a headless browser it just downloads the content and parse it without running javascript.

Technical Information

Goutte is a thin wrapper around the following fine PHP libraries:

Symfony Components: BrowserKit, CssSelector and DomCrawler;
Guzzle HTTP Component.

User contributions licensed under: CC BY-SA
10 People found this is helpful
Advertisement