I try to scrape via Goutte list from popular auction web service, but pieces of they code are rendered by javascript and problem is that Goutte return me only DOM without JS job. So if symphony are headless browser is it posiibile to wait with scraping until full content is rendered? I checked phantomjs where is normal, but I’ve never found that ability in Goute..
Advertisement
Answer
https://github.com/FriendsOfPHP/Goutte
Goutte does not parse Javascript, it is not a headless browser it just downloads the content and parse it without running javascript.
Technical Information
Goutte is a thin wrapper around the following fine PHP libraries:
Symfony Components: BrowserKit, CssSelector and DomCrawler;
Guzzle HTTP Component.