
Tag: web-crawler

In Symfony/Panther, waitFor() throws an exception when it times out while scraping; I need it to continue if the item is not found

I have a database of clinics, with a URL for each clinic. All clinic pages share the same HTML/CSS and differ only in the content to scrape. However, some clinics have no content on their page, and this causes trouble for me. I have: If .facility is not present, waitFor() throws an exception because of the timeout. I need to

How to crawl page in PHP?

I get the error "error code: 1020". The page I'm trying to crawl for form data is https://v2.gcchmc.org/medical-status-search/. This is my code: $initial = file_get_contents('https://v2.gcchmc.org/medical-status-search/'); $check = preg_replace('/.+?input type="hidden" name="csrfmiddlewaretoken" value="(.+?)".*/sim', '$1', $initial); print $check; Can you help me find what's wrong with this code?
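As a minimal sketch of the extraction step only, preg_match with a capture group is one way to pull the hidden csrfmiddlewaretoken value out of an HTML string and to get null back when the field is absent. The function name extractCsrfToken and the sample markup are hypothetical, and the pattern assumes the name attribute appears before value. It does not address the 1020 response itself.

```php
<?php
// Hypothetical helper: returns the CSRF token found in $html, or null if none.
// Assumes the name attribute comes before the value attribute in the <input> tag.
function extractCsrfToken(string $html): ?string
{
    $pattern = '/<input[^>]*name="csrfmiddlewaretoken"[^>]*value="([^"]+)"/i';
    if (preg_match($pattern, $html, $matches) === 1) {
        return $matches[1];
    }
    return null;
}

// Sample markup standing in for the fetched page (an assumption, not the real page).
$sample = '<form><input type="hidden" name="csrfmiddlewaretoken" value="abc123"></form>';

echo extractCsrfToken($sample), PHP_EOL;
// A body that contains no token field yields null instead of echoing the whole page.
var_dump(extractCsrfToken('error code: 1020'));
```

Unlike preg_replace, which returns the input unchanged when nothing matches, preg_match makes the "no token found" case explicit.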

How to get price value with regular expressions

I am trying to write a crawler for an online store, and now I need to get the price value from the webpage. Here is my attempt: Basically, $html holds the source code of the webpage, and the price value is stored in the document like this: <div class="c-product__seller-price-pure js-price-value">10,699,000</div> But when I run this I get this as result:
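One possible approach, sketched under the assumption that the price always sits in a div carrying the js-price-value class as in the snippet above, is to capture the digits and commas with preg_match and then strip the thousands separators. The function name extractPrice is hypothetical.

```php
<?php
// Hypothetical helper: returns the price as an integer, or null if no price div is found.
function extractPrice(string $html): ?int
{
    // Match a div whose class list contains js-price-value and capture digits and commas.
    $pattern = '/<div[^>]*class="[^"]*\bjs-price-value\b[^"]*"[^>]*>\s*([\d,]+)\s*<\/div>/';
    if (preg_match($pattern, $html, $matches) !== 1) {
        return null;
    }
    // Remove the thousands separators before converting to an integer.
    return (int) str_replace(',', '', $matches[1]);
}

// The markup quoted in the question.
$html = '<div class="c-product__seller-price-pure js-price-value">10,699,000</div>';

var_dump(extractPrice($html)); // int(10699000)
```

For anything beyond a single well-known element, an HTML parser such as DOMDocument is usually more robust than a regular expression.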

PHP code for moving the cursor using the Twitter API

So I already have a script that collects the first 4999 follower IDs of a Twitter user using the API in XML format. I partly understand how the cursor process works, but I am confused about how to implement it in a loop that runs until it has gathered all the followers. The user I am attempting to gather will take about 8 calls. Any
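The cursor loop can be sketched as follows, assuming each page of results carries a list of IDs plus a next_cursor value, that the first request uses a cursor of -1, and that a next_cursor of 0 marks the last page. The function collectAllFollowerIds, the $fetchPage callable, and the fake pages are all hypothetical stand-ins; the real API call and XML parsing would live inside the callable.

```php
<?php
// Hypothetical helper: keeps requesting pages until next_cursor is "0".
// $fetchPage takes a cursor string and returns ['ids' => [...], 'next_cursor' => ...].
function collectAllFollowerIds(callable $fetchPage): array
{
    $ids = [];
    $cursor = '-1'; // assumed starting cursor; kept as a string to avoid integer overflow
    do {
        $page = $fetchPage($cursor);
        $ids = array_merge($ids, $page['ids']);
        $cursor = (string) $page['next_cursor'];
    } while ($cursor !== '0');

    return $ids;
}

// Fake three-page response used in place of real API calls.
$fakePages = [
    '-1' => ['ids' => [1, 2], 'next_cursor' => '10'],
    '10' => ['ids' => [3, 4], 'next_cursor' => '20'],
    '20' => ['ids' => [5], 'next_cursor' => '0'],
];
$fetchFake = function (string $cursor) use ($fakePages): array {
    return $fakePages[$cursor];
};

$all = collectAllFollowerIds($fetchFake);
echo count($all), PHP_EOL; // 5
```

Passing the fetch step in as a callable keeps the loop itself independent of how the request is made, so it can be checked without network access.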
