Semalt Expert Explains How To Scrape An AJAX Website Using Python

Page 1

23.05.2018

Semalt Expert Explains How To Scrape An AJAX Website Using Python

Web scraping is a method that employs the use of software to extract data from a web page. There are lots of tools to use for scraping the web with python, some of them being; Sky, Scrapy, Requests, and Beautiful Soup. However, most of these tools are limited by the fact that they only retrieve static HTML that comes from the server and not the dynamic part rendered by JavaScript. However, there are some techniques in which this problem can be overcome:

1. Automated Browsers You can make use of automated browsers such as Selenium or Splash which are full browsers that run headless. However, setting them up can be quite complex, and so we will focus on the second option below.

2. Intercept AJAX calls This involves trying to intercept the AJAX calls from the page and trying to replay or reproduce them. https://rankexperience.com/articles/article2137.html

1/2


Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.